[shardformer/sequence parallel] not support opt of seq-parallel, add warning and fix a bug in gpt2 pp (#4488)