Merge branch 'sequence_parallel' into 'main'
fix for sequence parallelism in bert pooling See merge request ADLR/megatron-lm!418
Showing
Please register or sign in to comment
fix for sequence parallelism in bert pooling See merge request ADLR/megatron-lm!418