Merge branch 'fused_softmax_kernel_fixes' into 'main'
Support all masks in the fused kernel and avoid in-place operations in the backward pass.

See merge request ADLR/megatron-lm!435