Merge branch 'flash_attention_inference' into 'main'
Flash Attention inference fix

See merge request ADLR/megatron-lm!562