Merge branch 'tridao-flashattn' into 'main'
Integrate FlashAttention into Megatron-LM

See merge request ADLR/megatron-lm!488