Merge branch 'vijay/diff_query_key_lengths' into 'main'
Support for different query and key sequence lengths

See merge request ADLR/megatron-lm!151