Flash-Attn: fix generation when no attention mask or no padding (#32241)
* fix
* fix prev test (half of failures)
* [run-slow] llama, gemma2
* [run-slow] llama, gemma2