Add the return_softmax_lse parameter to the flash_attn_with_kvcache function...
Add the return_softmax_lse parameter to the flash_attn_with_kvcache function to allow returning the logsumexp of the attention scores. (#989)
Showing
Please register or sign in to comment