[Bugfix] Disable cascade attention with FlashInfer (#26130)
Signed-off-by:mgoin <mgoin64@gmail.com> Signed-off-by:
Michael Goin <mgoin64@gmail.com> Co-authored-by:
gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by:
simon-mo <simon.mo@hey.com>
Showing
Please register or sign in to comment