[BUG] fix crash on flashinfer backend with cudagraph disabled, when attention...
[BUG] fix crash on flashinfer backend with cudagraph disabled, when attention group_size not in [1,2,4,8] (#7509)
Showing
Please register or sign in to comment