[Bugfix] Enable FP8 KV cache for FlashInfer and Triton backend on non-sm100 GPUs (#24577)
Signed-off-by: Thien Tran <gau.nernst@yahoo.com.sg>