[Core][Kernels] Enable FP8 KV Cache with Flashinfer backend. + BugFix for kv_cache_dtype=auto (#7985)

Co-authored-by: Simon Mo <simon.mo@hey.com>
Co-authored-by: Cody Yu <hao.yu.cody@gmail.com>