[Core/Bugfix] Add FP8 K/V Scale and dtype conversion for prefix/prefill Triton Kernel (#7208)
Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>
Showing
Please register or sign in to comment
Co-authored-by:
Cody Yu <hao.yu.cody@gmail.com>