-
Xiaoyu Zhang authored
[sgl-kernel performace] fix fp8 quant kernels dispatch __nv_fp8_e4m3 bug to improve performance 10%-20% (#8499) Co-authored-by:Ke Bao <ispobaoke@gmail.com>
7a4309cc
[sgl-kernel performace] fix fp8 quant kernels dispatch __nv_fp8_e4m3 bug to improve performance 10%-20% (#8499)
Co-authored-by:
Ke Bao <ispobaoke@gmail.com>