[sgl-kernel performace] fix fp8 quant kernels dispatch __nv_fp8_e4m3 bug to...
[sgl-kernel performace] fix fp8 quant kernels dispatch __nv_fp8_e4m3 bug to improve performance 10%-20% (#8499)
Co-authored-by:
Ke Bao <ispobaoke@gmail.com>
Showing
Please register or sign in to comment