pass a_scale from the fp8 quant result instead of hard-coding it to 1.0f (#10241)
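
For context, here is a minimal sketch of why the activation scale has to come from the quantization step. It assumes per-tensor scaling and uses plain Python lists in place of the real FP8 kernel. The helper names (`quantize_per_tensor`, `scaled_dot`) are hypothetical and are not taken from this change; rounding to FP8 is omitted so that only the effect of the scale is visible.

```python
# Illustrative only: per-tensor scaling is an assumption, not confirmed by the commit.
FP8_E4M3_MAX = 448.0  # largest finite magnitude in the FP8 E4M3 format


def quantize_per_tensor(values):
    """Return (scaled_values, scale) such that scaled_values * scale ~= values."""
    amax = max(abs(v) for v in values)
    scale = amax / FP8_E4M3_MAX if amax > 0 else 1.0
    # A real kernel would also round to FP8 here; this sketch only rescales and clamps.
    scaled = [max(-FP8_E4M3_MAX, min(FP8_E4M3_MAX, v / scale)) for v in values]
    return scaled, scale


def scaled_dot(a_q, b_q, a_scale, b_scale):
    """Dot product of quantized inputs, rescaled back to the original range."""
    acc = sum(x * y for x, y in zip(a_q, b_q))
    return acc * a_scale * b_scale


a = [0.5, -2.0, 1.25]   # stand-in for activations
b = [4.0, 0.25, -1.0]   # stand-in for weights
reference = sum(x * y for x, y in zip(a, b))

a_q, a_scale = quantize_per_tensor(a)
b_q, b_scale = quantize_per_tensor(b)

# Using the scale returned by quantization recovers the reference value.
correct = scaled_dot(a_q, b_q, a_scale, b_scale)

# Hard-coding a_scale to 1.0 leaves the result off by a factor of 1 / a_scale.
hard_coded = scaled_dot(a_q, b_q, 1.0, b_scale)
```

In this sketch the hard-coded result is only correct when the computed scale happens to equal 1.0, which is the failure mode the commit title describes.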
Co-authored-by: Yichen Wang <yichen.wang@bytedance.com>
Co-authored-by: Jinwu Guo <641876696@qq.com>