[Perf] Using `__nv_fp8_e4m3` instead of `c10::e4m3` for `per_token_group_quant` (#21867)
Signed-off-by: yewentao256 <zhyanwentao@126.com>