use sglang_per_token_group_quant_fp8 from sgl-kernel instead of trion kernel (#5473)
Co-authored-by:
Zhang Kaihong <zhangkaihong.zkh@alibaba-inc.com>
Showing
Please register or sign in to comment
Co-authored-by:
Zhang Kaihong <zhangkaihong.zkh@alibaba-inc.com>