"megatron/legacy/model/vision/utils.py" did not exist on "4554c3fed9a5b7daa5f564c84c71b8c689ba4f02"
[sgl-kernel] Opt per_token_quant_fp8 with warp reduce (#8130)
Co-authored-by:
luoyuan.luo <luoyuan.luo@antgroup.com>
Showing
Please register or sign in to comment