"vscode:/vscode.git/clone" did not exist on "f27ea94ecb6578dc99ef2fe14a3fb1ee158982d6"
Optimize Qwen3-moe model by using flashinfer fused allreduce (#9973)
Co-authored-by:
luoyuan.luo <luoyuan.luo@antgroup.com>
Showing
Please register or sign in to comment