"vscode:/vscode.git/clone" did not exist on "13c46302a8569990916daa78821d884814e19f84"
Optimize Qwen3-moe model by using flashinfer fused allreduce (#9973)
Co-authored-by:
luoyuan.luo <luoyuan.luo@antgroup.com>
Showing
Please register or sign in to comment