"vscode:/vscode.git/clone" did not exist on "927e3ba4a45115c015a4cb06a06eb73e8715484a"
Disable tp for shared experts under expert parallelism for GLM4.5 model (#8647) (#8647)
Co-authored-by:Stefan He <hebiaobuaa@gmail.com> Co-authored-by:
Cheng Wan <54331508+ch-wan@users.noreply.github.com>
Showing
Please register or sign in to comment