"examples/git@developer.sourcefind.cn:OpenDAS/ollama.git" did not exist on "267e25a750cb2e44e48206408f96d60b3dabc0b9"
Disable tp for shared experts under expert parallelism for GLM4.5 model (#8647) (#8647)
Co-authored-by:Stefan He <hebiaobuaa@gmail.com> Co-authored-by:
Cheng Wan <54331508+ch-wan@users.noreply.github.com>
Showing
Please register or sign in to comment