"vscode:/vscode.git/clone" did not exist on "0ddfaa0b37399ebe634760b584f25f6fed70a0b7"
- 07 Feb, 2024 3 commits
-
-
Hongxin Liu authored
-
Hongxin Liu authored
* [moe] add mixtral block for single expert * [moe] mixtral block fwd support uneven ep * [moe] mixtral block bwd support uneven ep * [moe] add mixtral moe layer * [moe] simplify replace * [meo] support save sharded mixtral * [meo] support load sharded mixtral * [meo] support save sharded optim * [meo] integrate moe manager into plug * [meo] fix optimizer load * [meo] fix mixtral layer
-
Xuanlei Zhao authored
-