"docs/vscode:/vscode.git/clone" did not exist on "d202cc28c0e7707762bb5f94d944575b327ba903"
HELSON authored
* support existing sharded and unsharded parameters in zero
* add unit test for moe-zero model init
* polish moe gradient handler
e6d50ec1