- 08 Apr, 2022 1 commit
-
-
HELSON authored
-
- 07 Apr, 2022 1 commit
-
-
HELSON authored
* adapt model weight initialization for methods in PyTorch nn.init
-
- 03 Apr, 2022 1 commit
-
-
Jiarui Fang authored
-
- 01 Apr, 2022 1 commit
-
-
HELSON authored
-
- 31 Mar, 2022 2 commits
-
-
HELSON authored
* support existing sharded and unsharded parameters in zero
* add unit test for moe-zero model init
* polish moe gradient handler
-
Jiarui Fang authored
-
- 29 Mar, 2022 1 commit
-
-
HELSON authored
-