- 14 Apr, 2022 1 commit
-
-
Frank Lee authored
* [test] refactored with the new rerun decorator * polish test case
-
- 13 Apr, 2022 2 commits
- 12 Apr, 2022 2 commits
-
-
Jiarui Fang authored
-
HELSON authored
-
- 11 Apr, 2022 2 commits
-
-
Jiarui Fang authored
-
HELSON authored
* adapt post grad hooks for not-shard parameters * adapt optimizer for not-shard parameters * offload gradients for not-replicated parameters
-
- 08 Apr, 2022 1 commit
-
-
HELSON authored
-
- 07 Apr, 2022 1 commit
-
-
HELSON authored
* adapt model weight initialization for methods in Pytorch nn.init
-
- 03 Apr, 2022 1 commit
-
-
Jiarui Fang authored
-
- 02 Apr, 2022 2 commits
- 01 Apr, 2022 1 commit
-
-
HELSON authored
-
- 31 Mar, 2022 2 commits
-
-
HELSON authored
* support existing sharded and unsharded parameters in zero * add unitest for moe-zero model init * polish moe gradient handler
-
Jiarui Fang authored
-
- 29 Mar, 2022 2 commits
-
-
HELSON authored
-
Liang Bowen authored
-
- 25 Mar, 2022 1 commit
-
-
Frank Lee authored
-
- 23 Mar, 2022 1 commit
-
-
Jiarui Fang authored
-
- 21 Mar, 2022 2 commits
-
-
Jiarui Fang authored
-
HELSON authored
-
- 18 Mar, 2022 1 commit
-
-
HELSON authored
-
- 11 Mar, 2022 2 commits