"vscode:/vscode.git/clone" did not exist on "2a2ec49aa70c95f73b6017624e32cdad6b36b0e1"
- 07 Apr, 2022 1 commit
-
-
HELSON authored
* adapt model weight initialization for methods in Pytorch nn.init
-
- 01 Apr, 2022 1 commit
-
-
Jiarui Fang authored
-
- 31 Mar, 2022 1 commit
-
-
Jiarui Fang authored
-
- 29 Mar, 2022 2 commits
-
-
Jiarui Fang authored
-
ver217 authored
-
- 28 Mar, 2022 1 commit
-
-
Jiarui Fang authored
-
- 25 Mar, 2022 3 commits
-
-
Jiarui Fang authored
-
Frank Lee authored
-
Jiarui Fang authored
-
- 22 Mar, 2022 1 commit
-
-
Jiarui Fang authored
* [zero] polish sharded param name * polish code * polish * polish code * polish * polsih * polish
-
- 18 Mar, 2022 2 commits
- 15 Mar, 2022 1 commit
-
-
Jiarui Fang authored
-
- 14 Mar, 2022 2 commits
-
-
Jiarui Fang authored
-
ver217 authored
-
- 11 Mar, 2022 7 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
* place params on cpu after zero init context * polish code
-
Jiarui Fang authored
-
ver217 authored
-
jiaruifang authored
-
Jiarui Fang authored
-
Jiarui Fang authored
* add zero init context * add more flags for zero init context fix bug of repeated converting param to ShardedParamV2 * polish code
-