- 30 Nov, 2022 6 commits
-
-
HELSON authored
* [gemini] fix init bugs for modules * fix bugs
-
Zihao authored
-
Jiarui Fang authored
-
Jiarui Fang authored
-
HELSON authored
-
HELSON authored
-
- 29 Nov, 2022 4 commits
-
-
Jiarui Fang authored
* [Gemini] more tests for Gemini * polish code
-
HELSON authored
* [testing] fix testing models * roll back
-
Jiarui Fang authored
-
Jiarui Fang authored
-
- 26 Nov, 2022 1 commit
-
-
Zihao authored
-
- 25 Nov, 2022 1 commit
-
-
Jiarui Fang authored
-
- 24 Nov, 2022 3 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
-
Zihao authored
-
- 23 Nov, 2022 1 commit
-
-
Jiarui Fang authored
-
- 18 Nov, 2022 3 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
-
Jiarui Fang authored
-
- 16 Nov, 2022 1 commit
-
-
Jiarui Fang authored
-
- 02 Nov, 2022 1 commit
-
-
HELSON authored
* [hotfix] fix zero's incompatibility with checkpoint in torch-1.12 * [zero] add cpu shard init * [zero] add tiny example test * [colo_tensor] fix bugs for torch-1.11
-
- 18 Oct, 2022 1 commit
-
-
HELSON authored
* add chunk manager init function * fix unit tests * add comment * add flush=True
-
- 14 Oct, 2022 1 commit
-
-
HELSON authored
* fixes memory leak when paramter is in fp16 in ZeroDDP init. * bans chunk releasement in CUDA. Only when a chunk is about to offload, it is allowed to release. * adds a constant placement policy. With it, users can allocate a reserved caching memory space for parameters.
-
- 09 Oct, 2022 1 commit
-
-
HELSON authored
-
- 26 Sep, 2022 1 commit
-
-
Jiarui Fang authored
This reverts commit 5be118f4.
-
- 24 Sep, 2022 1 commit
-
-
HELSON authored
-
- 11 Aug, 2022 2 commits
- 10 Aug, 2022 2 commits
- 09 Aug, 2022 1 commit
-
-
HELSON authored
-
- 14 Jul, 2022 1 commit
-
-
Jiarui Fang authored
-
- 26 Apr, 2022 1 commit
-
-
ver217 authored
* add lru cache * polish code * update unit test * fix sharded optim
-
- 25 Apr, 2022 1 commit
-
-
HELSON authored
-
- 24 Apr, 2022 1 commit
-
-
HELSON authored
* refactor StatefulTensor, tensor utilities * add unitest for GeminiMemoryManager
-
- 21 Apr, 2022 2 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
* Revert "[zero] add ZeroTensorShardStrategy (#793)" This reverts commit 88759e28. * [gemini] set cpu memory capacity * [log] local throughput collecting * polish * polish * polish * polish code * polish * polish code * add a new tensor structure and override linear for it * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish
-
- 19 Apr, 2022 1 commit
-
-
Jiarui Fang authored
-