- 14 Apr, 2022 4 commits
-
-
Jiarui Fang authored
-
ver217 authored
-
ver217 authored
* fix reuse_fp16_shard * disable test stm * polish code
-
HELSON authored
-
- 13 Apr, 2022 2 commits
- 12 Apr, 2022 5 commits
-
-
Frank Lee authored
-
Jiarui Fang authored
-
HELSON authored
-
FrankLeeeee authored
-
FrankLeeeee authored
-
- 11 Apr, 2022 6 commits
-
-
Jiarui Fang authored
-
Frank Lee authored
-
Jiarui Fang authored
-
HELSON authored
-
HELSON authored
* adapt post grad hooks for unsharded parameters * adapt optimizer for unsharded parameters * offload gradients for non-replicated parameters
-
ver217 authored
* refactor memstats collector * fix disposable * polish code
-
- 08 Apr, 2022 2 commits
-
-
HELSON authored
-
ver217 authored
* [WIP] stateful tensor manager * add eviction strategy * polish code * polish code * polish comment * add unit test * fix sampler bug * polish code * fix max sampling cnt resetting bug * fix sampler bug * polish code * fix bug * fix unit test Co-authored-by: jiaruifang <fangjiarui123@gmail.com>
-
- 07 Apr, 2022 1 commit
-
-
HELSON authored
* adapt model weight initialization for methods in PyTorch nn.init
-
- 03 Apr, 2022 2 commits
-
-
Jiarui Fang authored
-
YuliangLiu0306 authored
-
- 02 Apr, 2022 2 commits
- 01 Apr, 2022 4 commits
-
-
HELSON authored
-
アマデウス authored
-
FredHuang99 authored
-
Jiarui Fang authored
-
- 31 Mar, 2022 3 commits
-
-
HELSON authored
* support existing sharded and unsharded parameters in zero * add unit test for moe-zero model init * polish moe gradient handler
-
ver217 authored
-
Jiarui Fang authored
-
- 30 Mar, 2022 3 commits
-
-
ver217 authored
* hijack p.grad in sharded model * polish comments * polish comments
-
Jiarui Fang authored
-
Jiarui Fang authored
-
- 29 Mar, 2022 5 commits
-
-
HELSON authored
-
Liang Bowen authored
-
Jiarui Fang authored
-
ver217 authored
-
Jiarui Fang authored
-
- 28 Mar, 2022 1 commit
-
-
HELSON authored
* only process module's own parameters in Zero context * add zero hooks for all modules that contain parameters * gather parameters only belonging to module itself
-