- 01 Apr, 2022 11 commits
- 31 Mar, 2022 9 commits
-
ver217 authored
-
HELSON authored
* support existing sharded and unsharded parameters in zero * add unit test for moe-zero model init * polish moe gradient handler
-
LuGY authored
-
Wesley authored
-
Wesley authored
-
BoxiangW authored
Change the clang-format style to google style
-
ver217 authored
-
Jiarui Fang authored
-
Liang Bowen authored
-
- 30 Mar, 2022 8 commits
-
Jiarui Fang authored
-
LuGY authored
-
ver217 authored
* hijack p.grad in sharded model * polish comments * polish comments
-
Jiarui Fang authored
-
github-actions[bot] authored
-
Jiarui Fang authored
-
Jiarui Fang authored
-
Ziyue Jiang authored
-
- 29 Mar, 2022 8 commits
-
HELSON authored
-
Liang Bowen authored
-
Jiarui Fang authored
-
Jie Zhu authored
* add memory trainer hook * fix bug * add memory trainer hook * fix import bug * fix import bug * add trainer hook * fix #370 git log bug * modify `to_tensorboard` function to support better output * remove useless output * change the name of `MemProfiler` * complete memory profiler * replace error with warning * finish trainer hook * modify interface of MemProfiler * modify `__init__.py` in profiler * remove unnecessary pass statement * add usage to doc string * add usage to trainer hook * new location to store temp data file
-
ver217 authored
* optimize grad offload * polish code * polish code
-
Jiarui Fang authored
-
ver217 authored
-
Jiarui Fang authored
-
- 28 Mar, 2022 4 commits
-
HELSON authored
* only process module's own parameters in Zero context * add zero hooks for all modules that contain parameters * gather parameters only belonging to module itself
-
Jiarui Fang authored
-
Jiarui Fang authored
-
Jiarui Fang authored
-