- 15 Jul, 2022 1 commit
-
-
HELSON authored
-
- 14 Jul, 2022 3 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
-
HELSON authored
-
- 13 Jul, 2022 1 commit
-
-
HELSON authored
-
- 12 Jul, 2022 5 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
* make it faster
* [hotfix] torchvision fx tests
* [hotfix] rename duplicated named test_gpt.py
-
HELSON authored
-
Jiarui Fang authored
-
Jiarui Fang authored
* make it faster
* [tensor] rename convert_to_dist -> redistribute
* [tensor] ShardSpec and ReplicaSpec
* [tensor] redistribute among diff pgs
* polish code
-
- 11 Jul, 2022 3 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
-
HELSON authored
-
- 08 Jul, 2022 4 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
-
HELSON authored
-
Jiarui Fang authored
* init a checkpoint dir
* [checkpoint]support resume for cosinewarmuplr
* [checkpoint]add unit test
* fix some bugs but still not OK
* fix bugs
* make it faster
* [checkpoint]support generalized scheduler
* polish
* [tensor] torch function return colotensor
* polish
* fix bugs
* remove debug info
* polish
* polish
* [tensor] test_model pass unittests
* polish
* [hotfix] fx get comm size bug

Co-authored-by: ZhaoYi1222 <zhaoyi9499@gmail.com>
-
- 07 Jul, 2022 3 commits
-
-
HELSON authored
-
Jiarui Fang authored
-
Jiarui Fang authored
-
- 06 Jul, 2022 1 commit
-
-
Jiarui Fang authored
-
- 04 Jul, 2022 1 commit
-
-
Jiarui Fang authored
-
- 29 Jun, 2022 3 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
-
Jiarui Fang authored
-
- 27 Jun, 2022 3 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
-
Jiarui Fang authored
-
- 24 Jun, 2022 1 commit
-
-
Jiarui Fang authored
-
- 23 Jun, 2022 2 commits
-
-
Jiarui Fang authored
-
Jiarui Fang authored
-
- 22 Jun, 2022 4 commits
-
-
Jiarui Fang authored
-
ver217 authored
-
ver217 authored
* add more element-wise ops
* update test_op
* polish unit test
-
ver217 authored
* dist spec s2s uses all-to-all
* update unit test
* add sanity check
* polish unittest test with titans
* add sanity check for DistMgr
* add sanity check

Co-authored-by: jiaruifang <fangjiarui123@gmail.com>
-
- 21 Jun, 2022 2 commits
-
-
Jiarui Fang authored
-
ver217 authored
* ColoDDP supports overwriting default process group
* rename ColoDDPV2 to ZeroDDP
* add docstr for ZeroDDP
* polish docstr
-
- 17 Jun, 2022 1 commit
-
-
ver217 authored
* fix param op hook
* update zero tp test
* fix bugs
-
- 15 Jun, 2022 1 commit
-
-
ver217 authored
* update gemini mgr
* update chunk
* add docstr
* polish placement policy
* update test chunk
* update test zero
* polish unit test
* remove useless unit test
-
- 10 Jun, 2022 1 commit
-
-
ver217 authored
* add placement policy
* add gemini mgr
* update mem stats collector
* update zero
* update zero optim
* fix bugs
* zero optim monitor os
* polish unit test
* polish unit test
* add assert
-