- 01 Aug, 2023 5 commits
-
-
Hongxin Liu authored
* [test] remove legacy zero test * [test] remove lazy distribute test * [test] remove outdated checkpoint io
-
caption authored
-
Hongxin Liu authored
* [release] update version * [devops] hotfix cuda extension building * [devops] pytest ignore useless folders
-
Wenhao Chen authored
-
LuGY authored
-
- 31 Jul, 2023 6 commits
-
-
LuGY authored
* optimize the optimizer step time * fix corner case * polish * replace all-reduce with all-gather * set comm device to cuda
-
LuGY authored
* support shard optimizer of zero * polish code * support sync grad manually
-
LuGY authored
* add state dict for zero * fix unit test * polish
-
LuGY authored
* allow passing process group to zero12 * union tp-zero and normal-zero * polish code
-
LuGY authored
* support no sync for zero1 plugin * polish * polish
-
LuGY authored
* refactor low level zero * fix zero2 and support cpu offload * avg gradient and modify unit test * refactor grad store, support layer drop * refactor bucket store, support grad accumulation * fix and update unit test of zero and ddp * compatible with tp, ga and unit test * fix memory leak and polish * add zero layer drop unittest * polish code * fix import err in unit test * support diffenert comm dtype, modify docstring style * polish code * test padding and fix * fix unit test of low level zero * fix pad recording in bucket store * support some models * polish
-
- 28 Jul, 2023 1 commit
-
-
Yuanchen authored
Co-authored-by:Yuanchen Xu <yuanchen.xu00@gmail.com>
-
- 26 Jul, 2023 21 commits
-
-
binmakeswell authored
-
yuxuan-lou authored
* [NFC] polish colossalai/context/random/__init__.py code style * [NFC] polish applications/Chat/coati/models/utils.py code style
-
Zirui Zhu authored
-
Ziheng Qin authored
Co-authored-by:henryqin1997 <henryqin1997@gamil.com>
-
RichardoLuo authored
-
Yuanchen authored
Co-authored-by:Yuanchen Xu <yuanchen.xu00@gmail.com>
-
アマデウス authored
-
Xu Kai authored
-
dayellow authored
* [NFC] polish colossalai/fx/profiler/experimental/profiler_module/embedding.py code style * [NFC] polish colossalai/communication/utils.py code style --------- Co-authored-by:Minghao Huang <huangminghao@luchentech.com>
-
Wenhao Chen authored
-
YeAnbang authored
Co-authored-by:aye42 <aye42@gatech.edu>
-
shenggan authored
-
Zheng Zangwei (Alex Zheng) authored
-
梁爽 authored
Co-authored-by:supercooledith <893754954@qq.com>
-
Yanjia0 authored
-
ocd_with_naming authored
-
CZYCW authored
-
Junming Wu authored
-
Camille Zhong authored
-
Michelle authored
* revise shardformer readme (#4246) * [example] add llama pretraining (#4257) * [NFC] polish colossalai/communication/p2p.py code style --------- Co-authored-by:
Jianghai <72591262+CjhHa1@users.noreply.github.com> Co-authored-by:
binmakeswell <binmakeswell@gmail.com> Co-authored-by:
Qianran Ma <qianranm@luchentech.com>
-
Jianghai authored
* [NFC] polish colossalai/booster/mixed_precision/mixed_precision_base.py code style
-
- 21 Jul, 2023 2 commits
-
-
Hongxin Liu authored
-
Baizhou Zhang authored
* sharded optimizer checkpoint for gemini plugin * modify test to reduce testing time * update doc * fix bug when keep_gatherd is true under GeminiPlugin
-
- 19 Jul, 2023 1 commit
-
-
Hongxin Liu authored
* [lazy] support init on cuda * [test] update lazy init test * [test] fix transformer version
-
- 18 Jul, 2023 1 commit
-
-
Cuiqing Li authored
* added softmax kernel * added qkv_kernel * added ops * adding tests * upload tets * fix tests * debugging * debugging tests * debugging * added * fixed errors * added softmax kernel * clean codes * added tests * update tests * update tests * added attention * add * fixed pytest checking * add cuda check * fix cuda version * fix typo
-
- 17 Jul, 2023 2 commits
-
-
binmakeswell authored
-
Jianghai authored
-
- 12 Jul, 2023 1 commit
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-