- 14 Aug, 2023 2 commits
-
-
Tian Siyuan authored
Co-authored-by: Siyuan Tian <siyuant@vmware.com>
-
Wenhao Chen authored
* style: apply formatter
* fix: add outdated warnings
* docs: add dataset format and polish
* docs: polish README
* fix: fix json format
* fix: fix typos
* revert: revert 7b example
-
- 11 Aug, 2023 1 commit
-
-
LuGY authored
* improve stability of zero
* fix wrong index
* add record stream
-
- 10 Aug, 2023 1 commit
-
-
Baizhou Zhang authored
-
- 09 Aug, 2023 1 commit
-
-
flybird1111 authored
Updated ColoAttention tests to check outputs and gradients
-
- 04 Aug, 2023 4 commits
-
-
binmakeswell authored
* [doc] add Series A Funding and NeurIPS news
* [kernel] fix mha kernel
* [CI] skip moe
* [CI] fix requirements
-
flybird1111 authored
* [doc] fix gradient accumulation doc
* [doc] fix gradient accumulation doc
-
flybird1111 authored
fixed an import error
-
flybird1111 authored
Improved ColoAttention interface to support flash attention 2. Solved #4322
-
- 02 Aug, 2023 1 commit
-
-
Wenhao Chen authored
* style: rename replay buffer
  Experience replay is typically for off-policy algorithms. Using this name in PPO may be misleading.
* fix: fix wrong zero2 default arg
* test: update experience tests
* style: rename zero_pad fn
* fix: defer init in CycledDataLoader
* test: add benchmark test
* style: rename internal fn of generation
* style: rename internal fn of lora
* fix: remove unused loss fn
* fix: remove unused utils fn
* refactor: remove generate_with_actor fn
* fix: fix type annotation
* test: add models tests
* fix: skip llama due to long execution time
* style: modify dataset
* style: apply formatter
* perf: update reward dataset
* fix: fix wrong IGNORE_INDEX in sft dataset
* fix: remove DataCollatorForSupervisedDataset
* test: add dataset tests
* style: apply formatter
* style: rename test_ci to test_train
* feat: add llama in inference
* test: add inference tests
* test: change test scripts directory
* fix: update ci
* fix: fix typo
* fix: skip llama due to oom
* fix: fix file mod
* style: apply formatter
* refactor: remove duplicated llama_gptq
* style: apply formatter
* to: update rm test
* feat: add tokenizer arg
* feat: add download model script
* test: update train tests
* fix: modify gemini load and save pretrained
* test: update checkpoint io test
* to: modify nproc_per_node
* fix: do not remove existing dir
* fix: modify save path
* test: add random choice
* fix: fix sft path
* fix: enlarge nproc_per_node to avoid oom
* fix: add num_retry
* fix: make lora config of rm and critic consistent
* fix: add warning about lora weights
* fix: skip some gpt2 tests
* fix: remove grad ckpt in rm and critic due to errors
* refactor: directly use Actor in train_sft
* test: add more arguments
* fix: disable grad ckpt when using lora
* fix: fix save_pretrained and related tests
* test: enable zero2 tests
* revert: remove useless fn
* style: polish code
* test: modify test args
-
- 01 Aug, 2023 5 commits
-
-
Hongxin Liu authored
* [test] remove legacy zero test
* [test] remove lazy distribute test
* [test] remove outdated checkpoint io
-
caption authored
-
Hongxin Liu authored
* [release] update version
* [devops] hotfix cuda extension building
* [devops] pytest ignore useless folders
-
Wenhao Chen authored
-
LuGY authored
-
- 31 Jul, 2023 6 commits
-
-
LuGY authored
* optimize the optimizer step time
* fix corner case
* polish
* replace all-reduce with all-gather
* set comm device to cuda
-
LuGY authored
* support shard optimizer of zero
* polish code
* support sync grad manually
-
LuGY authored
* add state dict for zero
* fix unit test
* polish
-
LuGY authored
* allow passing process group to zero12
* union tp-zero and normal-zero
* polish code
-
LuGY authored
* support no sync for zero1 plugin
* polish
* polish
-
LuGY authored
* refactor low level zero
* fix zero2 and support cpu offload
* avg gradient and modify unit test
* refactor grad store, support layer drop
* refactor bucket store, support grad accumulation
* fix and update unit test of zero and ddp
* compatible with tp, ga and unit test
* fix memory leak and polish
* add zero layer drop unittest
* polish code
* fix import err in unit test
* support different comm dtype, modify docstring style
* polish code
* test padding and fix
* fix unit test of low level zero
* fix pad recording in bucket store
* support some models
* polish
-
- 28 Jul, 2023 1 commit
-
-
Yuanchen authored
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
-
- 26 Jul, 2023 18 commits
-
-
binmakeswell authored
-
yuxuan-lou authored
* [NFC] polish colossalai/context/random/__init__.py code style
* [NFC] polish applications/Chat/coati/models/utils.py code style
-
Zirui Zhu authored
-
Ziheng Qin authored
Co-authored-by: henryqin1997 <henryqin1997@gamil.com>
-
RichardoLuo authored
-
Yuanchen authored
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
-
アマデウス authored
-
Xu Kai authored
-
dayellow authored
* [NFC] polish colossalai/fx/profiler/experimental/profiler_module/embedding.py code style
* [NFC] polish colossalai/communication/utils.py code style
---------
Co-authored-by: Minghao Huang <huangminghao@luchentech.com>
-
Wenhao Chen authored
-
YeAnbang authored
Co-authored-by: aye42 <aye42@gatech.edu>
-
shenggan authored
-
Zheng Zangwei (Alex Zheng) authored
-
梁爽 authored
Co-authored-by: supercooledith <893754954@qq.com>
-
Yanjia0 authored
-
ocd_with_naming authored
-
CZYCW authored
-
Junming Wu authored
-