Commits · 07c2e3d09cd6bf42f280f20f0cc2ba2eb47677cc · OpenDAS / ColossalAI

"tests/test_zero/test_sharded_optim_v2.py" did not exist on "a30e2b4c24ec0de82206b6f2943df96cc80d7cad"

19 Sep, 2023 1 commit

[misc] update pre-commit and run all files (#4752) · 079bf3cb

Hongxin Liu authored Sep 19, 2023

* [misc] update pre-commit

* [misc] run pre-commit

* [misc] remove useless configuration files

* [misc] ignore cuda for clang-format

079bf3cb

29 Jun, 2023 2 commits

[chat] remove naive strategy and split colossalai strategy (#4094) · edd75a59

Wenhao Chen authored Jun 29, 2023

* feat: remove on_learn_epoch fn as not used

* revert: add _on_learn_epoch fn

* to: remove the use of NaiveStrategy

* test: remove NaiveStrategy tests

* feat: remove NaiveStrategy

* style: modify comments and params

* feat: split ColossalAIStrategy into LowLevelZeroStrategy and GeminiStrategy

* fix: remove naive

* fix: align with modified colossal strategy

* fix: fix ddp _try_init_dist arg

edd75a59

[chat] refactor trainer class (#4080) · b03d64d0

Wenhao Chen authored Jun 29, 2023

* to: add SLTrainer

* refactor: refactor RMTrainer and SFTTrainer

* fix: fix init file

* feat: remove on_learn_epoch fn as not used

* fix: align with modified gemini arguments

* to: add OnPolicyTrainer

* revert: add _on_learn_epoch fn

* refactor: refactor PPOTrainer

* style: rename PPOTrainer argument

* fix: align with modified PPO arguments

* test: align with modified train_prompts arguments

* chore: modify train_prompts

* docs: align with modified arguments

* fix: remove unnecessary output

* fix: move dataloader to fit fn of SLTrainer

* fix: move dataloader to fit fn of OnPolicyTrainer

* fix: modify usage of prompt and pretrain dataloader

b03d64d0

25 Jun, 2023 1 commit

[chat] refactor strategy class with booster api (#3987) · 153b957a

Wenhao Chen authored Jun 25, 2023

* refactor: adapt boost API in base and naive strategies

* fix: initialize plugin after setup_distributed

* fix: fix save_pretrained fn

* refactor: adapt boost API in DDPStrategy

* to: add _post_init check

* to: fix ddp backward, modify ddp dataloader and unwrap

* feat: adapt boost API in ColossalAIStrategy

* fix: call setup_distributed before use get_current_device

* fix: fix save_model and save_optimizer

* test: remove save_sharded_optimizer test

* style: apply formatter

* fix: fix stage check and add comments

* feat: allow dict type arg in strategy.prepare

* to: temporarily remove lr_scheduler for testing

* style: simplify init of ColossalAIStrategy

* fix: fix lr_scheduler in sft and rm

* style: modify comments

* test: add train_prompts tests

* fix: fix inference only case and use in train_prompts

* test: skip failed tests in ci

* style: fix CodeFactor check

* fix: do not use model.to('cpu') with GeminiPlugin

* test: enable colossalai_gemini tests

* test: set CUDA_VISIBLE_DEVICES in ci

* docs: add note

153b957a

04 May, 2023 1 commit
- [chat] add opt attn kernel (#3655) · 7bd0bee8
  Hongxin Liu authored May 04, 2023
```
* [chat] add opt attn kernel

* [chat] disable xformer during fwd
```
  7bd0bee8
26 Apr, 2023 2 commits

[chat] refactor trainer (#3648) · 2a951955

Hongxin Liu authored Apr 26, 2023

* [chat] ppo trainer remove useless args

* [chat] update examples

* [chat] update benchmark

* [chat] update examples

* [chat] fix sft training with wandb

* [chat] polish docstr

2a951955

[gemini] accelerate inference (#3641) · 50793b35

Hongxin Liu authored Apr 26, 2023

* [gemini] support don't scatter after inference

* [chat] update colossalai strategy

* [chat] fix opt benchmark

* [chat] update opt benchmark

* [gemini] optimize inference

* [test] add gemini inference test

* [chat] fix unit test ci

* [chat] fix ci

* [chat] fix ci

* [chat] skip checkpoint test

50793b35

18 Apr, 2023 1 commit
- reconstruct chat trainer and fix training script (#3588) · 1ec0d386
  Yuanchen authored Apr 18, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  1ec0d386
28 Mar, 2023 1 commit
- [Coati] first commit (#3283) · b0ce5a10
  Fazzie-Maqianli authored Mar 28, 2023
  
  b0ce5a10
07 Mar, 2023 1 commit
- change nn to models (#3032) · c21b11ed
  Fazzie-Maqianli authored Mar 07, 2023
  
  c21b11ed
17 Feb, 2023 1 commit

[chatgpt] startegy add prepare method (#2766) · 4ee311c0

ver217 authored Feb 17, 2023

* [chatgpt] startegy add prepare method

* [chatgpt] refactor examples

* [chatgpt] refactor strategy.prepare

* [chatgpt] support save/load checkpoint

* [chatgpt] fix unwrap actor

* [chatgpt] fix unwrap actor

4ee311c0

15 Feb, 2023 1 commit

[chatgpt] optimize generation kwargs (#2717) · 9c0943ec

ver217 authored Feb 15, 2023

* [chatgpt] ppo trainer use default generate args

* [chatgpt] example remove generation preparing fn

* [chatgpt] benchmark remove generation preparing fn

* [chatgpt] fix ci

9c0943ec

14 Feb, 2023 1 commit
- [app] add chatgpt application (#2698) · 1b347010
  ver217 authored Feb 14, 2023
  
  1b347010