Commits · aaeb520ce365f1ee19513d17a1c1f652ab4e9d90 · OpenDAS / ColossalAI

29 Aug, 2023 1 commit

[coati] add chatglm model (#4539) · 1467e3b4

yingliu-hpc authored Aug 29, 2023

* update configuration of chatglm and add support in coati

* add unit test & update chatglm default config & fix bos index issue

* remove chatglm due to oom

* add dataset pkg in requirement-text

* fix parameter issue in test_models

* add ref in tokenize & rm unnessary parts

* separate source & target tokenization in chatglm

* add unit test to chatglm

* fix test dataset issue

* update truncation of chatglm

* fix Colossalai version

* fix colossal ai version in test

1467e3b4

29 Jun, 2023 2 commits

[chat] remove naive strategy and split colossalai strategy (#4094) · edd75a59

Wenhao Chen authored Jun 29, 2023

* feat: remove on_learn_epoch fn as not used

* revert: add _on_learn_epoch fn

* to: remove the use of NaiveStrategy

* test: remove NaiveStrategy tests

* feat: remove NaiveStrategy

* style: modify comments and params

* feat: split ColossalAIStrategy into LowLevelZeroStrategy and GeminiStrategy

* fix: remove naive

* fix: align with modified colossal strategy

* fix: fix ddp _try_init_dist arg

edd75a59

[chat] refactor trainer class (#4080) · b03d64d0

Wenhao Chen authored Jun 29, 2023

* to: add SLTrainer

* refactor: refactor RMTrainer and SFTTrainer

* fix: fix init file

* feat: remove on_learn_epoch fn as not used

* fix: align with modified gemini arguments

* to: add OnPolicyTrainer

* revert: add _on_learn_epoch fn

* refactor: refactor PPOTrainer

* style: rename PPOTrainer argument

* fix: align with modified PPO arguments

* test: align with modified train_prompts arguments

* chore: modify train_prompts

* docs: align with modified arguments

* fix: remove unnecessary output

* fix: move dataloader to fit fn of SLTrainer

* fix: move dataloader to fit fn of OnPolicyTrainer

* fix: modify usage of prompt and pretrain dataloader

b03d64d0

25 Jun, 2023 1 commit

[chat] refactor strategy class with booster api (#3987) · 153b957a

Wenhao Chen authored Jun 25, 2023

* refactor: adapt boost API in base and naive strategies

* fix: initialize plugin after setup_distributed

* fix: fix save_pretrained fn

* refactor: adapt boost API in DDPStrategy

* to: add _post_init check

* to: fix ddp backward, modify ddp dataloader and unwrap

* feat: adapt boost API in ColossalAIStrategy

* fix: call setup_distributed before use get_current_device

* fix: fix save_model and save_optimizer

* test: remove save_sharded_optimizer test

* style: apply formatter

* fix: fix stage check and add comments

* feat: allow dict type arg in strategy.prepare

* to: temporarily remove lr_scheduler for testing

* style: simplify init of ColossalAIStrategy

* fix: fix lr_scheduler in sft and rm

* style: modify comments

* test: add train_prompts tests

* fix: fix inference only case and use in train_prompts

* test: skip failed tests in ci

* style: fix CodeFactor check

* fix: do not use model.to('cpu') with GeminiPlugin

* test: enable colossalai_gemini tests

* test: set CUDA_VISIBLE_DEVICES in ci

* docs: add note

153b957a

28 Apr, 2023 1 commit
- [chat] typo accimulation_steps -> accumulation_steps (#3662) · 1a60dc07
  tanitna authored Apr 28, 2023
  
  1a60dc07
27 Apr, 2023 2 commits

[chat] refactor model save/load logic (#3654) · 842768a1

Hongxin Liu authored Apr 27, 2023

* [chat] strategy refactor unwrap model

* [chat] strategy refactor save model

* [chat] add docstr

* [chat] refactor trainer save model

* [chat] fix strategy typing

* [chat] refactor trainer save model

* [chat] update readme

* [chat] fix unit test

842768a1

[chat] remove lm model class (#3653) · 6ef70114

Hongxin Liu authored Apr 27, 2023

* [chat] refactor lora

* [chat] remove lm class

* [chat] refactor save model

* [chat] refactor train sft

* [chat] fix ci

* [chat] fix ci

6ef70114

26 Apr, 2023 1 commit

[chat] refactor trainer (#3648) · 2a951955

Hongxin Liu authored Apr 26, 2023

* [chat] ppo trainer remove useless args

* [chat] update examples

* [chat] update benchmark

* [chat] update examples

* [chat] fix sft training with wandb

* [chat] polish docstr

2a951955

18 Apr, 2023 1 commit
- reconstruct chat trainer and fix training script (#3588) · 1ec0d386
  Yuanchen authored Apr 18, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  1ec0d386
17 Apr, 2023 1 commit
- fix: fix sft (#3568) · 7788e0b0
  tingfeng cao authored Apr 17, 2023
  
  7788e0b0
28 Mar, 2023 1 commit
- [Coati] first commit (#3283) · b0ce5a10
  Fazzie-Maqianli authored Mar 28, 2023
  
  b0ce5a10
24 Mar, 2023 1 commit
- support instrcut training (#3230) · bd39877d
  Fazzie-Maqianli authored Mar 24, 2023
  
  bd39877d
23 Mar, 2023 2 commits
- [chatgpt] unnify datasets (#3218) · fa97a9ca
  Fazzie-Maqianli authored Mar 23, 2023
  
  fa97a9ca
- [chatgpt] support instuct training (#3216) · 4fd4bd9d
  Fazzie-Maqianli authored Mar 23, 2023
  
  4fd4bd9d
22 Mar, 2023 1 commit

[chatgpt] add supervised learning fine-tune code (#3183) · b4295293

pgzhang authored Mar 22, 2023



* [chatgpt] add supervised fine-tune code

* [chatgpt] delete unused code and modified comment code

* [chatgpt] use pytorch distributed sampler instead

---------
Co-authored-by: zhangpengpeng <zhangpengpeng@joyy.com>

b4295293