Commits · 16bf4c022150fea303d23437b0190c46204e722c · OpenDAS / ColossalAI

01 Aug, 2023 1 commit
- [chat] fix compute_approx_kl (#4338) · 75c53890
  Wenhao Chen authored Aug 01, 2023
  
28 Jul, 2023 1 commit
- support session-based training (#4313) · 5187c96b
  Yuanchen authored Jul 28, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
26 Jul, 2023 12 commits
- [NFC] polish applications/Chat/coati/models/utils.py codestyle (#4277) · 09914053
  yuxuan-lou authored Jul 19, 2023
```
* [NFC] polish colossalai/context/random/__init__.py code style

* [NFC] polish applications/Chat/coati/models/utils.py code style
```
- [NFC] polish applications/Chat/coati/trainer/strategies/base.py code style (#4278) · 9e512938
  Zirui Zhu authored Jul 19, 2023
  
- applications/Chat/.gitignore (#4279) · c972d653
  Ziheng Qin authored Jul 19, 2023
```
Co-authored-by: henryqin1997 <henryqin1997@gamil.com>
```
- [NFC] polish applications/Chat/coati/models/generation.py code style (#4275) · 709e121c
  RichardoLuo authored Jul 18, 2023
  
- [NFC] polish applications/Chat/inference/server.py code style (#4274) · dc1b6127
  Yuanchen authored Jul 18, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
- [NFC] fix format of application/Chat/coati/trainer/utils.py (#4273) · caa44330
  アマデウス authored Jul 18, 2023
  
- [NFC] polish applications/Chat/examples/train_reward_model.py code style (#4271) · 1ce997da
  Xu Kai authored Jul 18, 2023
  
- [NFC] polish applications/Chat/coati/trainer/base.py code style (#4260) · 798cb729
  shenggan authored Jul 18, 2023
  
- [NFC] polish applications/Chat/coati/dataset/sft_dataset.py code style (#4259) · b2debdc0
  Zheng Zangwei (Alex Zheng) authored Jul 18, 2023
  
- [NFC] policy applications/Chat/examples/ray/mmmt_prompt.py code style (#4250) · dee1c963
  CZYCW authored Jul 18, 2023
  
- [NFC] polish applications/Chat/coati/models/base/actor.py code style (#4248) · 77c469e1
  Junming Wu authored Jul 18, 2023
  
- [NFC] polish applications/Chat/inference/requirements.txt code style (#4265) · 915ed8be
  Camille Zhong authored Jul 18, 2023
  
04 Jul, 2023 3 commits

- [chat] removed cache file (#4155) · f447ca18
  Frank Lee authored Jul 04, 2023
- [shardformer] shardformer support t5 model (#3994) · c1c672d0
  wukong1992 authored Jun 15, 2023
```
test t5
```

- [chat] use official transformers and fix some issues (#4117) · 3d8d5d0d
  Wenhao Chen authored Jul 04, 2023
```
* feat: remove on_learn_epoch fn as not used

* revert: add _on_learn_epoch fn

* feat: remove NaiveStrategy

* test: update train_prompts tests

* fix: remove prepare_llama_tokenizer_and_embedding

* test: add lora arg

* feat: remove roberta support in train_prompts due to runtime errs

* feat: remove deberta & roberta in rm as not used

* test: remove deberta and roberta tests

* feat: remove deberta and roberta models as not used

* fix: remove calls to roberta

* fix: remove prepare_llama_tokenizer_and_embedding

* chore: update transformers version

* docs: update transformers version

* fix: fix actor inference

* fix: fix ci

* feat: change llama pad token to unk

* revert: revert ddp setup_distributed

* fix: change llama pad token to unk

* revert: undo unnecessary changes

* fix: use pip to install transformers
```

29 Jun, 2023 2 commits

- [chat] remove naive strategy and split colossalai strategy (#4094) · edd75a59
  Wenhao Chen authored Jun 29, 2023
```
* feat: remove on_learn_epoch fn as not used

* revert: add _on_learn_epoch fn

* to: remove the use of NaiveStrategy

* test: remove NaiveStrategy tests

* feat: remove NaiveStrategy

* style: modify comments and params

* feat: split ColossalAIStrategy into LowLevelZeroStrategy and GeminiStrategy

* fix: remove naive

* fix: align with modified colossal strategy

* fix: fix ddp _try_init_dist arg
```

- [chat] refactor trainer class (#4080) · b03d64d0
  Wenhao Chen authored Jun 29, 2023
```
* to: add SLTrainer

* refactor: refactor RMTrainer and SFTTrainer

* fix: fix init file

* feat: remove on_learn_epoch fn as not used

* fix: align with modified gemini arguments

* to: add OnPolicyTrainer

* revert: add _on_learn_epoch fn

* refactor: refactor PPOTrainer

* style: rename PPOTrainer argument

* fix: align with modified PPO arguments

* test: align with modified train_prompts arguments

* chore: modify train_prompts

* docs: align with modified arguments

* fix: remove unnecessary output

* fix: move dataloader to fit fn of SLTrainer

* fix: move dataloader to fit fn of OnPolicyTrainer

* fix: modify usage of prompt and pretrain dataloader
```

26 Jun, 2023 2 commits
- [hotfix]fix argument naming in docs and examples (#4083) · 4da324cd
  Baizhou Zhang authored Jun 26, 2023
  
- [chat]: fix chat evaluation possible bug (#4064) · e89b127d
  Michelle authored Jun 26, 2023
```
* fix chat eval

* fix utils

* fix utils

* add comment

---------
Co-authored-by: Qianran Ma <qianranm@luchentech.com>
```
25 Jun, 2023 1 commit

- [chat] refactor strategy class with booster api (#3987) · 153b957a
  Wenhao Chen authored Jun 25, 2023
```
* refactor: adapt boost API in base and naive strategies

* fix: initialize plugin after setup_distributed

* fix: fix save_pretrained fn

* refactor: adapt boost API in DDPStrategy

* to: add _post_init check

* to: fix ddp backward, modify ddp dataloader and unwrap

* feat: adapt boost API in ColossalAIStrategy

* fix: call setup_distributed before use get_current_device

* fix: fix save_model and save_optimizer

* test: remove save_sharded_optimizer test

* style: apply formatter

* fix: fix stage check and add comments

* feat: allow dict type arg in strategy.prepare

* to: temporarily remove lr_scheduler for testing

* style: simplify init of ColossalAIStrategy

* fix: fix lr_scheduler in sft and rm

* style: modify comments

* test: add train_prompts tests

* fix: fix inference only case and use in train_prompts

* test: skip failed tests in ci

* style: fix CodeFactor check

* fix: do not use model.to('cpu') with GeminiPlugin

* test: enable colossalai_gemini tests

* test: set CUDA_VISIBLE_DEVICES in ci

* docs: add note
```

19 Jun, 2023 1 commit
- [nfc] fix dim not defined and fix typo (#3991) · 727c4598
  digger yu authored Jun 19, 2023
  
15 Jun, 2023 1 commit
- fix typo applications/Chat/coati/ (#3947) · d4fb7bfd
  digger yu authored Jun 15, 2023
  
13 Jun, 2023 2 commits

- [evaluate] support gpt evaluation with reference (#3972) · 2925f473
  Yuanchen authored Jun 13, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```

- [chat] refactor actor class (#3968) · 9d02590c
  Wenhao Chen authored Jun 13, 2023
```
* refactor: separate log_probs fn from Actor forward fn

* refactor: separate generate fn from Actor class

* feat: update unwrap_model and get_base_model
  * unwrap_model returns model not wrapped by Strategy
  * get_base_model returns HF model for Actor, Critic and RewardModel

* feat: simplify Strategy.prepare

* style: remove get_base_model method of Actor

* perf: tokenize text in batches

* refactor: move calc_action_log_probs to utils of model

* test: update test with new forward fn

* style: rename forward fn args

* fix: do not unwrap model in save_model fn of naive strategy

* test: add gemini test for train_prompts

* fix: fix _set_default_generate_kwargs
```

08 Jun, 2023 1 commit
- support UniEval and add CHRF metric (#3924) · 21c4c0b1
  Yuanchen authored Jun 08, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
07 Jun, 2023 1 commit

[chat] add distributed PPO trainer (#3740) · b5f05663

Hongxin Liu authored Jun 07, 2023

* Detached ppo (#9)

* run the base

* working on dist ppo

* sync

* detached trainer

* update detached trainer. no maker update function

* facing init problem

* 1 maker 1 trainer detached run. but no model update

* facing cuda problem

* fix save functions

* verified maker update

* nothing

* add ignore

* analyze loss issue

* remove some debug codes

* facing 2m1t stuck issue

* 2m1t verified

* do not use torchrun

* working on 2m2t

* working on 2m2t

* initialize strategy in ray actor env

* facing actor's init order issue

* facing ddp model update issue (need unwrap ddp)

* unwrap ddp actor

* checking 1m2t stuck problem

* nothing

* set timeout for trainer choosing. It solves the stuck problem!

* delete some debug output

* rename to sync with upstream

* rename to sync with upstream

* coati rename

* nothing

* I am going to detach the replaybuffer from trainer and make it a Ray Actor. Two benefits: 1. support TP trainer. 2. asynchronized buffer operations

* experience_maker_holder performs target-revolving _send_experience() instead of length comparison.

* move code to ray subfolder

* working on pipeline inference

* apply comments

* working on pipeline strategy. in progress.

* remove pipeline code. clean this branch

* update remote parameters by state_dict. no test

* nothing

* state_dict sharding transfer

* merge debug branch

* gemini _unwrap_model fix

* simplify code

* simplify code & fix LoRALinear AttributeError

* critic unwrapped state_dict

---------
Co-authored-by: csric <richcsr256@gmail.com>

* [chat] add performance evaluator and fix bugs (#10)

* [chat] add performance evaluator for ray

* [chat] refactor debug arg

* [chat] support hf config

* [chat] fix generation

* [chat] add 1mmt dummy example

* [chat] fix gemini ckpt

* split experience to send (#11)
Co-authored-by: csric <richcsr256@gmail.com>

* [chat] refactor trainer and maker (#12)

* [chat] refactor experience maker holder

* [chat] refactor model init

* [chat] refactor trainer args

* [chat] refactor model init

* [chat] refactor trainer

* [chat] refactor experience sending logic and training loop args (#13)

* [chat] refactor experience send logic

* [chat] refactor trainer

* [chat] refactor trainer

* [chat] refactor experience maker

* [chat] refactor pbar

* [chat] refactor example folder (#14)

* [chat] support quant (#15)

* [chat] add quant

* [chat] add quant example

* prompt example (#16)

* prompt example

* prompt load csv data

* remove legacy try

---------
Co-authored-by: csric <richcsr256@gmail.com>

* [chat] add mmmt dummy example and refactor experience sending (#17)

* [chat] add mmmt dummy example

* [chat] refactor naive strategy

* [chat] fix stuck problem

* [chat] fix naive strategy

* [chat] optimize experience maker sending logic

* [chat] refactor sending assignment

* [chat] refactor performance evaluator (#18)

* Prompt Example & requires_grad state_dict & sharding state_dict (#19)

* prompt example

* prompt load csv data

* remove legacy try

* maker models require_grad set to False

* working on zero redundancy update

* mmmt_prompt example; naive strategy requires_grad state_dict & sharding; maker model requires_no_grad.

* remove legacy examples

* remove legacy examples

* remove replay buffer tp state. bad design

---------
Co-authored-by: csric <richcsr256@gmail.com>

* state_dict sending adapts to new unwrap function (#20)

* prompt example

* prompt load csv data

* remove legacy try

* maker models require_grad set to False

* working on zero redundancy update

* mmmt_prompt example; naive strategy requires_grad state_dict & sharding; maker model requires_no_grad.

* remove legacy examples

* remove legacy examples

* remove replay buffer tp state. bad design

* opt benchmark

* better script

* nothing

* [chat] strategy refactor unwrap model

* [chat] strategy refactor save model

* [chat] add docstr

* [chat] refactor trainer save model

* [chat] fix strategy typing

* [chat] refactor trainer save model

* [chat] update readme

* [chat] fix unit test

* working on lora reconstruction

* state_dict sending adapts to new unwrap function

* remove comments

---------
Co-authored-by: csric <richcsr256@gmail.com>
Co-authored-by: ver217 <lhx0217@gmail.com>

* [chat-ray] add readme (#21)

* add readme

* transparent graph

* add note background

---------
Co-authored-by: csric <richcsr256@gmail.com>

* [chat] get images from url (#22)

* Refactor/chat ray (#23)

* [chat] lora add todo

* [chat] remove unused pipeline strategy

* [chat] refactor example structure

* [chat] setup ci for ray

* [chat-ray] Support LoRA trainer. LoRA weights reconstruction. (#24)

* lora support prototype

* lora support

* 1mmt lora & remove useless code

---------
Co-authored-by: csric <richcsr256@gmail.com>

* [chat] fix test ci for ray

* [chat] fix test ci requirements for ray

* [chat] fix ray runtime env

* [chat] fix ray runtime env

* [chat] fix example ci docker args

* [chat] add debug info in trainer

* [chat] add nccl debug info

* [chat] skip ray test

* [doc] fix typo

---------
Co-authored-by: csric <59389055+CsRic@users.noreply.github.com>
Co-authored-by: csric <richcsr256@gmail.com>


05 Jun, 2023 1 commit
- support evaluation for english (#3880) · 57a6d768
  Yuanchen authored Jun 05, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
30 May, 2023 1 commit

- [evaluation] improvement on evaluation (#3862) · 2506e275
  Yuanchen authored May 30, 2023
```
* fix a bug when the config file contains one category but the answer file doesn't contain that category

* fix Chinese prompt file

* support gpt-3.5-turbo and gpt-4 evaluation

* polish and update README

* resolve pr comments

---------
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```

25 May, 2023 1 commit

- [nfc] fix typo colossalai/ applications/ (#3831) · e2d81eba
  digger yu authored May 25, 2023
```
* fix typo colossalai/autochunk auto_parallel amp

* fix typo colossalai/auto_parallel nn utils etc.

* fix typo colossalai/auto_parallel autochunk fx/passes etc.

* fix typo docs/

* change placememt_policy to placement_policy in docs/ and examples/

* fix typo colossalai/ applications/
```

24 May, 2023 1 commit

- [evaluation] add automatic evaluation pipeline (#3821) · 34966378
  Yuanchen authored May 24, 2023
```
* add functions for gpt evaluation

* add automatic eval

Update eval.py

* using jload and modify the type of answers1 and answers2

* Update eval.py

Update eval.py

* Update evaluator.py

* support gpt evaluation

* update readme.md

update README.md

update README.md

modify readme.md

* add Chinese example for config, battle prompt and evaluation prompt file

* remove GPT-4 config

* remove sample folder

---------
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
Co-authored-by: Camille Zhong <44392324+Camille7777@users.noreply.github.com>
```

23 May, 2023 1 commit
- [NFC]fix typo colossalai/auto_parallel nn utils etc. (#3779) · 9265f2d4
  digger yu authored May 23, 2023
```
* fix typo colossalai/autochunk auto_parallel amp

* fix typo colossalai/auto_parallel nn utils etc.
```
22 May, 2023 1 commit
- [format] applied code formatting on changed files in pull request 3786 (#3787) · 62c7e67f
  github-actions[bot] authored May 22, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
19 May, 2023 1 commit
- [chat] add performance and tutorial (#3786) · ad2cf58f
  binmakeswell authored May 19, 2023
  
17 May, 2023 1 commit
- [chat] fix bugs in stage 3 training (#3759) · 05759839
  Yuanchen authored May 17, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
15 May, 2023 1 commit
- [NFC] fix typo applications/ and colossalai/ (#3735) · ad6460cf
  digger-yu authored May 15, 2023
  
10 May, 2023 2 commits

- [CI] fix some spelling errors (#3707) · b7141c36
  digger-yu authored May 10, 2023
```
* fix spelling error with examples/community/

* fix spelling error with tests/

* fix some spelling error with tests/ colossalai/ etc.
```

- [chat] fix community example ray (#3719) · f7361ee1
  MisterLin1995 authored May 10, 2023
```
Co-authored-by: jiangwen <zxl265370@antgroup.com>
```

06 May, 2023 1 commit
- [chat] fix train_prompts.py gemini strategy bug (#3666) · 2da5d81d
  zhang-yi-chi authored May 06, 2023
```
* fix gemini strategy bug

* add comment

* add comment

* better solution
```