1. 06 Apr, 2023 4 commits
    • [Chat] Add Peft support & fix the ptx bug (#3433) · 62f4e2eb
      YY Lin authored
      * Update ppo.py
      
Fix a bug where the wrong batch data was fetched
      
      * Add peft model support in SFT and Prompts training
      
In stage 1 and stage 3, PEFT model support is added, so the trained artifacts are only small LoRA adapter files instead of the full set of model files.
      
      * Delete test_prompts.txt
      
      * Delete test_pretrained.txt
      
* Move the PEFT code to a community folder.

* Move the SFT demo to the community folder.

* Delete dirty files

* Add instructions to install peft from source

* Remove Chinese comments

* Remove the Chinese comments
      62f4e2eb
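The LoRA-only artifact described in the commit above can be illustrated with a minimal sketch. This is an assumption-laden illustration, not the PR's code: peft itself handles this via `save_pretrained`, and the `lora_` key convention below is just the naming peft-style adapters commonly use.

```python
def lora_state_dict(state_dict):
    """Keep only LoRA adapter tensors from a model state dict.

    The frozen base-model weights are unchanged by LoRA training, so
    they can be reloaded from the original checkpoint; only the small
    adapter matrices need to be saved as the training artifact.
    """
    return {k: v for k, v in state_dict.items() if "lora_" in k}
```

For a large model this typically shrinks the saved artifact from many gigabytes of full weights down to adapter files in the tens of megabytes.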
    • [chat] fix save_model (#3377) · 73afb635
      Dr-Corgi authored
      The function save_model should be a part of PPOTrainer.
      73afb635
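The intent of the fix above can be sketched in miniature (class shape and names are assumptions, not the PR's code): checkpoint saving becomes a method on the trainer, which owns the wrapped actor model, rather than a free-standing function.

```python
class PPOTrainer:
    """Minimal stand-in for the chat PPO trainer."""

    def __init__(self, actor):
        self.actor = actor  # the policy model whose weights get saved

    def save_model(self, path):
        # Saving is a trainer responsibility: the trainer owns the
        # wrapped actor module and knows which weights to persist.
        # A real implementation would call torch.save / save_pretrained.
        return f"{path}: saved weights of {self.actor!r}"
```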
    • [chat] fix readme (#3429) · 57a3c4db
      kingkingofall authored
      * fix stage 2
      
      fix stage 2
      
      * add torch
      57a3c4db
    • [Chat] fix the tokenizer "int too big to convert" error in SFT training (#3453) · 72cb4dd4
      Camille Zhong authored
      * Add RoBERTa for RLHF Stage 2 & 3 (test)
      
      RoBERTa for RLHF Stage 2 & 3 (still in testing)
      
      * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"
      
      This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.
      
      * Add RoBERTa for RLHF stage 2 & 3
      
1. add a roberta folder under the model folder
2. add a roberta option in train_reward_model.py
3. add tests to the CI
      
      * Update test_ci.sh
      
      * Revert "Update test_ci.sh"
      
      This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.
      
      * update roberta with coati
      
      * chat ci update
      
      * Revert "chat ci update"
      
      This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846.
      
      * [Chat] fix the tokenizer "int too big to convert" error in SFT training
      
Fix the tokenizer error during SFT training with Bloom and OPT
      72cb4dd4
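A common cause of this error class (an assumption here; the log does not spell out the root cause) is that some tokenizers, including Bloom's, report `model_max_length` as a huge sentinel integer (transformers uses `int(1e30)` when no real limit is known), and passing that value into tensor construction raises "int too big to convert". A minimal sketch of capping it before use:

```python
def effective_max_length(model_max_length, cap=512):
    # Tokenizers without a real length limit report a huge sentinel
    # value for model_max_length; clamp it to a practical sequence
    # length before it reaches any tensor/C-int conversion.
    return min(model_max_length, cap)
```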
  2. 05 Apr, 2023 1 commit
  3. 04 Apr, 2023 3 commits
  4. 03 Apr, 2023 1 commit
    • [chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223) · 30412866
      Camille Zhong authored
      * Add RoBERTa for RLHF Stage 2 & 3 (test)
      
      RoBERTa for RLHF Stage 2 & 3 (still in testing)
      
      * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"
      
      This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.
      
      * Add RoBERTa for RLHF stage 2 & 3
      
1. add a roberta folder under the model folder
2. add a roberta option in train_reward_model.py
3. add tests to the CI
      
      * add test for reward model training
      
      * Update test_ci.sh
      
      * Revert "Update test_ci.sh"
      
      This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.
      
      * update roberta with coati
      30412866
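The RoBERTa reward model added above follows the usual pattern: an encoder topped by a scalar "value head". A minimal, dependency-free sketch of that head's computation (names and shapes are assumptions for illustration; the real model uses a torch `nn.Linear` over RoBERTa's hidden states):

```python
def value_head(hidden_state, weight, bias=0.0):
    # A reward model maps the encoder's final hidden state for a
    # sequence to a single scalar reward via one linear projection.
    return sum(h * w for h, w in zip(hidden_state, weight)) + bias
```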
  5. 30 Mar, 2023 1 commit
  6. 29 Mar, 2023 6 commits
  7. 28 Mar, 2023 12 commits
  8. 24 Mar, 2023 4 commits
  9. 23 Mar, 2023 3 commits
  10. 22 Mar, 2023 3 commits
  11. 20 Mar, 2023 1 commit
    • [chatgpt] Reward Model Training Process update (#3133) · 7548ca5a
      BlueRum authored
      * add normalize function to value_head in bloom rm
      
      * add normalization to value_function in gpt_rm
      
      * add normalization to value_head of opt_rm
      
      * add Anthropic/hh-rlhf dataset
      
      * Update __init__.py
      
      * Add LogExpLoss in RM training
      
      * Update __init__.py
      
* update RM trainer to use accuracy as the target metric
      
      * update example/train_rm
      
      * Update train_rm.sh
      
      * code style
      
      * Update README.md
      
      * Update README.md
      
      * add rm test to ci
      
* fix tokenizer

* fix typo

* change batch size to avoid OOM in CI
      
      * Update test_ci.sh
      7548ca5a
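The LogExpLoss added in the commit above is the standard pairwise ranking loss for reward-model training; a minimal plain-Python sketch (function name assumed, real code would operate on torch tensors):

```python
import math

def log_exp_loss(chosen_reward, rejected_reward):
    # Pairwise ranking loss: -log(sigmoid(chosen - rejected))
    #                      =  log(1 + exp(rejected - chosen)).
    # Minimizing it pushes the chosen response's reward above the
    # rejected one's; the fraction of pairs with chosen > rejected
    # is the accuracy target mentioned in the commits above.
    return math.log1p(math.exp(rejected_reward - chosen_reward))
```

At equal rewards the loss is log 2; it decays toward 0 as the margin between chosen and rejected grows.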
  12. 17 Mar, 2023 1 commit