Commits · 6d41c3f2aa7c859fe2b87889e6b02b4febbfa4f6 · OpenDAS / ColossalAI

14 Aug, 2023 1 commit

[doc] update Coati README (#4405) · 6d41c3f2

Wenhao Chen authored Aug 14, 2023

* style: apply formatter

* fix: add outdated warnings

* docs: add dataset format and polish

* docs: polish README

* fix: fix json format

* fix: fix typos

* revert: revert 7b example

28 Jul, 2023 1 commit
- support session-based training (#4313) · 5187c96b
  Yuanchen authored Jul 28, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
29 Jun, 2023 2 commits

[chat] remove naive strategy and split colossalai strategy (#4094) · edd75a59

Wenhao Chen authored Jun 29, 2023

* feat: remove on_learn_epoch fn as not used

* revert: add _on_learn_epoch fn

* to: remove the use of NaiveStrategy

* test: remove NaiveStrategy tests

* feat: remove NaiveStrategy

* style: modify comments and params

* feat: split ColossalAIStrategy into LowLevelZeroStrategy and GeminiStrategy

* fix: remove naive

* fix: align with modified colossal strategy

* fix: fix ddp _try_init_dist arg

[chat] refactor trainer class (#4080) · b03d64d0

Wenhao Chen authored Jun 29, 2023

* to: add SLTrainer

* refactor: refactor RMTrainer and SFTTrainer

* fix: fix init file

* feat: remove on_learn_epoch fn as not used

* fix: align with modified gemini arguments

* to: add OnPolicyTrainer

* revert: add _on_learn_epoch fn

* refactor: refactor PPOTrainer

* style: rename PPOTrainer argument

* fix: align with modified PPO arguments

* test: align with modified train_prompts arguments

* chore: modify train_prompts

* docs: align with modified arguments

* fix: remove unnecessary output

* fix: move dataloader to fit fn of SLTrainer

* fix: move dataloader to fit fn of OnPolicyTrainer

* fix: modify usage of prompt and pretrain dataloader

19 May, 2023 1 commit
- [chat] add performance and tutorial (#3786) · ad2cf58f
  binmakeswell authored May 19, 2023
  
17 May, 2023 1 commit
- [chat] fix bugs in stage 3 training (#3759) · 05759839
  Yuanchen authored May 17, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
06 May, 2023 1 commit
- fix some spelling error with applications/Chat/examples/ (#3692) · 65bdc315
  digger-yu authored May 06, 2023
```
* fix spelling error with examples/community/

* fix spelling error with example/
```
05 May, 2023 2 commits

[chat] PPO stage3 doc enhancement (#3679) · 0f785cb1

Camille Zhong authored May 05, 2023

* Add RoBERTa for RLHF Stage 2 & 3 (test)

RoBERTa for RLHF Stage 2 & 3 (still in testing)

Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"

This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.

Add RoBERTa for RLHF stage 2 & 3

1. add roberta folder under model folder
2. add roberta option in train_reward_model.py
3. add some test in testci

Update test_ci.sh

Revert "Update test_ci.sh"

This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.

update roberta with coati

chat ci update

Revert "chat ci update"

This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846.

* Update README.md

Update README.md

* update readme

* Update test_ci.sh

* update readme and add a script

update readme and add a script

modify readme

Update README.md

[doc] fix chat spelling error (#3671) · 6650daeb

digger-yu authored May 05, 2023

* Update README.md

change "huggingaface" to "huggingface"

* Update README.md

change "Colossa-AI" to "Colossal-AI"

28 Apr, 2023 2 commits
- [chat] typo accimulation_steps -> accumulation_steps (#3662) · 1a60dc07
  tanitna authored Apr 28, 2023
  
- [chat] set default zero2 strategy (#3667) · 268b3cd8
  binmakeswell authored Apr 28, 2023
```
* [chat] set default gemini strategy

* [chat] set default zero2 strategy

* [chat] set default zero2 strategy
```
27 Apr, 2023 2 commits

[chat] refactor model save/load logic (#3654) · 842768a1

Hongxin Liu authored Apr 27, 2023

* [chat] strategy refactor unwrap model

* [chat] strategy refactor save model

* [chat] add docstr

* [chat] refactor trainer save model

* [chat] fix strategy typing

* [chat] refactor trainer save model

* [chat] update readme

* [chat] fix unit test

[Doc] enhancement on README.md for chat examples (#3646) · 8bccb72c

Camille Zhong authored Apr 27, 2023

* Add RoBERTa for RLHF Stage 2 & 3 (test)

RoBERTa for RLHF Stage 2 & 3 (still in testing)

Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"

This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.

Add RoBERTa for RLHF stage 2 & 3

1. add roberta folder under model folder
2. add roberta option in train_reward_model.py
3. add some test in testci

Update test_ci.sh

Revert "Update test_ci.sh"

This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.

update roberta with coati

chat ci update

Revert "chat ci update"

This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846.

* Update README.md

Update README.md

* update readme

* Update test_ci.sh

20 Apr, 2023 1 commit
- [chat] polish code note typo (#3612) · d7bf2847
  digger-yu authored Apr 20, 2023
  
17 Apr, 2023 1 commit
- [coati] add custom model support guide (#3579) · 6b1a39b1
  Fazzie-Maqianli authored Apr 17, 2023
  
06 Apr, 2023 1 commit
- [chat]fix readme (#3429) · 57a3c4db
  kingkingofall authored Apr 06, 2023
```
* fix stage 2

fix stage 2

* add torch
```
29 Mar, 2023 1 commit
- [chat]polish prompts training (#3300) · 8257e105
  BlueRum authored Mar 29, 2023
```
* polish train_prompts

* polish readme
```
28 Mar, 2023 3 commits
- [format] applied code formatting on changed files in pull request 3296 (#3298) · 5134ad5d
  github-actions[bot] authored Mar 29, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
- [chat]Update Readme (#3296) · c8b723d6
  BlueRum authored Mar 29, 2023
```
* Update README.md

* Update README.md

* Update README.md

* update example readme
```
- [Coati] first commit (#3283) · b0ce5a10
  Fazzie-Maqianli authored Mar 28, 2023
  
24 Mar, 2023 2 commits
- [doc] fix typo (#3222) · d32ef94a
  binmakeswell authored Mar 24, 2023
```
* [doc] fix typo

* [doc] fix typo
```
- [doc] update chatgpt doc paper link (#3229) · 9bc702ab
  Camille Zhong authored Mar 24, 2023
```
#issue 3189
```
20 Mar, 2023 1 commit

[chatgpt]Reward Model Training Process update (#3133) · 7548ca5a

BlueRum authored Mar 20, 2023

* add normalize function to value_head in bloom rm

* add normalization to value_function in gpt_rm

* add normalization to value_head of opt_rm

* add Anthropic/hh-rlhf dataset

* Update __init__.py

* Add LogExpLoss in RM training

* Update __init__.py

* update rm trainer to use acc as target

* update example/train_rm

* Update train_rm.sh

* code style

* Update README.md

* Update README.md

* add rm test to ci

* fix tokenizer

* fix typo

* change batchsize to avoid oom in ci

* Update test_ci.sh

07 Mar, 2023 3 commits
- [format] applied code formatting on changed files in pull request 3025 (#3026) · e86d9bb2
  github-actions[bot] authored Mar 07, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
- [chatgpt] fix readme (#3025) · 55dcd305
  BlueRum authored Mar 07, 2023
  
- [chatgpt]fix inference model load (#2988) · e5887034
  BlueRum authored Mar 07, 2023
```
* fix lora bug

* polish

* fix lora gemini

* fix inference load model bug
```
02 Mar, 2023 2 commits

[ChatGPT] fix README (#2966) · bbf9c827

Fazzie-Maqianli authored Mar 02, 2023



* Update README.md

* fix README

* Update README.md

* Update README.md

---------
Co-authored-by: fastalgo <youyang@cs.berkeley.edu>
Co-authored-by: BlueRum <70618399+ht-zhou@users.noreply.github.com>

[doc] fix chatgpt inference typo (#2964) · b0a87663

binmakeswell authored Mar 02, 2023

01 Mar, 2023 1 commit

[chatgpt]add inference example (#2944) · 489a9566

BlueRum authored Mar 01, 2023

* [chatgpt] support inference example

* Create inference.sh

* Update README.md

* Delete inference.sh

* Update inference.py

14 Feb, 2023 1 commit
- [app] add chatgpt application (#2698) · 1b347010
  ver217 authored Feb 14, 2023
  