Commits · f8288315d9dcf05acffb4d9e5883f3b317f0191d · OpenDAS / ColossalAI

18 Apr, 2023 1 commit
- reconstruct chat trainer and fix training script (#3588) · 1ec0d386
  Yuanchen authored Apr 18, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  1ec0d386
03 Apr, 2023 1 commit

[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223) · 30412866

Camille Zhong authored Apr 03, 2023

* Add RoBERTa for RLHF Stage 2 & 3 (test)

RoBERTa for RLHF Stage 2 & 3 (still in testing)

* Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"

This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.

* Add RoBERTa for RLHF stage 2 & 3

1. add roberta folder under model folder
2. add  roberta option in train_reward_model.py
3. add some test in testci

* add test for reward model training

* Update test_ci.sh

* Revert "Update test_ci.sh"

This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.

* Add RoBERTa for RLHF Stage 2 & 3 (test)

RoBERTa for RLHF Stage 2 & 3 (still in testing)

* Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"

This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.

* Add RoBERTa for RLHF stage 2 & 3

1. add roberta folder under model folder
2. add  roberta option in train_reward_model.py
3. add some test in testci

* Update test_ci.sh

* Revert "Update test_ci.sh"

This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.

* update roberta with coati

30412866

28 Mar, 2023 1 commit
- [Coati] first commit (#3283) · b0ce5a10
  Fazzie-Maqianli authored Mar 28, 2023
  
  b0ce5a10
14 Mar, 2023 1 commit

[chatgpt]update ci (#3087) · 23cd5e2c

BlueRum authored Mar 14, 2023

* [chatgpt]update ci

* Update test_ci.sh

* Update test_ci.sh

* Update test_ci.sh

* test

* Update train_prompts.py

* Update train_dummy.py

* add save_path

* polish

* add save path

* polish

* add save path

* polish

* delete bloom-560m test

delete bloom-560m test because of oom

* add ddp test

23cd5e2c

13 Mar, 2023 1 commit
- [chatgpt]Fix examples (#3116) · 68577fbc
  BlueRum authored Mar 13, 2023
```
* fix train_dummy

* fix train-prompts
```
  68577fbc
07 Mar, 2023 2 commits
- change nn to models (#3032) · c21b11ed
  Fazzie-Maqianli authored Mar 07, 2023
  
  c21b11ed
- [chatgpt] Add saving ckpt callback for PPO (#2880) · 287d6049
  LuGY authored Mar 07, 2023
```
* add checkpoint callback for chatgpt

* add save ckpt callbacks for ppo

---------
Co-authored-by: Fazzie-Maqianli <55798671+Fazziekey@users.noreply.github.com>
```
  287d6049
03 Mar, 2023 1 commit

[chatgpt] making experience support dp (#2971) · 19ad49fb

ver217 authored Mar 03, 2023

* [chatgpt] making experience support dp

* [chatgpt] update example test ci

* [chatgpt] update example test ci

* [chatgpt] update example test ci

* [chatgpt] update example test ci

* [chatgpt] update sampler

* [chatgpt] update example test ci

* [chatgpt] refactor sampler

* [chatgpt] update example test ci

19ad49fb

22 Feb, 2023 1 commit

[chatgpt] Support saving ckpt in examples (#2846) · 34ca324b

BlueRum authored Feb 22, 2023

* [chatgpt]fix train_rm bug with lora

* [chatgpt]support colossalai strategy to train rm

* fix pre-commit

* fix pre-commit 2

* [chatgpt]fix rm eval typo

* fix rm eval

* fix pre commit

* add support of saving ckpt in examples

* fix single-gpu save

34ca324b

17 Feb, 2023 1 commit

[chatgpt] startegy add prepare method (#2766) · 4ee311c0

ver217 authored Feb 17, 2023

* [chatgpt] startegy add prepare method

* [chatgpt] refactor examples

* [chatgpt] refactor strategy.prepare

* [chatgpt] support save/load checkpoint

* [chatgpt] fix unwrap actor

* [chatgpt] fix unwrap actor

4ee311c0

15 Feb, 2023 1 commit

[chatgpt] optimize generation kwargs (#2717) · 9c0943ec

ver217 authored Feb 15, 2023

* [chatgpt] ppo trainer use default generate args

* [chatgpt] example remove generation preparing fn

* [chatgpt] benchmark remove generation preparing fn

* [chatgpt] fix ci

9c0943ec

14 Feb, 2023 1 commit
- [app] add chatgpt application (#2698) · 1b347010
  ver217 authored Feb 14, 2023
  
  1b347010