Commits · 268b3cd80d106c2b700156b1993675c7421abd15 · OpenDAS / ColossalAI

28 Apr, 2023 1 commit

[chat] set default zero2 strategy (#3667) · 268b3cd8

binmakeswell authored Apr 28, 2023

* [chat] set default gemini strategy

* [chat] set default zero2 strategy

* [chat] set default zero2 strategy

268b3cd8

27 Apr, 2023 1 commit

[chat] refactor model save/load logic (#3654) · 842768a1

Hongxin Liu authored Apr 27, 2023

* [chat] strategy refactor unwrap model

* [chat] strategy refactor save model

* [chat] add docstr

* [chat] refactor trainer save model

* [chat] fix strategy typing

* [chat] refactor trainer save model

* [chat] update readme

* [chat] fix unit test

842768a1

18 Apr, 2023 1 commit
- reconstruct chat trainer and fix training script (#3588) · 1ec0d386
  Yuanchen authored Apr 18, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  1ec0d386
03 Apr, 2023 1 commit

[chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223) · 30412866

Camille Zhong authored Apr 03, 2023

* Add RoBERTa for RLHF Stage 2 & 3 (test)

RoBERTa for RLHF Stage 2 & 3 (still in testing)

* Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"

This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.

* Add RoBERTa for RLHF stage 2 & 3

1. add roberta folder under model folder
2. add  roberta option in train_reward_model.py
3. add some test in testci

* add test for reward model training

* Update test_ci.sh

* Revert "Update test_ci.sh"

This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.

* Add RoBERTa for RLHF Stage 2 & 3 (test)

RoBERTa for RLHF Stage 2 & 3 (still in testing)

* Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"

This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.

* Add RoBERTa for RLHF stage 2 & 3

1. add roberta folder under model folder
2. add  roberta option in train_reward_model.py
3. add some test in testci

* Update test_ci.sh

* Revert "Update test_ci.sh"

This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.

* update roberta with coati

30412866

28 Mar, 2023 1 commit
- [Coati] first commit (#3283) · b0ce5a10
  Fazzie-Maqianli authored Mar 28, 2023
  
  b0ce5a10
22 Mar, 2023 1 commit
- [chatgpt]add reward model code for deberta (#3199) · 9998d5ef
  Yuanchen authored Mar 22, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  9998d5ef
20 Mar, 2023 1 commit

[chatgpt]Reward Model Training Process update (#3133) · 7548ca5a

BlueRum authored Mar 20, 2023

* add normalize function to value_head in bloom rm

* add normalization to value_function in gpt_rm

* add normalization to value_head of opt_rm

* add Anthropic/hh-rlhf dataset

* Update __init__.py

* Add LogExpLoss in RM training

* Update __init__.py

* update rm trainer to use acc as target

* update example/train_rm

* Update train_rm.sh

* code style

* Update README.md

* Update README.md

* add rm test to ci

* fix tokenier

* fix typo

* change batchsize to avoid oom in ci

* Update test_ci.sh

7548ca5a

07 Mar, 2023 1 commit
- change nn to models (#3032) · c21b11ed
  Fazzie-Maqianli authored Mar 07, 2023
  
  c21b11ed
03 Mar, 2023 1 commit
- [chatgpt] fix lora gemini conflict in RM training (#2984) · f5ca0397
  BlueRum authored Mar 03, 2023
```
* fix lora bug

* polish

* fix lora gemini
```
  f5ca0397
02 Mar, 2023 1 commit
- [chatgpt]fix lora bug (#2974) · c9e27f0d
  BlueRum authored Mar 02, 2023
```
* fix lora bug

* polish
```
  c9e27f0d
22 Feb, 2023 1 commit
- [chatgpt]support opt & gpt for rm training (#2876) · 2e16f842
  BlueRum authored Feb 22, 2023
  
  2e16f842
21 Feb, 2023 1 commit

[chatgpt] fix rm eval (#2829) · 3eebc4df

BlueRum authored Feb 21, 2023

* [chatgpt]fix train_rm bug with lora

* [chatgpt]support colossalai strategy to train rm

* fix pre-commit

* fix pre-commit 2

* [chatgpt]fix rm eval typo

* fix rm eval

* fix pre commit

3eebc4df

16 Feb, 2023 1 commit

[chatgpt] support colossalai strategy to train rm (#2742) · 613efebc

BlueRum authored Feb 16, 2023

* [chatgpt]fix train_rm bug with lora

* [chatgpt]support colossalai strategy to train rm

* fix pre-commit

* fix pre-commit 2

613efebc

14 Feb, 2023 1 commit
- [app] add chatgpt application (#2698) · 1b347010
  ver217 authored Feb 14, 2023
  
  1b347010