Commits · b0ce5a10326912961f0bc07cbbd250bab7b9c399 · OpenDAS / ColossalAI

28 Mar, 2023 1 commit
- [Coati] first commit (#3283) · b0ce5a10
  Fazzie-Maqianli authored Mar 28, 2023
  
  b0ce5a10
22 Mar, 2023 1 commit
- [chatgpt]add reward model code for deberta (#3199) · 9998d5ef
  Yuanchen authored Mar 22, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  9998d5ef
20 Mar, 2023 1 commit

[chatgpt]Reward Model Training Process update (#3133) · 7548ca5a

BlueRum authored Mar 20, 2023

* add normalize function to value_head in bloom rm

* add normalization to value_function in gpt_rm

* add normalization to value_head of opt_rm

* add Anthropic/hh-rlhf dataset

* Update __init__.py

* Add LogExpLoss in RM training

* Update __init__.py

* update rm trainer to use acc as target

* update example/train_rm

* Update train_rm.sh

* code style

* Update README.md

* Update README.md

* add rm test to ci

* fix tokenier

* fix typo

* change batchsize to avoid oom in ci

* Update test_ci.sh

7548ca5a

07 Mar, 2023 1 commit
- change nn to models (#3032) · c21b11ed
  Fazzie-Maqianli authored Mar 07, 2023
  
  c21b11ed
03 Mar, 2023 1 commit
- [chatgpt] fix lora gemini conflict in RM training (#2984) · f5ca0397
  BlueRum authored Mar 03, 2023
```
* fix lora bug

* polish

* fix lora gemini
```
  f5ca0397
02 Mar, 2023 1 commit
- [chatgpt]fix lora bug (#2974) · c9e27f0d
  BlueRum authored Mar 02, 2023
```
* fix lora bug

* polish
```
  c9e27f0d
22 Feb, 2023 1 commit
- [chatgpt]support opt & gpt for rm training (#2876) · 2e16f842
  BlueRum authored Feb 22, 2023
  
  2e16f842
21 Feb, 2023 1 commit

[chatgpt] fix rm eval (#2829) · 3eebc4df

BlueRum authored Feb 21, 2023

* [chatgpt]fix train_rm bug with lora

* [chatgpt]support colossalai strategy to train rm

* fix pre-commit

* fix pre-commit 2

* [chatgpt]fix rm eval typo

* fix rm eval

* fix pre commit

3eebc4df

16 Feb, 2023 1 commit

[chatgpt] support colossalai strategy to train rm (#2742) · 613efebc

BlueRum authored Feb 16, 2023

* [chatgpt]fix train_rm bug with lora

* [chatgpt]support colossalai strategy to train rm

* fix pre-commit

* fix pre-commit 2

613efebc

14 Feb, 2023 1 commit
- [app] add chatgpt application (#2698) · 1b347010
  ver217 authored Feb 14, 2023
  
  1b347010