- 22 Mar, 2023 2 commits
-
-
Fazzie-Maqianli authored
-
pgzhang authored
* [chatgpt] add supervised fine-tune code * [chatgpt] delete unused code and modified comment code * [chatgpt] use pytorch distributed sampler instead --------- Co-authored-by:zhangpengpeng <zhangpengpeng@joyy.com>
-
- 20 Mar, 2023 1 commit
-
-
BlueRum authored
* add normalize function to value_head in bloom rm * add normalization to value_function in gpt_rm * add normalization to value_head of opt_rm * add Anthropic/hh-rlhf dataset * Update __init__.py * Add LogExpLoss in RM training * Update __init__.py * update rm trainer to use acc as target * update example/train_rm * Update train_rm.sh * code style * Update README.md * Update README.md * add rm test to ci * fix tokenier * fix typo * change batchsize to avoid oom in ci * Update test_ci.sh
-
- 17 Mar, 2023 1 commit
-
-
ver217 authored
* [chatgpt] fix generation early stopping * [chatgpt] fix train prompts example
-
- 13 Mar, 2023 1 commit
-
-
BlueRum authored
* fix gpt-actor * fix gpt-critic * fix opt-critic
-
- 12 Mar, 2023 1 commit
-
-
hiko2MSP authored
-
- 10 Mar, 2023 2 commits
-
-
BlueRum authored
* fix colo-stratergy * polish * fix lora * fix ddp * polish * polish
-
Fazzie-Maqianli authored
-
- 08 Mar, 2023 1 commit
-
-
wenjunyang authored
* fix Critic * fix Critic * fix Critic * fix neglect of attention mask * fix neglect of attention mask * fix neglect of attention mask * add return --------- Co-authored-by:
yangwenjun <yangwenjun@soyoung.com> Co-authored-by:
yangwjd <yangwjd@chanjet.com>
-
- 07 Mar, 2023 1 commit
-
-
Fazzie-Maqianli authored
-