- 20 Mar, 2023 1 commit
-
-
BlueRum authored
* add normalize function to value_head in bloom rm * add normalization to value_function in gpt_rm * add normalization to value_head of opt_rm * add Anthropic/hh-rlhf dataset * Update __init__.py * Add LogExpLoss in RM training * Update __init__.py * update rm trainer to use acc as target * update example/train_rm * Update train_rm.sh * code style * Update README.md * Update README.md * add rm test to ci * fix tokenier * fix typo * change batchsize to avoid oom in ci * Update test_ci.sh
-
- 17 Mar, 2023 1 commit
-
-
ver217 authored
* [chatgpt] fix generation early stopping * [chatgpt] fix train prompts example
-
- 14 Mar, 2023 1 commit
-
-
BlueRum authored
* [chatgpt]update ci * Update test_ci.sh * Update test_ci.sh * Update test_ci.sh * test * Update train_prompts.py * Update train_dummy.py * add save_path * polish * add save path * polish * add save path * polish * delete bloom-560m test delete bloom-560m test because of oom * add ddp test
-
- 13 Mar, 2023 1 commit
-
-
BlueRum authored
* fix train_dummy * fix train-prompts
-
- 07 Mar, 2023 5 commits
-
-
Fazzie-Maqianli authored
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
BlueRum authored
-
LuGY authored
* add checkpoint callback for chatgpt * add save ckpt callbacks for ppo --------- Co-authored-by:Fazzie-Maqianli <55798671+Fazziekey@users.noreply.github.com>
-
BlueRum authored
* fix lora bug * polish * fix lora gemini * fix inference laod model bug
-
- 03 Mar, 2023 2 commits
-
-
BlueRum authored
* fix lora bug * polish * fix lora gemini
-
ver217 authored
* [chatgpt] making experience support dp * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update example test ci * [chatgpt] update sampler * [chatgpt] update example test ci * [chatgpt] refactor sampler * [chatgpt] update example test ci
-
- 02 Mar, 2023 4 commits
-
-
BlueRum authored
* fix lora bug * polish
-
BlueRum authored
* [chatgpt] fix inference demo loading bug * polish
-
Fazzie-Maqianli authored
* Update README.md * fix README * Update README.md * Update README.md --------- Co-authored-by:
fastalgo <youyang@cs.berkeley.edu> Co-authored-by:
BlueRum <70618399+ht-zhou@users.noreply.github.com>
-
binmakeswell authored
-
- 01 Mar, 2023 1 commit
-
-
BlueRum authored
* [chatgpt] support inference example * Create inference.sh * Update README.md * Delete inference.sh * Update inference.py
-
- 22 Feb, 2023 2 commits
- 21 Feb, 2023 1 commit
-
-
BlueRum authored
* [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2 * [chatgpt]fix rm eval typo * fix rm eval * fix pre commit
-
- 17 Feb, 2023 1 commit
-
-
ver217 authored
* [chatgpt] startegy add prepare method * [chatgpt] refactor examples * [chatgpt] refactor strategy.prepare * [chatgpt] support save/load checkpoint * [chatgpt] fix unwrap actor * [chatgpt] fix unwrap actor
-
- 16 Feb, 2023 1 commit
-
-
BlueRum authored
* [chatgpt]fix train_rm bug with lora * [chatgpt]support colossalai strategy to train rm * fix pre-commit * fix pre-commit 2
-
- 15 Feb, 2023 1 commit
-
-
ver217 authored
* [chatgpt] ppo trainer use default generate args * [chatgpt] example remove generation preparing fn * [chatgpt] benchmark remove generation preparing fn * [chatgpt] fix ci
-
- 14 Feb, 2023 1 commit
-
-
ver217 authored
-