- 14 Aug, 2023 1 commit
-
-
Wenhao Chen authored
* style: apply formatter * fix: add outdated warnings * docs: add dataset format and polish * docs: polish README * fix: fix json format * fix: fix typos * revert: revert 7b example
-
- 28 Jul, 2023 1 commit
-
-
Yuanchen authored
Co-authored-by:Yuanchen Xu <yuanchen.xu00@gmail.com>
-
- 29 Jun, 2023 2 commits
-
-
Wenhao Chen authored
* feat: remove on_learn_epoch fn as not used * revert: add _on_learn_epoch fn * to: remove the use of NaiveStrategy * test: remove NaiveStrategy tests * feat: remove NaiveStrategy * style: modify comments and params * feat: split ColossalAIStrategy into LowLevelZeroStrategy and GeminiStrategy * fix: remove naive * fix: align with modified colossal strategy * fix: fix ddp _try_init_dist arg
-
Wenhao Chen authored
* to: add SLTrainer * refactor: refactor RMTrainer and SFTTrainer * fix: fix init file * feat: remove on_learn_epoch fn as not used * fix: align with modified gemini arguments * to: add OnPolicyTrainer * revert: add _on_learn_epoch fn * refactor: refactor PPOTrainer * style: rename PPOTrainer argument * fix: align with modified PPO arguments * test: align with modified train_prompts arguments * chore: modify train_prompts * docs: align with modified arguments * fix: remove unnecessary output * fix: move dataloader to fit fn of SLTrainer * fix: move dataloader to fit fn of OnPolicyTrainer * fix: modify usage of prompt and pretrain dataloader
-
- 19 May, 2023 1 commit
-
-
binmakeswell authored
-
- 17 May, 2023 1 commit
-
-
Yuanchen authored
Co-authored-by:Yuanchen Xu <yuanchen.xu00@gmail.com>
-
- 06 May, 2023 1 commit
-
-
digger-yu authored
* fix spelling error with examples/comminity/ * fix spelling error with example/
-
- 05 May, 2023 2 commits
-
-
Camille Zhong authored
* Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. update roberta with coati chat ci update Revert "chat ci update" This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846. * Update README.md Update README.md * update readme * Update test_ci.sh * update readme and add a script update readme and add a script modify readme Update README.md
-
digger-yu authored
* Update README.md change "huggingaface" to "huggingface" * Update README.md change "Colossa-AI" to "Colossal-AI"
-
- 28 Apr, 2023 2 commits
-
-
tanitna authored
-
binmakeswell authored
* [chat] set default gemini strategy * [chat] set default zero2 strategy * [chat] set default zero2 strategy
-
- 27 Apr, 2023 2 commits
-
-
Hongxin Liu authored
* [chat] strategy refactor unwrap model * [chat] strategy refactor save model * [chat] add docstr * [chat] refactor trainer save model * [chat] fix strategy typing * [chat] refactor trainer save model * [chat] update readme * [chat] fix unit test
-
Camille Zhong authored
* Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. Add RoBERTa for RLHF Stage 2 & 3 (test) RoBERTa for RLHF Stage 2 & 3 (still in testing) Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)" This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368. Add RoBERTa for RLHF stage 2 & 3 1. add roberta folder under model folder 2. add roberta option in train_reward_model.py 3. add some test in testci Update test_ci.sh Revert "Update test_ci.sh" This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a. update roberta with coati chat ci update Revert "chat ci update" This reverts commit 17ae7ae01fa752bd3289fc39069868fde99cf846. * Update README.md Update README.md * update readme * Update test_ci.sh
-
- 20 Apr, 2023 1 commit
-
-
digger-yu authored
-
- 17 Apr, 2023 1 commit
-
-
Fazzie-Maqianli authored
-
- 06 Apr, 2023 1 commit
-
-
kingkingofall authored
* fix stage 2 fix stage 2 * add torch
-
- 29 Mar, 2023 1 commit
-
-
BlueRum authored
* polish train_prompts * polish readme
-
- 28 Mar, 2023 3 commits
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
BlueRum authored
* Update README.md * Update README.md * Update README.md * update example readme
-
Fazzie-Maqianli authored
-
- 24 Mar, 2023 2 commits
-
-
binmakeswell authored
* [doc] fix typo * [doc] fix typo
-
Camille Zhong authored
#issue 3189
-
- 20 Mar, 2023 1 commit
-
-
BlueRum authored
* add normalize function to value_head in bloom rm * add normalization to value_function in gpt_rm * add normalization to value_head of opt_rm * add Anthropic/hh-rlhf dataset * Update __init__.py * Add LogExpLoss in RM training * Update __init__.py * update rm trainer to use acc as target * update example/train_rm * Update train_rm.sh * code style * Update README.md * Update README.md * add rm test to ci * fix tokenier * fix typo * change batchsize to avoid oom in ci * Update test_ci.sh
-
- 07 Mar, 2023 3 commits
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
BlueRum authored
-
BlueRum authored
* fix lora bug * polish * fix lora gemini * fix inference laod model bug
-
- 02 Mar, 2023 2 commits
-
-
Fazzie-Maqianli authored
* Update README.md * fix README * Update README.md * Update README.md --------- Co-authored-by:
fastalgo <youyang@cs.berkeley.edu> Co-authored-by:
BlueRum <70618399+ht-zhou@users.noreply.github.com>
-
binmakeswell authored
-
- 01 Mar, 2023 1 commit
-
-
BlueRum authored
* [chatgpt] support inference example * Create inference.sh * Update README.md * Delete inference.sh * Update inference.py
-
- 14 Feb, 2023 1 commit
-
-
ver217 authored
-