1. 22 Mar, 2023 3 commits
  2. 20 Mar, 2023 1 commit
    • BlueRum's avatar
      [chatgpt]Reward Model Training Process update (#3133) · 7548ca5a
      BlueRum authored
      * add normalize function to value_head in bloom rm
      
      * add normalization to value_function in gpt_rm
      
      * add normalization to value_head of opt_rm
      
      * add Anthropic/hh-rlhf dataset
      
      * Update __init__.py
      
      * Add LogExpLoss in RM training
      
      * Update __init__.py
      
      * update rm trainer to use acc as target
      
      * update example/train_rm
      
      * Update train_rm.sh
      
      * code style
      
      * Update README.md
      
      * Update README.md
      
      * add rm test to ci
      
      * fix tokenier
      
      * fix typo
      
      * change batchsize to avoid oom in ci
      
      * Update test_ci.sh
      7548ca5a
  3. 17 Mar, 2023 3 commits
  4. 14 Mar, 2023 1 commit
    • BlueRum's avatar
      [chatgpt]update ci (#3087) · 23cd5e2c
      BlueRum authored
      * [chatgpt]update ci
      
      * Update test_ci.sh
      
      * Update test_ci.sh
      
      * Update test_ci.sh
      
      * test
      
      * Update train_prompts.py
      
      * Update train_dummy.py
      
      * add save_path
      
      * polish
      
      * add save path
      
      * polish
      
      * add save path
      
      * polish
      
      * delete bloom-560m test
      
      delete bloom-560m test because of oom
      
      * add ddp test
      23cd5e2c
  5. 13 Mar, 2023 2 commits
  6. 12 Mar, 2023 1 commit
  7. 10 Mar, 2023 2 commits
  8. 08 Mar, 2023 1 commit
  9. 07 Mar, 2023 5 commits
  10. 03 Mar, 2023 3 commits
  11. 02 Mar, 2023 4 commits
  12. 01 Mar, 2023 1 commit
  13. 28 Feb, 2023 1 commit
  14. 22 Feb, 2023 2 commits
  15. 21 Feb, 2023 1 commit
    • BlueRum's avatar
      [chatgpt] fix rm eval (#2829) · 3eebc4df
      BlueRum authored
      * [chatgpt]fix train_rm bug with lora
      
      * [chatgpt]support colossalai strategy to train rm
      
      * fix pre-commit
      
      * fix pre-commit 2
      
      * [chatgpt]fix rm eval typo
      
      * fix rm eval
      
      * fix pre commit
      3eebc4df
  16. 20 Feb, 2023 1 commit
  17. 17 Feb, 2023 2 commits
    • ver217's avatar
      [chatgpt] update readme about checkpoint (#2792) · a619a190
      ver217 authored
      * [chatgpt] add save/load checkpoint sample code
      
      * [chatgpt] add save/load checkpoint readme
      
      * [chatgpt] refactor save/load checkpoint readme
      a619a190
    • ver217's avatar
      [chatgpt] startegy add prepare method (#2766) · 4ee311c0
      ver217 authored
      * [chatgpt] startegy add prepare method
      
      * [chatgpt] refactor examples
      
      * [chatgpt] refactor strategy.prepare
      
      * [chatgpt] support save/load checkpoint
      
      * [chatgpt] fix unwrap actor
      
      * [chatgpt] fix unwrap actor
      4ee311c0
  18. 16 Feb, 2023 3 commits
  19. 15 Feb, 2023 3 commits