1. 20 Mar, 2023 2 commits
    • Saurav Maheshkar's avatar
      [refactor] update docs (#3174) · 20d1c994
      Saurav Maheshkar authored
      * refactor: README-zh-Hans
      
      * refactor: REFERENCE
      
      * docs: update paths in README
      20d1c994
    • BlueRum's avatar
      [chatgpt]Reward Model Training Process update (#3133) · 7548ca5a
      BlueRum authored
      * add normalize function to value_head in bloom rm
      
      * add normalization to value_function in gpt_rm
      
      * add normalization to value_head of opt_rm
      
      * add Anthropic/hh-rlhf dataset
      
      * Update __init__.py
      
      * Add LogExpLoss in RM training
      
      * Update __init__.py
      
      * update rm trainer to use acc as target
      
      * update example/train_rm
      
      * Update train_rm.sh
      
      * code style
      
      * Update README.md
      
      * Update README.md
      
      * add rm test to ci
      
      * fix tokenier
      
      * fix typo
      
      * change batchsize to avoid oom in ci
      
      * Update test_ci.sh
      7548ca5a
  2. 17 Mar, 2023 5 commits
  3. 15 Mar, 2023 4 commits
  4. 14 Mar, 2023 7 commits
  5. 13 Mar, 2023 5 commits
  6. 12 Mar, 2023 1 commit
  7. 11 Mar, 2023 1 commit
  8. 10 Mar, 2023 13 commits
  9. 09 Mar, 2023 2 commits