1. 18 Apr, 2023 1 commit
  2. 03 Apr, 2023 1 commit
    • Camille Zhong's avatar
      [chatgpt] add pre-trained model RoBERTa for RLHF stage 2 & 3 (#3223) · 30412866
      Camille Zhong authored
      * Add RoBERTa for RLHF Stage 2 & 3 (test)
      
      RoBERTa for RLHF Stage 2 & 3 (still in testing)
      
      * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"
      
      This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.
      
      * Add RoBERTa for RLHF stage 2 & 3
      
      1. add roberta folder under model folder
      2. add  roberta option in train_reward_model.py
      3. add some test in testci
      
      * add test for reward model training
      
      * Update test_ci.sh
      
      * Revert "Update test_ci.sh"
      
      This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.
      
      * Add RoBERTa for RLHF Stage 2 & 3 (test)
      
      RoBERTa for RLHF Stage 2 & 3 (still in testing)
      
      * Revert "Add RoBERTa for RLHF Stage 2 & 3 (test)"
      
      This reverts commit 06741d894dcbe958acd4e10d771f22275e20e368.
      
      * Add RoBERTa for RLHF stage 2 & 3
      
      1. add roberta folder under model folder
      2. add  roberta option in train_reward_model.py
      3. add some test in testci
      
      * Update test_ci.sh
      
      * Revert "Update test_ci.sh"
      
      This reverts commit 9c7352b81766f3177d31eeec0ec178a301df966a.
      
      * update roberta with coati
      30412866
  3. 28 Mar, 2023 1 commit
  4. 14 Mar, 2023 1 commit
    • BlueRum's avatar
      [chatgpt]update ci (#3087) · 23cd5e2c
      BlueRum authored
      * [chatgpt]update ci
      
      * Update test_ci.sh
      
      * Update test_ci.sh
      
      * Update test_ci.sh
      
      * test
      
      * Update train_prompts.py
      
      * Update train_dummy.py
      
      * add save_path
      
      * polish
      
      * add save path
      
      * polish
      
      * add save path
      
      * polish
      
      * delete bloom-560m test
      
      delete bloom-560m test because of oom
      
      * add ddp test
      23cd5e2c
  5. 13 Mar, 2023 1 commit
  6. 07 Mar, 2023 2 commits
  7. 03 Mar, 2023 1 commit
    • ver217's avatar
      [chatgpt] making experience support dp (#2971) · 19ad49fb
      ver217 authored
      * [chatgpt] making experience support dp
      
      * [chatgpt] update example test ci
      
      * [chatgpt] update example test ci
      
      * [chatgpt] update example test ci
      
      * [chatgpt] update example test ci
      
      * [chatgpt] update sampler
      
      * [chatgpt] update example test ci
      
      * [chatgpt] refactor sampler
      
      * [chatgpt] update example test ci
      19ad49fb
  8. 22 Feb, 2023 1 commit
    • BlueRum's avatar
      [chatgpt] Support saving ckpt in examples (#2846) · 34ca324b
      BlueRum authored
      * [chatgpt]fix train_rm bug with lora
      
      * [chatgpt]support colossalai strategy to train rm
      
      * fix pre-commit
      
      * fix pre-commit 2
      
      * [chatgpt]fix rm eval typo
      
      * fix rm eval
      
      * fix pre commit
      
      * add support of saving ckpt in examples
      
      * fix single-gpu save
      34ca324b
  9. 17 Feb, 2023 1 commit
    • ver217's avatar
      [chatgpt] startegy add prepare method (#2766) · 4ee311c0
      ver217 authored
      * [chatgpt] startegy add prepare method
      
      * [chatgpt] refactor examples
      
      * [chatgpt] refactor strategy.prepare
      
      * [chatgpt] support save/load checkpoint
      
      * [chatgpt] fix unwrap actor
      
      * [chatgpt] fix unwrap actor
      4ee311c0
  10. 15 Feb, 2023 1 commit
    • ver217's avatar
      [chatgpt] optimize generation kwargs (#2717) · 9c0943ec
      ver217 authored
      * [chatgpt] ppo trainer use default generate args
      
      * [chatgpt] example remove generation preparing fn
      
      * [chatgpt] benchmark remove generation preparing fn
      
      * [chatgpt] fix ci
      9c0943ec
  11. 14 Feb, 2023 1 commit