1. 07 Mar, 2023 1 commit
  2. 03 Mar, 2023 1 commit
    • [chatgpt] making experience support dp (#2971) · 19ad49fb
      ver217 authored
      * [chatgpt] making experience support dp
      
      * [chatgpt] update example test ci
      
      * [chatgpt] update example test ci
      
      * [chatgpt] update example test ci
      
      * [chatgpt] update example test ci
      
      * [chatgpt] update sampler
      
      * [chatgpt] update example test ci
      
      * [chatgpt] refactor sampler
      
      * [chatgpt] update example test ci
  3. 22 Feb, 2023 1 commit
    • [chatgpt] Support saving ckpt in examples (#2846) · 34ca324b
      BlueRum authored
      * [chatgpt]fix train_rm bug with lora
      
      * [chatgpt]support colossalai strategy to train rm
      
      * fix pre-commit
      
      * fix pre-commit 2
      
      * [chatgpt]fix rm eval typo
      
      * fix rm eval
      
      * fix pre commit
      
      * add support of saving ckpt in examples
      
      * fix single-gpu save
  4. 17 Feb, 2023 1 commit
    • [chatgpt] strategy add prepare method (#2766) · 4ee311c0
      ver217 authored
      * [chatgpt] strategy add prepare method
      
      * [chatgpt] refactor examples
      
      * [chatgpt] refactor strategy.prepare
      
      * [chatgpt] support save/load checkpoint
      
      * [chatgpt] fix unwrap actor
      
      * [chatgpt] fix unwrap actor
  5. 15 Feb, 2023 1 commit
    • [chatgpt] optimize generation kwargs (#2717) · 9c0943ec
      ver217 authored
      * [chatgpt] ppo trainer use default generate args
      
      * [chatgpt] example remove generation preparing fn
      
      * [chatgpt] benchmark remove generation preparing fn
      
      * [chatgpt] fix ci
  6. 14 Feb, 2023 1 commit