1. 15 Sep, 2023 1 commit
  2. 14 Aug, 2023 1 commit
    • Wenhao Chen's avatar
      [doc] update Coati README (#4405) · 6d41c3f2
      Wenhao Chen authored
      * style: apply formatter
      
      * fix: add outdated warnings
      
      * docs: add dataset format and polish
      
      * docs: polish README
      
      * fix: fix json format
      
      * fix: fix typos
      
      * revert: revert 7b example
      6d41c3f2
  3. 04 Jul, 2023 1 commit
    • Wenhao Chen's avatar
      [chat] use official transformers and fix some issues (#4117) · 3d8d5d0d
      Wenhao Chen authored
      * feat: remove on_learn_epoch fn as not used
      
      * revert: add _on_learn_epoch fn
      
      * feat: remove NaiveStrategy
      
      * test: update train_prompts tests
      
      * fix: remove prepare_llama_tokenizer_and_embedding
      
      * test: add lora arg
      
      * feat: remove roberta support in train_prompts due to runtime errs
      
      * feat: remove deberta & roberta in rm as not used
      
      * test: remove deberta and roberta tests
      
      * feat: remove deberta and roberta models as not used
      
      * fix: remove calls to roberta
      
      * fix: remove prepare_llama_tokenizer_and_embedding
      
      * chore: update transformers version
      
      * docs: update transformers version
      
      * fix: fix actor inference
      
      * fix: fix ci
      
      * feat: change llama pad token to unk
      
      * revert: revert ddp setup_distributed
      
      * fix: change llama pad token to unk
      
      * revert: undo unnecessary changes
      
      * fix: use pip to install transformers
      3d8d5d0d
  4. 29 Jun, 2023 2 commits
    • Wenhao Chen's avatar
      [chat] remove naive strategy and split colossalai strategy (#4094) · edd75a59
      Wenhao Chen authored
      * feat: remove on_learn_epoch fn as not used
      
      * revert: add _on_learn_epoch fn
      
      * to: remove the use of NaiveStrategy
      
      * test: remove NaiveStrategy tests
      
      * feat: remove NaiveStrategy
      
      * style: modify comments and params
      
      * feat: split ColossalAIStrategy into LowLevelZeroStrategy and GeminiStrategy
      
      * fix: remove naive
      
      * fix: align with modified colossal strategy
      
      * fix: fix ddp _try_init_dist arg
      edd75a59
    • Wenhao Chen's avatar
      [chat] refactor trainer class (#4080) · b03d64d0
      Wenhao Chen authored
      * to: add SLTrainer
      
      * refactor: refactor RMTrainer and SFTTrainer
      
      * fix: fix init file
      
      * feat: remove on_learn_epoch fn as not used
      
      * fix: align with modified gemini arguments
      
      * to: add OnPolicyTrainer
      
      * revert: add _on_learn_epoch fn
      
      * refactor: refactor PPOTrainer
      
      * style: rename PPOTrainer argument
      
      * fix: align with modified PPO arguments
      
      * test: align with modified train_prompts arguments
      
      * chore: modify train_prompts
      
      * docs: align with modified arguments
      
      * fix: remove unnecessary output
      
      * fix: move dataloader to fit fn of SLTrainer
      
      * fix: move dataloader to fit fn of OnPolicyTrainer
      
      * fix: modify usage of prompt and pretrain dataloader
      b03d64d0
  5. 15 May, 2023 1 commit
  6. 10 May, 2023 1 commit
  7. 06 May, 2023 1 commit
  8. 28 Apr, 2023 1 commit
  9. 20 Apr, 2023 1 commit
  10. 13 Apr, 2023 1 commit
    • MisterLin1995's avatar
      [chat] ChatGPT train prompts on ray example (#3309) · 1a809edd
      MisterLin1995 authored
      
      
      * [feat][chatgpt]train prompts on ray example
      
      * [fix]simplify code
      
      * [fix]remove depreciated parameter
      
      * [fix]add dependencies
      
      * [fix]method calling
      
      * [fix]experience maker
      
      * [fix]missing loss function
      
      * [fix]init optimizer
      
      * [feat]add usage comment
      
      * [fix]rename files
      
      * [fix]add readme
      
      * [fix]file path
      
      * [fix]move directory
      
      ---------
      Co-authored-by: default avatarjiangwen <zxl265370@antgroup.com>
      1a809edd
  11. 10 Apr, 2023 1 commit
  12. 06 Apr, 2023 2 commits
    • Fazzie-Maqianli's avatar
      add community example dictionary (#3465) · 6afeb120
      Fazzie-Maqianli authored
      6afeb120
    • YY Lin's avatar
      [Chat]Add Peft support & fix the ptx bug (#3433) · 62f4e2eb
      YY Lin authored
      * Update ppo.py
      
      Fix the bug of fetching wrong batch data
      
      * Add peft model support in SFT and Prompts training
      
      In stage-1 and stage-3, the peft model supports are added. So the trained artifacts will be only a small lora additions instead of the whole bunch of files.
      
      * Delete test_prompts.txt
      
      * Delete test_pretrained.txt
      
      * Move the peft stuffs to a community folder.
      
      * Move the demo sft to community
      
      * delete dirty files
      
      * Add instructions to install peft using source
      
      * Remove Chinese comments
      
      * remove the Chinese comments
      62f4e2eb