1. 29 Mar, 2024 1 commit
    • [ColossalChat] Update RLHF V2 (#5286) · df5e9c53
      YeAnbang authored
      
      
      * Add dpo. Fix sft, ppo, lora. Refactor all
      
      * fix and tested ppo
      
      * 2nd round refactor
      
      * add ci tests
      
      * fix ci
      
      * fix ci
      
      * fix readme, style
      
      * fix readme style
      
      * fix style, fix benchmark
      
      * reproduce benchmark result, remove useless files
      
      * rename to ColossalChat
      
      * use new image
      
      * fix ci workflow
      
      * fix ci
      
      * use local model/tokenizer for ci tests
      
      * fix ci
      
      * fix ci
      
      * fix ci
      
      * fix ci timeout
      
      * fix rm progress bar. fix ci timeout
      
      * fix ci
      
      * fix ci typo
      
      * remove 3d plugin from ci temporarily
      
      * test environment
      
      * cannot save optimizer
      
      * support chat template
      
      * fix readme
      
      * fix path
      
      * test ci locally
      
      * restore build_or_pr
      
      * fix ci data path
      
      * fix benchmark
      
      * fix ci, move ci tests to 3080, disable fast tokenizer
      
      * move ci to 85
      
      * support flash attention 2
      
      * add all-in-one data preparation script. Fix colossal-llama2-chat chat template
      
      * add hardware requirements
      
      * move ci test data
      
      * fix save_model, add unwrap
      
      * fix missing bos
      
      * fix missing bos; support grad accumulation with gemini
      
      * fix ci
      
      * fix ci
      
      * fix ci
      
      * fix llama2 chat template config
      
      * debug sft
      
      * debug sft
      
      * fix colossalai version requirement
      
      * fix ci
      
      * add sanity check to prevent NaN loss
      
      * fix requirements
      
      * add dummy data generation script
      
      * add dummy data generation script
      
      * add dummy data generation script
      
      * add dummy data generation script
      
      * update readme
      
      * update readme
      
      * update readme and ignore
      
      * fix logger bug
      
      * support parallel_output
      
      * modify data preparation logic
      
      * fix tokenization
      
      * update lr
      
      * fix inference
      
      * run pre-commit
      
      ---------
      Co-authored-by: Tong Li <tong.li352711588@gmail.com>
  2. 14 Aug, 2023 1 commit
    • [doc] update Coati README (#4405) · 6d41c3f2
      Wenhao Chen authored
      * style: apply formatter
      
      * fix: add outdated warnings
      
      * docs: add dataset format and polish
      
      * docs: polish README
      
      * fix: fix json format
      
      * fix: fix typos
      
      * revert: revert 7b example
  3. 13 Apr, 2023 1 commit
    • [chat] ChatGPT train prompts on ray example (#3309) · 1a809edd
      MisterLin1995 authored
      
      
      * [feat][chatgpt]train prompts on ray example
      
      * [fix]simplify code
      
      * [fix]remove deprecated parameter
      
      * [fix]add dependencies
      
      * [fix]method calling
      
      * [fix]experience maker
      
      * [fix]missing loss function
      
      * [fix]init optimizer
      
      * [feat]add usage comment
      
      * [fix]rename files
      
      * [fix]add readme
      
      * [fix]file path
      
      * [fix]move directory
      
      ---------
      Co-authored-by: jiangwen <zxl265370@antgroup.com>