1. 15 Apr, 2024 1 commit
  2. 08 Apr, 2024 1 commit
  3. 07 Apr, 2024 1 commit
  4. 01 Apr, 2024 1 commit
    • [shardformer, pipeline] add `gradient_checkpointing_ratio` and heterogenous shard policy for llama (#5508) · e614aa34
      Wenhao Chen authored
      
      * feat: add `GradientCheckpointConfig` and `PipelineGradientCheckpointConfig`
      
      * feat: apply `GradientCheckpointConfig` to policy and llama_forward
      
      * feat: move `distribute_layer` and `get_stage_index` to PipelineStageManager
      
      * fix: add optional args for `distribute_layer` and `get_stage_index`
      
      * fix: fix changed API calls
      
      * test: update llama tests
      
      * style: polish `GradientCheckpointConfig`
      
      * fix: fix pipeline utils tests
  5. 29 Mar, 2024 1 commit
    • [ColossalChat] Update RLHF V2 (#5286) · df5e9c53
      YeAnbang authored
      
      
      * Add dpo. Fix sft, ppo, lora. Refactor all
      
      * fix and tested ppo
      
      * 2nd round refactor
      
      * add ci tests
      
      * fix ci
      
      * fix ci
      
      * fix readme, style
      
      * fix readme style
      
      * fix style, fix benchmark
      
      * reproduce benchmark result, remove useless files
      
      * rename to ColossalChat
      
      * use new image
      
      * fix ci workflow
      
      * fix ci
      
      * use local model/tokenizer for ci tests
      
      * fix ci
      
      * fix ci
      
      * fix ci
      
      * fix ci timeout
      
      * fix rm progress bar. fix ci timeout
      
      * fix ci
      
      * fix ci typo
      
      * remove 3d plugin from ci temporary
      
      * test environment
      
      * cannot save optimizer
      
      * support chat template
      
      * fix readme
      
      * fix path
      
      * test ci locally
      
      * restore build_or_pr
      
      * fix ci data path
      
      * fix benchmark
      
      * fix ci, move ci tests to 3080, disable fast tokenizer
      
      * move ci to 85
      
      * support flash attention 2
      
      * add all-in-one data preparation script. Fix colossal-llama2-chat chat template
      
      * add hardware requirements
      
      * move ci test data
      
      * fix save_model, add unwrap
      
      * fix missing bos
      
      * fix missing bos; support grad accumulation with gemini
      
      * fix ci
      
      * fix ci
      
      * fix ci
      
      * fix llama2 chat template config
      
      * debug sft
      
      * debug sft
      
      * fix colossalai version requirement
      
      * fix ci
      
      * add sanity check to prevent NaN loss
      
      * fix requirements
      
      * add dummy data generation script
      
      * add dummy data generation script
      
      * add dummy data generation script
      
      * add dummy data generation script
      
      * update readme
      
      * update readme
      
      * update readme and ignore
      
      * fix logger bug
      
      * support parallel_output
      
      * modify data preparation logic
      
      * fix tokenization
      
      * update lr
      
      * fix inference
      
      * run pre-commit
      
      ---------
      Co-authored-by: Tong Li <tong.li352711588@gmail.com>
  6. 27 Mar, 2024 1 commit
  7. 25 Mar, 2024 1 commit
  8. 20 Mar, 2024 1 commit
  9. 12 Mar, 2024 1 commit
  10. 11 Mar, 2024 1 commit
  11. 07 Mar, 2024 1 commit
  12. 05 Mar, 2024 3 commits
  13. 01 Mar, 2024 1 commit
  14. 28 Feb, 2024 1 commit
  15. 19 Feb, 2024 2 commits
  16. 07 Feb, 2024 6 commits
  17. 06 Feb, 2024 3 commits
  18. 05 Feb, 2024 4 commits
  19. 01 Feb, 2024 1 commit
  20. 25 Jan, 2024 1 commit
    • [NFC] polish applications/Colossal-LLaMA-2/colossal_llama2/tokenizer/init_tokenizer.py code style (#5228) · ec912b1b
      李文军 authored
      
  21. 22 Jan, 2024 1 commit
  22. 18 Jan, 2024 1 commit
  23. 11 Jan, 2024 1 commit
  24. 10 Jan, 2024 1 commit
  25. 09 Jan, 2024 1 commit
  26. 08 Jan, 2024 1 commit
    • [doc] SwiftInfer release (#5236) · 7bc6969c
      binmakeswell authored
      * [doc] SwiftInfer release
      
      * [doc] SwiftInfer release
      
      * [doc] SwiftInfer release
      
      * [doc] SwiftInfer release
      
      * [doc] SwiftInfer release
  27. 07 Jan, 2024 1 commit