• YeAnbang's avatar
    [ColossalChat] Update RLHF V2 (#5286) · df5e9c53
    YeAnbang authored
    
    
    * Add dpo. Fix sft, ppo, lora. Refactor all
    
    * fix and tested ppo
    
    * 2 nd round refactor
    
    * add ci tests
    
    * fix ci
    
    * fix ci
    
    * fix readme, style
    
    * fix readme style
    
    * fix style, fix benchmark
    
    * reproduce benchmark result, remove useless files
    
    * rename to ColossalChat
    
    * use new image
    
    * fix ci workflow
    
    * fix ci
    
    * use local model/tokenizer for ci tests
    
    * fix ci
    
    * fix ci
    
    * fix ci
    
    * fix ci timeout
    
    * fix rm progress bar. fix ci timeout
    
    * fix ci
    
    * fix ci typo
    
    * remove 3d plugin from ci temporary
    
    * test environment
    
    * cannot save optimizer
    
    * support chat template
    
    * fix readme
    
    * fix path
    
    * test ci locally
    
    * restore build_or_pr
    
    * fix ci data path
    
    * fix benchmark
    
    * fix ci, move ci tests to 3080, disable fast tokenizer
    
    * move ci to 85
    
    * support flash attention 2
    
    * add all-in-one data preparation script. Fix colossal-llama2-chat chat template
    
    * add hardware requirements
    
    * move ci test data
    
    * fix save_model, add unwrap
    
    * fix missing bos
    
    * fix missing bos; support grad accumulation with gemini
    
    * fix ci
    
    * fix ci
    
    * fix ci
    
    * fix llama2 chat template config
    
    * debug sft
    
    * debug sft
    
    * fix colossalai version requirement
    
    * fix ci
    
    * add sanity check to prevent NaN loss
    
    * fix requirements
    
    * add dummy data generation script
    
    * add dummy data generation script
    
    * add dummy data generation script
    
    * add dummy data generation script
    
    * update readme
    
    * update readme
    
    * update readme and ignore
    
    * fix logger bug
    
    * support parallel_output
    
    * modify data preparation logic
    
    * fix tokenization
    
    * update lr
    
    * fix inference
    
    * run pre-commit
    
    ---------
    Co-authored-by: default avatarTong Li <tong.li352711588@gmail.com>
    df5e9c53
llama.json 779 Bytes