- 27 Sep, 2023 4 commits
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
littsk authored
-
littsk authored
-
Hongxin Liu authored
-
- 26 Sep, 2023 8 commits
-
-
Yan haixu authored
-
Chandler-Bing authored
change filename: pretraining.py -> trainin.py there is no file named pretraing.py. wrong writing
-
Desperado-Jia authored
[doc] Update TODO in README of Colossal-LLaMA-2
-
Tong Li authored
-
Hongxin Liu authored
* [lazy] patch from pretrained * [lazy] fix from pretrained and add tests * [devops] update ci
-
Tong Li authored
-
Baizhou Zhang authored
* support unsharded saving/loading for model * support optimizer unsharded saving * update doc * support unsharded loading for optimizer * small fix
-
Baizhou Zhang authored
* fix example format in docstring * polish shardformer doc
-
- 25 Sep, 2023 2 commits
-
-
flybird11111 authored
* [fix] fix weekly runing example * [fix] fix weekly runing example
-
binmakeswell authored
* [doc] add llama2 domain-specific solution news
-
- 24 Sep, 2023 2 commits
-
-
Yuanchen authored
* Add ColossalEval * Delete evaluate in Chat --------- Co-authored-by:
Xu Yuanchen <yuanchen.xu00@gmail.com> Co-authored-by:
Tong Li <tong.li352711588@gmail.com>
-
Tong Li authored
-
- 22 Sep, 2023 4 commits
-
-
Hongxin Liu authored
* [release] update version * [doc] revert versions
-
Jianghai authored
* add chatglm2 * add * gather needed kernels * fix some bugs * finish context forward * finish context stage * fix * add * pause * add * fix bugs * finish chatglm * fix bug * change some logic * fix bugs * change some logics * add * add * add * fix * fix tests * fix
-
Xu Kai authored
* [gptq] add gptq kernel (#4416) * add gptq * refactor code * fix tests * replace auto-gptq * rname inferance/quant * refactor test * add auto-gptq as an option * reset requirements * change assert and check auto-gptq * add import warnings * change test flash attn version * remove example * change requirements of flash_attn * modify tests * [skip ci] change requirements-test * [gptq] faster gptq cuda kernel (#4494) * [skip ci] add cuda kernels * add license * [skip ci] fix max_input_len * format files & change test size * [skip ci] * [gptq] add gptq tensor parallel (#4538) * add gptq tensor parallel * add gptq tp * delete print * add test gptq check * add test auto gptq check * [gptq] combine gptq and kv cache manager (#4706) * combine gptq and kv cache manager * add init bits * delete useless code * add model path * delete usless print and update test * delete usless import * move option gptq to shard config * change replace linear to shardformer * update bloom policy * delete useless code * fix import bug and delete uselss code * change colossalai/gptq to colossalai/quant/gptq * update import linear for tests * delete useless code and mv gptq_kernel to kernel directory * fix triton kernel * add triton import
-
littsk authored
* Fix the version check bug in colossalai run when generating the cmd. * polish code
-
- 21 Sep, 2023 5 commits
-
-
Hongxin Liu authored
* [lazy] support _like methods and clamp * [lazy] pass transformers models * [lazy] fix device move and requires grad * [lazy] fix requires grad and refactor api * [lazy] fix requires grad
-
Wenhao Chen authored
* feat: modify lora merge weights fn * feat: add lora merge weights config
-
Baizhou Zhang authored
-
Hongxin Liu authored
* [doc] clean up outdated docs * [doc] fix linking * [doc] fix linking
-
Baizhou Zhang authored
-
- 20 Sep, 2023 4 commits
-
-
Baizhou Zhang authored
* fix master param sync for hybrid plugin * rewrite unwrap for ddp/fsdp * rewrite unwrap for zero/gemini * rewrite unwrap for hybrid plugin * fix geemini unwrap * fix bugs
-
Wenhao Chen authored
* feat: modify forward fn of critic and reward model * feat: modify calc_action_log_probs * to: add wandb in sft and rm trainer * feat: update train_sft * feat: update train_rm * style: modify type annotation and add warning * feat: pass tokenizer to ppo trainer * to: modify trainer base and maker base * feat: add wandb in ppo trainer * feat: pass tokenizer to generate * test: update generate fn tests * test: update train tests * fix: remove action_mask * feat: remove unused code * fix: fix wrong ignore_index * fix: fix mock tokenizer * chore: update requirements * revert: modify make_experience * fix: fix inference * fix: add padding side * style: modify _on_learn_batch_end * test: use mock tokenizer * fix: use bf16 to avoid overflow * fix: fix workflow * [chat] fix gemini strategy * [chat] fix * sync: update colossalai strategy * fix: fix args and model dtype * fix: fix checkpoint test * fix: fix requirements * fix: fix missing import and wrong arg * fix: temporarily skip gemini test in stage 3 * style: apply pre-commit * fix: temporarily skip gemini test in stage 1&2 --------- Co-authored-by:Mingyan Jiang <1829166702@qq.com>
-
ppt0011 authored
[doc] explain suitable use case for each plugin
-
Pengtai Xu authored
-
- 19 Sep, 2023 4 commits
-
-
Pengtai Xu authored
-
Pengtai Xu authored
-
Pengtai Xu authored
-
Hongxin Liu authored
* [misc] update pre-commit * [misc] run pre-commit * [misc] remove useless configuration files * [misc] ignore cuda for clang-format
-
- 18 Sep, 2023 3 commits
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
Hongxin Liu authored
* [legacy] remove outdated codes of pipeline (#4692) * [legacy] remove cli of benchmark and update optim (#4690) * [legacy] remove cli of benchmark and update optim * [doc] fix cli doc test * [legacy] fix engine clip grad norm * [legacy] remove outdated colo tensor (#4694) * [legacy] remove outdated colo tensor * [test] fix test import * [legacy] move outdated zero to legacy (#4696) * [legacy] clean up utils (#4700) * [legacy] clean up utils * [example] update examples * [legacy] clean up amp * [legacy] fix amp module * [legacy] clean up gpc (#4742) * [legacy] clean up context * [legacy] clean core, constants and global vars * [legacy] refactor initialize * [example] fix examples ci * [example] fix examples ci * [legacy] fix tests * [example] fix gpt example * [example] fix examples ci * [devops] fix ci installation * [example] fix examples ci
-
Xuanlei Zhao authored
-
- 15 Sep, 2023 4 commits
-
-
Baizhou Zhang authored
-
flybird11111 authored
* [shardformer] update shardformer readme [shardformer] update shardformer readme [shardformer] update shardformer readme * [shardformer] update llama2/opt finetune example and shardformer update to llama2 * [shardformer] update llama2/opt finetune example and shardformer update to llama2 * [shardformer] update llama2/opt finetune example and shardformer update to llama2 * [shardformer] change dataset * [shardformer] change dataset * [shardformer] fix CI * [shardformer] fix * [shardformer] fix * [shardformer] fix * [shardformer] fix * [shardformer] fix [example] update opt example [example] resolve comments fix fix * [example] llama2 add finetune example * [example] llama2 add finetune example * [example] llama2 add finetune example * [example] llama2 add finetune example * fix * update llama2 example * update llama2 example * fix * update llama2 example * update llama2 example * update llama2 example * update llama2 example * update llama2 example * update llama2 example * Update requirements.txt * update llama2 example * update llama2 example * update llama2 example
-
Xuanlei Zhao authored
* add custom policy * update assert
-
Baizhou Zhang authored
* arrange position of chapters * fix typos in seq parallel doc
-