Commits · 36c4bb2893e73022b1060bd6ad5c0685869e5465 · OpenDAS / ColossalAI

01 Feb, 2024 1 commit
- [Chat] fix sft loss nan (#5345) · c5239840
  YeAnbang authored Feb 01, 2024
```
* fix script

* fix script

* fix chat nan

* fix chat nan
```
  c5239840
09 Jan, 2024 1 commit

[npu] change device to accelerator api (#5239) · d202cc28

Hongxin Liu authored Jan 09, 2024



* update accelerator

* fix timer

* fix amp

* update

* fix

* update bug

* add error raise

* fix autocast

* fix set device

* remove doc accelerator

* update doc

* update doc

* update doc

* use nullcontext

* update cpu

* update null context

* change time limit for example

* udpate

* update

* update

* update

* [npu] polish accelerator code

---------
Co-authored-by: Xuanlei Zhao <xuanlei.zhao@gmail.com>
Co-authored-by: zxl <43881818+oahzxl@users.noreply.github.com>

d202cc28

20 Dec, 2023 1 commit
- polish readme in application/chat (#5194) · af952673
  BlueRum authored Dec 20, 2023
  
  af952673
14 Nov, 2023 1 commit
- fix wrong EOS token in ColossalChat · 43ad0d9e
  Orion-Zheng authored Nov 14, 2023
  
  43ad0d9e
27 Sep, 2023 2 commits

[doc] update slack link (#4823) · 822051d8
binmakeswell authored Sep 27, 2023

822051d8

[chat] fix gemini strategy (#4698) · be400a09

flybird11111 authored Sep 27, 2023

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* g# This is a combination of 2 commits.

[chat] fix gemini strategy

fox

* [chat] fix gemini strategy

update llama2 example

[chat] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* fix

* fix

* fix

* fix

* fix

* Update train_prompts.py

be400a09

24 Sep, 2023 1 commit

[feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) · ce777853

Yuanchen authored Sep 24, 2023



* Add ColossalEval

* Delete evaluate in Chat

---------
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
Co-authored-by: Tong Li <tong.li352711588@gmail.com>

ce777853

21 Sep, 2023 1 commit
- [chat]: add lora merge weights config (#4766) · 901ab1ee
  Wenhao Chen authored Sep 21, 2023
```
* feat: modify lora merge weights fn

* feat: add lora merge weights config
```
  901ab1ee
20 Sep, 2023 1 commit

[chat]: update rm, add wandb and fix bugs (#4471) · 7b9b8644

Wenhao Chen authored Sep 20, 2023



* feat: modify forward fn of critic and reward model

* feat: modify calc_action_log_probs

* to: add wandb in sft and rm trainer

* feat: update train_sft

* feat: update train_rm

* style: modify type annotation and add warning

* feat: pass tokenizer to ppo trainer

* to: modify trainer base and maker base

* feat: add wandb in ppo trainer

* feat: pass tokenizer to generate

* test: update generate fn tests

* test: update train tests

* fix: remove action_mask

* feat: remove unused code

* fix: fix wrong ignore_index

* fix: fix mock tokenizer

* chore: update requirements

* revert: modify make_experience

* fix: fix inference

* fix: add padding side

* style: modify _on_learn_batch_end

* test: use mock tokenizer

* fix: use bf16 to avoid overflow

* fix: fix workflow

* [chat] fix gemini strategy

* [chat] fix

* sync: update colossalai strategy

* fix: fix args and model dtype

* fix: fix checkpoint test

* fix: fix requirements

* fix: fix missing import and wrong arg

* fix: temporarily skip gemini test in stage 3

* style: apply pre-commit

* fix: temporarily skip gemini test in stage 1&2

---------
Co-authored-by: Mingyan Jiang <1829166702@qq.com>

7b9b8644

19 Sep, 2023 1 commit

[misc] update pre-commit and run all files (#4752) · 079bf3cb

Hongxin Liu authored Sep 19, 2023

* [misc] update pre-commit

* [misc] run pre-commit

* [misc] remove useless configuration files

* [misc] ignore cuda for clang-format

079bf3cb

15 Sep, 2023 1 commit
- Optimized some syntax errors in the documentation and code under applications/ (#4127) · e4fc57c3
  digger yu authored Sep 15, 2023
```
Co-authored-by: flybird11111 <1829166702@qq.com>
```
  e4fc57c3
30 Aug, 2023 1 commit
- fix colossalai version in coati examples · c648dc09
  Ying Liu authored Aug 30, 2023
  
  c648dc09
29 Aug, 2023 1 commit

[coati] add chatglm model (#4539) · 1467e3b4

yingliu-hpc authored Aug 29, 2023

* update configuration of chatglm and add support in coati

* add unit test & update chatglm default config & fix bos index issue

* remove chatglm due to oom

* add dataset pkg in requirement-text

* fix parameter issue in test_models

* add ref in tokenize & rm unnessary parts

* separate source & target tokenization in chatglm

* add unit test to chatglm

* fix test dataset issue

* update truncation of chatglm

* fix Colossalai version

* fix colossal ai version in test

1467e3b4

21 Aug, 2023 1 commit

[chat] update config and prompt (#4139) · 285fe7ba

Michelle authored Aug 21, 2023



* update config and prompt

* update config

---------
Co-authored-by: Qianran Ma <qianranm@luchentech.com>

285fe7ba

16 Aug, 2023 1 commit

[devops] add large-scale distributed test marker (#4452) · 26e29d58

Hongxin Liu authored Aug 16, 2023

* [test] remove cpu marker

* [test] remove gpu marker

* [test] update pytest markers

* [ci] update unit test ci

26e29d58

14 Aug, 2023 1 commit

[doc] update Coati README (#4405) · 6d41c3f2

Wenhao Chen authored Aug 14, 2023

* style: apply formatter

* fix: add outdated warnings

* docs: add dataset format and polish

* docs: polish README

* fix: fix json format

* fix: fix typos

* revert: revert 7b example

6d41c3f2

02 Aug, 2023 1 commit

[chat] fix bugs and add unit tests (#4213) · da4f7b85

Wenhao Chen authored Aug 02, 2023

* style: rename replay buffer

Experience replay is typically for off policy algorithms.
Use this name in PPO maybe misleading.

* fix: fix wrong zero2 default arg

* test: update experience tests

* style: rename zero_pad fn

* fix: defer init in CycledDataLoader

* test: add benchmark test

* style: rename internal fn of generation

* style: rename internal fn of lora

* fix: remove unused loss fn

* fix: remove unused utils fn

* refactor: remove generate_with_actor fn

* fix: fix type annotation

* test: add models tests

* fix: skip llama due to long execution time

* style: modify dataset

* style: apply formatter

* perf: update reward dataset

* fix: fix wrong IGNORE_INDEX in sft dataset

* fix: remove DataCollatorForSupervisedDataset

* test: add dataset tests

* style: apply formatter

* style: rename test_ci to test_train

* feat: add llama in inference

* test: add inference tests

* test: change test scripts directory

* fix: update ci

* fix: fix typo

* fix: skip llama due to oom

* fix: fix file mod

* style: apply formatter

* refactor: remove duplicated llama_gptq

* style: apply formatter

* to: update rm test

* feat: add tokenizer arg

* feat: add download model script

* test: update train tests

* fix: modify gemini load and save pretrained

* test: update checkpoint io test

* to: modify nproc_per_node

* fix: do not remove existing dir

* fix: modify save path

* test: add random choice

* fix: fix sft path

* fix: enlarge nproc_per_node to avoid oom

* fix: add num_retry

* fix: make lora config of rm and critic consistent

* fix: add warning about lora weights

* fix: skip some gpt2 tests

* fix: remove grad ckpt in rm and critic due to errors

* refactor: directly use Actor in train_sft

* test: add more arguments

* fix: disable grad ckpt when using lora

* fix: fix save_pretrained and related tests

* test: enable zero2 tests

* revert: remove useless fn

* style: polish code

* test: modify test args

da4f7b85

01 Aug, 2023 1 commit
- [chat] fix compute_approx_kl (#4338) · 75c53890
  Wenhao Chen authored Aug 01, 2023
  
  75c53890
28 Jul, 2023 1 commit
- support session-based training (#4313) · 5187c96b
  Yuanchen authored Jul 28, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  5187c96b
26 Jul, 2023 12 commits
- [NFC] polish applications/Chat/coati/models/utils.py codestyle (#4277) · 09914053
  yuxuan-lou authored Jul 19, 2023
```
* [NFC] polish colossalai/context/random/__init__.py code style

* [NFC] polish applications/Chat/coati/models/utils.py code style
```
  09914053
- [NFC] polish applications/Chat/coati/trainer/strategies/base.py code style (#4278) · 9e512938
  Zirui Zhu authored Jul 19, 2023
  
  9e512938
- applications/Chat/.gitignore (#4279) · c972d653
  Ziheng Qin authored Jul 19, 2023
```
Co-authored-by: henryqin1997 <henryqin1997@gamil.com>
```
  c972d653
- [NFC] polish applications/Chat/coati/models/generation.py code style (#4275) · 709e121c
  RichardoLuo authored Jul 18, 2023
  
  709e121c
- [NFC] polish applications/Chat/inference/server.py code style (#4274) · dc1b6127
  Yuanchen authored Jul 18, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  dc1b6127
- [NFC] fix format of application/Chat/coati/trainer/utils.py (#4273) · caa44330
  アマデウス authored Jul 18, 2023
  
  caa44330
- [NFC] polish applications/Chat/examples/train_reward_model.py code style (#4271) · 1ce997da
  Xu Kai authored Jul 18, 2023
  
  1ce997da
- [NFC] polish applications/Chat/coati/trainer/base.py code style (#4260) · 798cb729
  shenggan authored Jul 18, 2023
  
  798cb729
- [NFC] polish applications/Chat/coati/dataset/sft_dataset.py code style (#4259) · b2debdc0
  Zheng Zangwei (Alex Zheng) authored Jul 18, 2023
  
  b2debdc0
- [NFC] policy applications/Chat/examples/ray/mmmt_prompt.py code style (#4250) · dee1c963
  CZYCW authored Jul 18, 2023
  
  dee1c963
- [NFC] polish applications/Chat/coati/models/base/actor.py code style (#4248) · 77c469e1
  Junming Wu authored Jul 18, 2023
  
  77c469e1
- [NFC] polish applications/Chat/inference/requirements.txt code style (#4265) · 915ed8be
  Camille Zhong authored Jul 18, 2023
  
  915ed8be
04 Jul, 2023 3 commits

[chat] removed cache file (#4155) · f447ca18
Frank Lee authored Jul 04, 2023

f447ca18
[shardformer] shardformer support t5 model (#3994) · c1c672d0
wukong1992 authored Jun 15, 2023
```
test t5
```
c1c672d0

[chat] use official transformers and fix some issues (#4117) · 3d8d5d0d

Wenhao Chen authored Jul 04, 2023

* feat: remove on_learn_epoch fn as not used

* revert: add _on_learn_epoch fn

* feat: remove NaiveStrategy

* test: update train_prompts tests

* fix: remove prepare_llama_tokenizer_and_embedding

* test: add lora arg

* feat: remove roberta support in train_prompts due to runtime errs

* feat: remove deberta & roberta in rm as not used

* test: remove deberta and roberta tests

* feat: remove deberta and roberta models as not used

* fix: remove calls to roberta

* fix: remove prepare_llama_tokenizer_and_embedding

* chore: update transformers version

* docs: update transformers version

* fix: fix actor inference

* fix: fix ci

* feat: change llama pad token to unk

* revert: revert ddp setup_distributed

* fix: change llama pad token to unk

* revert: undo unnecessary changes

* fix: use pip to install transformers

3d8d5d0d

29 Jun, 2023 2 commits

[chat] remove naive strategy and split colossalai strategy (#4094) · edd75a59

Wenhao Chen authored Jun 29, 2023

* feat: remove on_learn_epoch fn as not used

* revert: add _on_learn_epoch fn

* to: remove the use of NaiveStrategy

* test: remove NaiveStrategy tests

* feat: remove NaiveStrategy

* style: modify comments and params

* feat: split ColossalAIStrategy into LowLevelZeroStrategy and GeminiStrategy

* fix: remove naive

* fix: align with modified colossal strategy

* fix: fix ddp _try_init_dist arg

edd75a59

[chat] refactor trainer class (#4080) · b03d64d0

Wenhao Chen authored Jun 29, 2023

* to: add SLTrainer

* refactor: refactor RMTrainer and SFTTrainer

* fix: fix init file

* feat: remove on_learn_epoch fn as not used

* fix: align with modified gemini arguments

* to: add OnPolicyTrainer

* revert: add _on_learn_epoch fn

* refactor: refactor PPOTrainer

* style: rename PPOTrainer argument

* fix: align with modified PPO arguments

* test: align with modified train_prompts arguments

* chore: modify train_prompts

* docs: align with modified arguments

* fix: remove unnecessary output

* fix: move dataloader to fit fn of SLTrainer

* fix: move dataloader to fit fn of OnPolicyTrainer

* fix: modify usage of prompt and pretrain dataloader

b03d64d0

26 Jun, 2023 2 commits
- [hotfix]fix argument naming in docs and examples (#4083) · 4da324cd
  Baizhou Zhang authored Jun 26, 2023
  
  4da324cd
- [chat]: fix chat evaluation possible bug (#4064) · e89b127d
  Michelle authored Jun 26, 2023
```
* fix chat eval

* fix utils

* fix utils

* add comment

---------
Co-authored-by: Qianran Ma <qianranm@luchentech.com>
```
  e89b127d
25 Jun, 2023 1 commit

[chat] refactor strategy class with booster api (#3987) · 153b957a

Wenhao Chen authored Jun 25, 2023

* refactor: adapt boost API in base and naive strategies

* fix: initialize plugin after setup_distributed

* fix: fix save_pretrained fn

* refactor: adapt boost API in DDPStrategy

* to: add _post_init check

* to: fix ddp backward, modify ddp dataloader and unwrap

* feat: adapt boost API in ColossalAIStrategy

* fix: call setup_distributed before use get_current_device

* fix: fix save_model and save_optimizer

* test: remove save_sharded_optimizer test

* style: apply formatter

* fix: fix stage check and add comments

* feat: allow dict type arg in strategy.prepare

* to: temporarily remove lr_scheduler for testing

* style: simplify init of ColossalAIStrategy

* fix: fix lr_scheduler in sft and rm

* style: modify comments

* test: add train_prompts tests

* fix: fix inference only case and use in train_prompts

* test: skip failed tests in ci

* style: fix CodeFactor check

* fix: do not use model.to('cpu') with GeminiPlugin

* test: enable colossalai_gemini tests

* test: set CUDA_VISIBLE_DEVICES in ci

* docs: add note

153b957a

19 Jun, 2023 1 commit
- [nfc] fix dim not defined and fix typo (#3991) · 727c4598
  digger yu authored Jun 19, 2023
  
  727c4598