Commits · 239cd92eff7a583d23c34ed520216f451cda165b · OpenDAS / ColossalAI

09 Nov, 2023 1 commit
- Support mtbench (#5025) · 239cd92e
  Yuanchen authored Nov 09, 2023
```
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
```
  239cd92e
31 Oct, 2023 1 commit
- fix ColossalEval (#4992) · abe071b6
  Yuanchen authored Oct 31, 2023
```
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
```
  abe071b6
17 Oct, 2023 1 commit
- [format] applied code formatting on changed files in pull request 4908 (#4918) · a41cf88e
  github-actions[bot] authored Oct 17, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
  a41cf88e
16 Oct, 2023 1 commit

Update flash_attention_patch.py · 7768afba

Zian(Andy) Zheng authored Oct 13, 2023

To be compatible with the new change in the Transformers library, where a new argument 'padding_mask' was added to forward function of attention layer.
https://github.com/huggingface/transformers/pull/25598

7768afba

10 Oct, 2023 3 commits
- Update README.md · 652adc22
  Camille Zhong authored Oct 10, 2023
  
  652adc22
- Update README.md · afe10a85
  Camille Zhong authored Oct 10, 2023
  
  afe10a85
- Update modelscope link in README.md · 3043d5d6
  Camille Zhong authored Oct 10, 2023
```
add modelscope link
```
  3043d5d6
28 Sep, 2023 1 commit
- update Colossal (#4832) · ed06731e
  Tong Li authored Sep 28, 2023
  
  ed06731e
27 Sep, 2023 3 commits

[doc] update slack link (#4823) · 822051d8
binmakeswell authored Sep 27, 2023

822051d8
Update Qwen-7B results (#4821) · 1fa8c5e0
Yuanchen authored Sep 27, 2023
```
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
```
1fa8c5e0

[chat] fix gemini strategy (#4698) · be400a09

flybird11111 authored Sep 27, 2023

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* g# This is a combination of 2 commits.

[chat] fix gemini strategy

fox

* [chat] fix gemini strategy

update llama2 example

[chat] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* fix

* fix

* fix

* fix

* fix

* Update train_prompts.py

be400a09

26 Sep, 2023 3 commits
- [hotfix] change llama2 Colossal-LLaMA-2 script filename (#4800) · b6cf0aca
  Chandler-Bing authored Sep 26, 2023
```
change filename:
pretraining.py -> trainin.py
there is no file named pretraing.py. wrong writing
```
  b6cf0aca
- update · 8cbce618
  Tong Li authored Sep 26, 2023
  
  8cbce618
- update readme · bd014673
  Tong Li authored Sep 26, 2023
  
  bd014673
25 Sep, 2023 1 commit
- [doc] add llama2 domain-specific solution news (#4789) · d512a4d3
  binmakeswell authored Sep 25, 2023
```
* [doc] add llama2 domain-specific solution news
```
  d512a4d3
24 Sep, 2023 2 commits
- [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) · ce777853
  Yuanchen authored Sep 24, 2023
```
* Add ColossalEval

* Delete evaluate in Chat

---------
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
Co-authored-by: Tong Li <tong.li352711588@gmail.com>
```
  ce777853
- initial commit: add colossal llama 2 (#4784) · 74aa7d96
  Tong Li authored Sep 24, 2023
  
  74aa7d96
21 Sep, 2023 1 commit
- [chat]: add lora merge weights config (#4766) · 901ab1ee
  Wenhao Chen authored Sep 21, 2023
```
* feat: modify lora merge weights fn

* feat: add lora merge weights config
```
  901ab1ee
20 Sep, 2023 1 commit

[chat]: update rm, add wandb and fix bugs (#4471) · 7b9b8644

Wenhao Chen authored Sep 20, 2023



* feat: modify forward fn of critic and reward model

* feat: modify calc_action_log_probs

* to: add wandb in sft and rm trainer

* feat: update train_sft

* feat: update train_rm

* style: modify type annotation and add warning

* feat: pass tokenizer to ppo trainer

* to: modify trainer base and maker base

* feat: add wandb in ppo trainer

* feat: pass tokenizer to generate

* test: update generate fn tests

* test: update train tests

* fix: remove action_mask

* feat: remove unused code

* fix: fix wrong ignore_index

* fix: fix mock tokenizer

* chore: update requirements

* revert: modify make_experience

* fix: fix inference

* fix: add padding side

* style: modify _on_learn_batch_end

* test: use mock tokenizer

* fix: use bf16 to avoid overflow

* fix: fix workflow

* [chat] fix gemini strategy

* [chat] fix

* sync: update colossalai strategy

* fix: fix args and model dtype

* fix: fix checkpoint test

* fix: fix requirements

* fix: fix missing import and wrong arg

* fix: temporarily skip gemini test in stage 3

* style: apply pre-commit

* fix: temporarily skip gemini test in stage 1&2

---------
Co-authored-by: Mingyan Jiang <1829166702@qq.com>

7b9b8644

19 Sep, 2023 1 commit

[misc] update pre-commit and run all files (#4752) · 079bf3cb

Hongxin Liu authored Sep 19, 2023

* [misc] update pre-commit

* [misc] run pre-commit

* [misc] remove useless configuration files

* [misc] ignore cuda for clang-format

079bf3cb

15 Sep, 2023 1 commit
- Optimized some syntax errors in the documentation and code under applications/ (#4127) · e4fc57c3
  digger yu authored Sep 15, 2023
```
Co-authored-by: flybird11111 <1829166702@qq.com>
```
  e4fc57c3
30 Aug, 2023 1 commit
- fix colossalai version in coati examples · c648dc09
  Ying Liu authored Aug 30, 2023
  
  c648dc09
29 Aug, 2023 1 commit

[coati] add chatglm model (#4539) · 1467e3b4

yingliu-hpc authored Aug 29, 2023

* update configuration of chatglm and add support in coati

* add unit test & update chatglm default config & fix bos index issue

* remove chatglm due to oom

* add dataset pkg in requirement-text

* fix parameter issue in test_models

* add ref in tokenize & rm unnessary parts

* separate source & target tokenization in chatglm

* add unit test to chatglm

* fix test dataset issue

* update truncation of chatglm

* fix Colossalai version

* fix colossal ai version in test

1467e3b4

21 Aug, 2023 1 commit

[chat] update config and prompt (#4139) · 285fe7ba

Michelle authored Aug 21, 2023



* update config and prompt

* update config

---------
Co-authored-by: Qianran Ma <qianranm@luchentech.com>

285fe7ba

16 Aug, 2023 1 commit

[devops] add large-scale distributed test marker (#4452) · 26e29d58

Hongxin Liu authored Aug 16, 2023

* [test] remove cpu marker

* [test] remove gpu marker

* [test] update pytest markers

* [ci] update unit test ci

26e29d58

14 Aug, 2023 1 commit

[doc] update Coati README (#4405) · 6d41c3f2

Wenhao Chen authored Aug 14, 2023

* style: apply formatter

* fix: add outdated warnings

* docs: add dataset format and polish

* docs: polish README

* fix: fix json format

* fix: fix typos

* revert: revert 7b example

6d41c3f2

02 Aug, 2023 1 commit

[chat] fix bugs and add unit tests (#4213) · da4f7b85

Wenhao Chen authored Aug 02, 2023

* style: rename replay buffer

Experience replay is typically for off policy algorithms.
Use this name in PPO maybe misleading.

* fix: fix wrong zero2 default arg

* test: update experience tests

* style: rename zero_pad fn

* fix: defer init in CycledDataLoader

* test: add benchmark test

* style: rename internal fn of generation

* style: rename internal fn of lora

* fix: remove unused loss fn

* fix: remove unused utils fn

* refactor: remove generate_with_actor fn

* fix: fix type annotation

* test: add models tests

* fix: skip llama due to long execution time

* style: modify dataset

* style: apply formatter

* perf: update reward dataset

* fix: fix wrong IGNORE_INDEX in sft dataset

* fix: remove DataCollatorForSupervisedDataset

* test: add dataset tests

* style: apply formatter

* style: rename test_ci to test_train

* feat: add llama in inference

* test: add inference tests

* test: change test scripts directory

* fix: update ci

* fix: fix typo

* fix: skip llama due to oom

* fix: fix file mod

* style: apply formatter

* refactor: remove duplicated llama_gptq

* style: apply formatter

* to: update rm test

* feat: add tokenizer arg

* feat: add download model script

* test: update train tests

* fix: modify gemini load and save pretrained

* test: update checkpoint io test

* to: modify nproc_per_node

* fix: do not remove existing dir

* fix: modify save path

* test: add random choice

* fix: fix sft path

* fix: enlarge nproc_per_node to avoid oom

* fix: add num_retry

* fix: make lora config of rm and critic consistent

* fix: add warning about lora weights

* fix: skip some gpt2 tests

* fix: remove grad ckpt in rm and critic due to errors

* refactor: directly use Actor in train_sft

* test: add more arguments

* fix: disable grad ckpt when using lora

* fix: fix save_pretrained and related tests

* test: enable zero2 tests

* revert: remove useless fn

* style: polish code

* test: modify test args

da4f7b85

01 Aug, 2023 1 commit
- [chat] fix compute_approx_kl (#4338) · 75c53890
  Wenhao Chen authored Aug 01, 2023
  
  75c53890
28 Jul, 2023 1 commit
- support session-based training (#4313) · 5187c96b
  Yuanchen authored Jul 28, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  5187c96b
26 Jul, 2023 11 commits
- [NFC] polish applications/Chat/coati/models/utils.py codestyle (#4277) · 09914053
  yuxuan-lou authored Jul 19, 2023
```
* [NFC] polish colossalai/context/random/__init__.py code style

* [NFC] polish applications/Chat/coati/models/utils.py code style
```
  09914053
- [NFC] polish applications/Chat/coati/trainer/strategies/base.py code style (#4278) · 9e512938
  Zirui Zhu authored Jul 19, 2023
  
  9e512938
- applications/Chat/.gitignore (#4279) · c972d653
  Ziheng Qin authored Jul 19, 2023
```
Co-authored-by: henryqin1997 <henryqin1997@gamil.com>
```
  c972d653
- [NFC] polish applications/Chat/coati/models/generation.py code style (#4275) · 709e121c
  RichardoLuo authored Jul 18, 2023
  
  709e121c
- [NFC] polish applications/Chat/inference/server.py code style (#4274) · dc1b6127
  Yuanchen authored Jul 18, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  dc1b6127
- [NFC] fix format of application/Chat/coati/trainer/utils.py (#4273) · caa44330
  アマデウス authored Jul 18, 2023
  
  caa44330
- [NFC] polish applications/Chat/examples/train_reward_model.py code style (#4271) · 1ce997da
  Xu Kai authored Jul 18, 2023
  
  1ce997da
- [NFC] polish applications/Chat/coati/trainer/base.py code style (#4260) · 798cb729
  shenggan authored Jul 18, 2023
  
  798cb729
- [NFC] polish applications/Chat/coati/dataset/sft_dataset.py code style (#4259) · b2debdc0
  Zheng Zangwei (Alex Zheng) authored Jul 18, 2023
  
  b2debdc0
- [NFC] policy applications/Chat/examples/ray/mmmt_prompt.py code style (#4250) · dee1c963
  CZYCW authored Jul 18, 2023
  
  dee1c963
- [NFC] polish applications/Chat/coati/models/base/actor.py code style (#4248) · 77c469e1
  Junming Wu authored Jul 18, 2023
  
  77c469e1