Commits · cefdc3261534408c83d338749fef183bf0f3fde7 · OpenDAS / ColossalAI

".github/vscode:/vscode.git/clone" did not exist on "03e52ecba3b60b04b552d82809043e5642509005"

12 Dec, 2023 1 commit

[ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169) · cefdc326

Yuanchen authored Dec 12, 2023



* Support GSM, Data Leakage Evaluation and Tensor Parallel

* remove redundant code and update inference.py in examples/gpt_evaluation

---------
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>

cefdc326

11 Dec, 2023 1 commit
- [colossalqa] fix pangu api (#5170) · b07a6f4e
  Michelle authored Dec 11, 2023
```
* fix pangu api

* add comment
```
  b07a6f4e
07 Dec, 2023 1 commit

[Colossal-Llama-2] Add finetuning Colossal-Llama-2 example (#4878) · b3971044

Yuanchen authored Dec 07, 2023



* Add finetuning Colossal-Llama-2 example

* Add finetuning Colossal-Llama-2 example 2

* Add finetuning Colossal-Llama-2 example and support NEFTuning

* Add inference example and refine neftune

* Modify readme file

* update the imports

---------
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
Co-authored-by: Camille Zhong <44392324+Camille7777@users.noreply.github.com>

b3971044

01 Dec, 2023 1 commit
- [doc] fix colossalqa document (#5146) · 368b5e3d
  Michelle authored Dec 01, 2023
```
* fix doc

* modify doc
```
  368b5e3d
30 Nov, 2023 1 commit
- [ColossalQA] refactor server and webui & add new feature (#5138) · c7fd9a52
  Michelle authored Nov 30, 2023
```
* refactor server and webui & add new feature

* add requirements

* modify readme and ui
```
  c7fd9a52
29 Nov, 2023 2 commits
- [format] applied code formatting on changed files in pull request 5115 (#5118) · f6731db6
  github-actions[bot] authored Nov 29, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
  f6731db6
- fix typo change JOSNL TO JSONL etc. (#5116) · 9110406a
  digger yu authored Nov 29, 2023
  
  9110406a
28 Nov, 2023 1 commit

[FEATURE] Add Safety Eval Datasets to ColossalEval (#5095) · 7b789f4d

Zian(Andy) Zheng authored Nov 27, 2023



* add safetybench and cvalues(responsibility) eval dataset

* Modify code according to review suggestions

---------
Co-authored-by: Orion-Zheng <zhengzian@u.nus.edu>

7b789f4d

27 Nov, 2023 1 commit
- [nfc] fix typo change directoty to directory (#5111) · d5661f0f
  digger yu authored Nov 27, 2023
  
  d5661f0f
23 Nov, 2023 1 commit

[Feature] Add document retrieval QA (#5020) · e53e729d

YeAnbang authored Nov 23, 2023



* add langchain

* add langchain

* Add files via upload

* add langchain

* fix style

* fix style: remove extra space

* add pytest; modified retriever

* add pytest; modified retriever

* add tests to build_on_pr.yml

* fix build_on_pr.yml

* fix build on pr; fix environ vars

* seperate unit tests for colossalqa from build from pr

* fix container setting; fix environ vars

* commented dev code

* add incremental update

* remove stale code

* fix style

* change to sha3 224

* fix retriever; fix style; add unit test for document loader

* fix ci workflow config

* fix ci workflow config

* add set cuda visible device script in ci

* fix doc string

* fix style; update readme; refactored

* add force log info

* change build on pr, ignore colossalqa

* fix docstring, captitalize all initial letters

* fix indexing; fix text-splitter

* remove debug code, update reference

* reset previous commit

* update LICENSE update README add key-value mode, fix bugs

* add files back

* revert force push

* remove junk file

* add test files

* fix retriever bug, add intent classification

* change conversation chain design

* rewrite prompt and conversation chain

* add ui v1

* ui v1

* fix atavar

* add header

* Refactor the RAG Code and support Pangu

* Refactor the ColossalQA chain to Object-Oriented Programming and the UI demo.

* resolved conversation. tested scripts under examples. web demo still buggy

* fix ci tests

* Some modifications to add ChatGPT api

* modify llm.py and remove unnecessary files

* Delete applications/ColossalQA/examples/ui/test_frontend_input.json

* Remove OpenAI api key

* add colossalqa

* move files

* move files

* move files

* move files

* fix style

* Add Readme and fix some bugs.

* Add something to readme and modify some code

* modify a directory name for clarity

* remove redundant directory

* Correct a type in  llm.py

* fix AI prefix

* fix test_memory.py

* fix conversation

* fix some erros and typos

* Fix a missing import in RAG_ChatBot.py

* add colossalcloud LLM wrapper, correct issues in code review

---------
Co-authored-by: YeAnbang <anbangy2@outlook.com>
Co-authored-by: Orion-Zheng <zheng_zian@u.nus.edu>
Co-authored-by: Zian(Andy) Zheng <62330719+Orion-Zheng@users.noreply.github.com>
Co-authored-by: Orion-Zheng <zhengzian@u.nus.edu>

e53e729d

14 Nov, 2023 1 commit
- fix wrong EOS token in ColossalChat · 43ad0d9e
  Orion-Zheng authored Nov 14, 2023
  
  43ad0d9e
09 Nov, 2023 1 commit
- Support mtbench (#5025) · 239cd92e
  Yuanchen authored Nov 09, 2023
```
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
```
  239cd92e
31 Oct, 2023 1 commit
- fix ColossalEval (#4992) · abe071b6
  Yuanchen authored Oct 31, 2023
```
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
```
  abe071b6
17 Oct, 2023 1 commit
- [format] applied code formatting on changed files in pull request 4908 (#4918) · a41cf88e
  github-actions[bot] authored Oct 17, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
  a41cf88e
16 Oct, 2023 1 commit

Update flash_attention_patch.py · 7768afba

Zian(Andy) Zheng authored Oct 13, 2023

To be compatible with the new change in the Transformers library, where a new argument 'padding_mask' was added to forward function of attention layer.
https://github.com/huggingface/transformers/pull/25598

7768afba

10 Oct, 2023 3 commits
- Update README.md · 652adc22
  Camille Zhong authored Oct 10, 2023
  
  652adc22
- Update README.md · afe10a85
  Camille Zhong authored Oct 10, 2023
  
  afe10a85
- Update modelscope link in README.md · 3043d5d6
  Camille Zhong authored Oct 10, 2023
```
add modelscope link
```
  3043d5d6
28 Sep, 2023 1 commit
- update Colossal (#4832) · ed06731e
  Tong Li authored Sep 28, 2023
  
  ed06731e
27 Sep, 2023 3 commits

[doc] update slack link (#4823) · 822051d8
binmakeswell authored Sep 27, 2023

822051d8
Update Qwen-7B results (#4821) · 1fa8c5e0
Yuanchen authored Sep 27, 2023
```
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
```
1fa8c5e0

[chat] fix gemini strategy (#4698) · be400a09

flybird11111 authored Sep 27, 2023

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* g# This is a combination of 2 commits.

[chat] fix gemini strategy

fox

* [chat] fix gemini strategy

update llama2 example

[chat] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* fix

* fix

* fix

* fix

* fix

* Update train_prompts.py

be400a09

26 Sep, 2023 3 commits
- [hotfix] change llama2 Colossal-LLaMA-2 script filename (#4800) · b6cf0aca
  Chandler-Bing authored Sep 26, 2023
```
change filename:
pretraining.py -> trainin.py
there is no file named pretraing.py. wrong writing
```
  b6cf0aca
- update · 8cbce618
  Tong Li authored Sep 26, 2023
  
  8cbce618
- update readme · bd014673
  Tong Li authored Sep 26, 2023
  
  bd014673
25 Sep, 2023 1 commit
- [doc] add llama2 domain-specific solution news (#4789) · d512a4d3
  binmakeswell authored Sep 25, 2023
```
* [doc] add llama2 domain-specific solution news
```
  d512a4d3
24 Sep, 2023 2 commits
- [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) · ce777853
  Yuanchen authored Sep 24, 2023
```
* Add ColossalEval

* Delete evaluate in Chat

---------
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
Co-authored-by: Tong Li <tong.li352711588@gmail.com>
```
  ce777853
- initial commit: add colossal llama 2 (#4784) · 74aa7d96
  Tong Li authored Sep 24, 2023
  
  74aa7d96
21 Sep, 2023 1 commit
- [chat]: add lora merge weights config (#4766) · 901ab1ee
  Wenhao Chen authored Sep 21, 2023
```
* feat: modify lora merge weights fn

* feat: add lora merge weights config
```
  901ab1ee
20 Sep, 2023 1 commit

[chat]: update rm, add wandb and fix bugs (#4471) · 7b9b8644

Wenhao Chen authored Sep 20, 2023



* feat: modify forward fn of critic and reward model

* feat: modify calc_action_log_probs

* to: add wandb in sft and rm trainer

* feat: update train_sft

* feat: update train_rm

* style: modify type annotation and add warning

* feat: pass tokenizer to ppo trainer

* to: modify trainer base and maker base

* feat: add wandb in ppo trainer

* feat: pass tokenizer to generate

* test: update generate fn tests

* test: update train tests

* fix: remove action_mask

* feat: remove unused code

* fix: fix wrong ignore_index

* fix: fix mock tokenizer

* chore: update requirements

* revert: modify make_experience

* fix: fix inference

* fix: add padding side

* style: modify _on_learn_batch_end

* test: use mock tokenizer

* fix: use bf16 to avoid overflow

* fix: fix workflow

* [chat] fix gemini strategy

* [chat] fix

* sync: update colossalai strategy

* fix: fix args and model dtype

* fix: fix checkpoint test

* fix: fix requirements

* fix: fix missing import and wrong arg

* fix: temporarily skip gemini test in stage 3

* style: apply pre-commit

* fix: temporarily skip gemini test in stage 1&2

---------
Co-authored-by: Mingyan Jiang <1829166702@qq.com>

7b9b8644

19 Sep, 2023 1 commit

[misc] update pre-commit and run all files (#4752) · 079bf3cb

Hongxin Liu authored Sep 19, 2023

* [misc] update pre-commit

* [misc] run pre-commit

* [misc] remove useless configuration files

* [misc] ignore cuda for clang-format

079bf3cb

15 Sep, 2023 1 commit
- Optimized some syntax errors in the documentation and code under applications/ (#4127) · e4fc57c3
  digger yu authored Sep 15, 2023
```
Co-authored-by: flybird11111 <1829166702@qq.com>
```
  e4fc57c3
30 Aug, 2023 1 commit
- fix colossalai version in coati examples · c648dc09
  Ying Liu authored Aug 30, 2023
  
  c648dc09
29 Aug, 2023 1 commit

[coati] add chatglm model (#4539) · 1467e3b4

yingliu-hpc authored Aug 29, 2023

* update configuration of chatglm and add support in coati

* add unit test & update chatglm default config & fix bos index issue

* remove chatglm due to oom

* add dataset pkg in requirement-text

* fix parameter issue in test_models

* add ref in tokenize & rm unnessary parts

* separate source & target tokenization in chatglm

* add unit test to chatglm

* fix test dataset issue

* update truncation of chatglm

* fix Colossalai version

* fix colossal ai version in test

1467e3b4

21 Aug, 2023 1 commit

[chat] update config and prompt (#4139) · 285fe7ba

Michelle authored Aug 21, 2023



* update config and prompt

* update config

---------
Co-authored-by: Qianran Ma <qianranm@luchentech.com>

285fe7ba

16 Aug, 2023 1 commit

[devops] add large-scale distributed test marker (#4452) · 26e29d58

Hongxin Liu authored Aug 16, 2023

* [test] remove cpu marker

* [test] remove gpu marker

* [test] update pytest markers

* [ci] update unit test ci

26e29d58

14 Aug, 2023 1 commit

[doc] update Coati README (#4405) · 6d41c3f2

Wenhao Chen authored Aug 14, 2023

* style: apply formatter

* fix: add outdated warnings

* docs: add dataset format and polish

* docs: polish README

* fix: fix json format

* fix: fix typos

* revert: revert 7b example

6d41c3f2

02 Aug, 2023 1 commit

[chat] fix bugs and add unit tests (#4213) · da4f7b85

Wenhao Chen authored Aug 02, 2023

* style: rename replay buffer

Experience replay is typically for off policy algorithms.
Use this name in PPO maybe misleading.

* fix: fix wrong zero2 default arg

* test: update experience tests

* style: rename zero_pad fn

* fix: defer init in CycledDataLoader

* test: add benchmark test

* style: rename internal fn of generation

* style: rename internal fn of lora

* fix: remove unused loss fn

* fix: remove unused utils fn

* refactor: remove generate_with_actor fn

* fix: fix type annotation

* test: add models tests

* fix: skip llama due to long execution time

* style: modify dataset

* style: apply formatter

* perf: update reward dataset

* fix: fix wrong IGNORE_INDEX in sft dataset

* fix: remove DataCollatorForSupervisedDataset

* test: add dataset tests

* style: apply formatter

* style: rename test_ci to test_train

* feat: add llama in inference

* test: add inference tests

* test: change test scripts directory

* fix: update ci

* fix: fix typo

* fix: skip llama due to oom

* fix: fix file mod

* style: apply formatter

* refactor: remove duplicated llama_gptq

* style: apply formatter

* to: update rm test

* feat: add tokenizer arg

* feat: add download model script

* test: update train tests

* fix: modify gemini load and save pretrained

* test: update checkpoint io test

* to: modify nproc_per_node

* fix: do not remove existing dir

* fix: modify save path

* test: add random choice

* fix: fix sft path

* fix: enlarge nproc_per_node to avoid oom

* fix: add num_retry

* fix: make lora config of rm and critic consistent

* fix: add warning about lora weights

* fix: skip some gpt2 tests

* fix: remove grad ckpt in rm and critic due to errors

* refactor: directly use Actor in train_sft

* test: add more arguments

* fix: disable grad ckpt when using lora

* fix: fix save_pretrained and related tests

* test: enable zero2 tests

* revert: remove useless fn

* style: polish code

* test: modify test args

da4f7b85

01 Aug, 2023 1 commit
- [chat] fix compute_approx_kl (#4338) · 75c53890
  Wenhao Chen authored Aug 01, 2023
  
  75c53890
28 Jul, 2023 1 commit
- support session-based training (#4313) · 5187c96b
  Yuanchen authored Jul 28, 2023
```
Co-authored-by: Yuanchen Xu <yuanchen.xu00@gmail.com>
```
  5187c96b