Commits · af952673f758c71126b27de8b32bdf5df8f74b69 · OpenDAS / ColossalAI

20 Dec, 2023 1 commit
- polish readme in application/chat (#5194) · af952673
  BlueRum authored Dec 20, 2023
  
  af952673
15 Dec, 2023 2 commits
- [doc] update pytorch version in documents. (#5177) · 681d9b12
  flybird11111 authored Dec 15, 2023
```
* fix

aaa

fix

fix

fix

* fix

* fix

* test ci

* fix ci

fix

* update pytorch version in documents
```
  681d9b12
- Fix ColossalEval (#5186) · 3ff60d13
  Yuanchen authored Dec 15, 2023
```
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
```
  3ff60d13
12 Dec, 2023 2 commits

[shardformer] llama support DistCrossEntropy (#5176) · 79718fae

flybird11111 authored Dec 13, 2023



* fix

aaa

fix

fix

fix

* fix

* fix

* test ci

* fix ci

fix

* llama support dist-cross

fix

fix

fix

fix

fix

fix

fix

fix

* fix

* fix

* fix

fix

* test ci

* test ci

* fix

* [Colossal-Llama-2] Add finetuning Colossal-Llama-2 example (#4878)

* Add finetuning Colossal-Llama-2 example

* Add finetuning Colossal-Llama-2 example 2

* Add finetuning Colossal-Llama-2 example and support NEFTuning

* Add inference example and refine neftune

* Modify readme file

* update the imports

---------
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
Co-authored-by: Camille Zhong <44392324+Camille7777@users.noreply.github.com>

* llama support dist-cross

fix

fix

fix

fix

fix

fix

fix

fix

* fix

* fix

* fix

fix

* test ci

* test ci

* fix

* fix ci

* fix ci

---------
Co-authored-by: Yuanchen <70520919+chengeharrison@users.noreply.github.com>
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
Co-authored-by: Camille Zhong <44392324+Camille7777@users.noreply.github.com>

79718fae

[ColossalEval] Support GSM, Data Leakage Evaluation and Tensor Parallel (#5169) · cefdc326

Yuanchen authored Dec 12, 2023



* Support GSM, Data Leakage Evaluation and Tensor Parallel

* remove redundant code and update inference.py in examples/gpt_evaluation

---------
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>

cefdc326

11 Dec, 2023 1 commit
- [colossalqa] fix pangu api (#5170) · b07a6f4e
  Michelle authored Dec 11, 2023
```
* fix pangu api

* add comment
```
  b07a6f4e
08 Dec, 2023 1 commit
- [gemini] hotfix NaN loss while using Gemini + tensor_parallel (#5150) · 21aa5de0
  flybird11111 authored Dec 08, 2023
```
* fix

aaa

fix

fix

fix

* fix

* fix

* test ci

* fix ci

fix
```
  21aa5de0
07 Dec, 2023 1 commit

[Colossal-Llama-2] Add finetuning Colossal-Llama-2 example (#4878) · b3971044

Yuanchen authored Dec 07, 2023



* Add finetuning Colossal-Llama-2 example

* Add finetuning Colossal-Llama-2 example 2

* Add finetuning Colossal-Llama-2 example and support NEFTuning

* Add inference example and refine neftune

* Modify readme file

* update the imports

---------
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
Co-authored-by: Camille Zhong <44392324+Camille7777@users.noreply.github.com>

b3971044

05 Dec, 2023 1 commit
- fix (#5158) · 3dbbf83f
  flybird11111 authored Dec 05, 2023
```
fix
```
  3dbbf83f
01 Dec, 2023 1 commit
- [doc] fix colossalqa document (#5146) · 368b5e3d
  Michelle authored Dec 01, 2023
```
* fix doc

* modify doc
```
  368b5e3d
30 Nov, 2023 2 commits

[ColossalQA] refactor server and webui & add new feature (#5138) · c7fd9a52
Michelle authored Nov 30, 2023
```
* refactor server and webui & add new feature

* add requirements

* modify readme and ui
```
c7fd9a52

[plugin]fix 3d checkpoint load when booster boost without optimizer. (#5135) · 2a2ec49a

flybird11111 authored Nov 30, 2023

* fix 3d checkpoint load when booster boost without optimizer

fix 3d checkpoint load when booster boost without optimizer

* test ci

* revert ci

* fix

fix

2a2ec49a

29 Nov, 2023 5 commits
- [format] applied code formatting on changed files in pull request 5115 (#5118) · f6731db6
  github-actions[bot] authored Nov 29, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
  f6731db6
- [format] applied code formatting on changed files in pull request 5124 (#5125) · 9b36640f
  github-actions[bot] authored Nov 29, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
  9b36640f
- [format] applied code formatting on changed files in pull request 5088 (#5127) · d10ee42f
  github-actions[bot] authored Nov 29, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
  d10ee42f
- fix typo change JOSNL TO JSONL etc. (#5116) · 9110406a
  digger yu authored Nov 29, 2023
  
  9110406a
- [doc] updated paper citation (#5131) · 2899cfda
  Frank Lee authored Nov 29, 2023
  
  2899cfda
28 Nov, 2023 4 commits

[doc] add moe news (#5128) · 177c79f2
binmakeswell authored Nov 28, 2023
```
* [doc] add moe news

* [doc] add moe news

* [doc] add moe news
```
177c79f2

[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) · 7172459e

Wenhao Chen authored Nov 28, 2023



* [shardformer] implement policy for all GPT-J models and test

* [shardformer] support interleaved pipeline parallel for bert finetune

* [shardformer] shardformer support falcon (#4883)

* [shardformer]: fix interleaved pipeline for bert model (#5048)

* [hotfix]: disable seq parallel for gptj and falcon, and polish code (#5093)

* Add Mistral support for Shardformer (#5103)

* [shardformer] add tests to mistral (#5105)

---------
Co-authored-by: Pengtai Xu <henryxu880@gmail.com>
Co-authored-by: ppt0011 <143150326+ppt0011@users.noreply.github.com>
Co-authored-by: flybird11111 <1829166702@qq.com>
Co-authored-by: eric8607242 <e0928021388@gmail.com>

7172459e

[hotfix] fixed memory usage of shardformer module replacement (#5122) · 126cf180
アマデウス authored Nov 28, 2023

126cf180

[FEATURE] Add Safety Eval Datasets to ColossalEval (#5095) · 7b789f4d

Zian(Andy) Zheng authored Nov 27, 2023



* add safetybench and cvalues(responsibility) eval dataset

* Modify code according to review suggestions

---------
Co-authored-by: Orion-Zheng <zhengzian@u.nus.edu>

7b789f4d

27 Nov, 2023 1 commit
- [nfc] fix typo change directoty to directory (#5111) · d5661f0f
  digger yu authored Nov 27, 2023
  
  d5661f0f
24 Nov, 2023 1 commit
- fix typo change lazy_iniy to lazy_init (#5099) · 2bdf76f1
  digger yu authored Nov 24, 2023
  
  2bdf76f1
23 Nov, 2023 2 commits

remove duplicate import (#5100) · 68fcaa22
Xuanlei Zhao authored Nov 23, 2023

68fcaa22

[Feature] Add document retrieval QA (#5020) · e53e729d

YeAnbang authored Nov 23, 2023



* add langchain

* add langchain

* Add files via upload

* add langchain

* fix style

* fix style: remove extra space

* add pytest; modified retriever

* add pytest; modified retriever

* add tests to build_on_pr.yml

* fix build_on_pr.yml

* fix build on pr; fix environ vars

* seperate unit tests for colossalqa from build from pr

* fix container setting; fix environ vars

* commented dev code

* add incremental update

* remove stale code

* fix style

* change to sha3 224

* fix retriever; fix style; add unit test for document loader

* fix ci workflow config

* fix ci workflow config

* add set cuda visible device script in ci

* fix doc string

* fix style; update readme; refactored

* add force log info

* change build on pr, ignore colossalqa

* fix docstring, captitalize all initial letters

* fix indexing; fix text-splitter

* remove debug code, update reference

* reset previous commit

* update LICENSE update README add key-value mode, fix bugs

* add files back

* revert force push

* remove junk file

* add test files

* fix retriever bug, add intent classification

* change conversation chain design

* rewrite prompt and conversation chain

* add ui v1

* ui v1

* fix atavar

* add header

* Refactor the RAG Code and support Pangu

* Refactor the ColossalQA chain to Object-Oriented Programming and the UI demo.

* resolved conversation. tested scripts under examples. web demo still buggy

* fix ci tests

* Some modifications to add ChatGPT api

* modify llm.py and remove unnecessary files

* Delete applications/ColossalQA/examples/ui/test_frontend_input.json

* Remove OpenAI api key

* add colossalqa

* move files

* move files

* move files

* move files

* fix style

* Add Readme and fix some bugs.

* Add something to readme and modify some code

* modify a directory name for clarity

* remove redundant directory

* Correct a type in  llm.py

* fix AI prefix

* fix test_memory.py

* fix conversation

* fix some erros and typos

* Fix a missing import in RAG_ChatBot.py

* add colossalcloud LLM wrapper, correct issues in code review

---------
Co-authored-by: YeAnbang <anbangy2@outlook.com>
Co-authored-by: Orion-Zheng <zheng_zian@u.nus.edu>
Co-authored-by: Zian(Andy) Zheng <62330719+Orion-Zheng@users.noreply.github.com>
Co-authored-by: Orion-Zheng <zhengzian@u.nus.edu>

e53e729d

22 Nov, 2023 5 commits
- [npu] add npu support for hybrid plugin and llama (#5090) · 3acbf6d4
  Xuanlei Zhao authored Nov 22, 2023
```
* llama 3d

* update

* fix autocast
```
  3acbf6d4
- [shardformer]fix flash attention, when mask is casual, just don't unpad it (#5084) · aae49663
  flybird11111 authored Nov 22, 2023
```
* fix flash attn

* fix

fix
```
  aae49663
- [Hotfix] Fix model policy matching strategy in ShardFormer (#5064) · 75af66cd
  Zhongkai Zhao authored Nov 22, 2023
```
* hotfix/Fix get model policy strategy in ShardFormer

* fix bug in auto policy
```
  75af66cd
- [gemini]fix gemini optimzer, saving Shardformer in Gemini got list assignment... · 4ccb9ded
  flybird11111 authored Nov 22, 2023
```
[gemini]fix gemini optimzer, saving Shardformer in Gemini got list assignment index out of range (#5085)
```
  4ccb9ded
- [nfc] fix typo and author name (#5089) · 0d482302
  digger yu authored Nov 22, 2023
  
  0d482302
21 Nov, 2023 3 commits
- [nfc] fix typo in docs/ (#4972) · fd3567e0
  digger yu authored Nov 21, 2023
  
  fd3567e0
- fix thrust-transform-reduce error (#5078) · dce05da5
  Jun Gao authored Nov 21, 2023
  
  dce05da5
- [inference] refactor examples and fix schedule (#5077) · 1cd7efc5
  Hongxin Liu authored Nov 21, 2023
```
* [setup] refactor infer setup

* [hotfix] fix infenrece behavior on 1 1 gpu

* [exmaple] refactor inference examples
```
  1cd7efc5
20 Nov, 2023 7 commits

[hotfix/hybridengine] Fix init model with random parameters in benchmark (#5074) · 4e3959d3
Bin Jia authored Nov 20, 2023
```
* fix init model with random parameters

* fix example
```
4e3959d3
[format] applied code formatting on changed files in pull request 5067 (#5072) · 8921a73c
github-actions[bot] authored Nov 20, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
8921a73c
[inference] update examples and engine (#5073) · fb103cfd
Xu Kai authored Nov 20, 2023
```
* update examples and engine

* fix choices

* update example
```
fb103cfd
[hotfix/hybridengine] fix bug when tp*pp size = 1 (#5069) · 0c7d8beb
Bin Jia authored Nov 20, 2023

0c7d8beb

[npu] add npu support for gemini and zero (#5067) · e5ce4c8e

Hongxin Liu authored Nov 20, 2023

* [npu] setup device utils (#5047)

* [npu] add npu device support

* [npu] support low level zero

* [test] update npu zero plugin test

* [hotfix] fix import

* [test] recover tests

* [npu] gemini support npu (#5052)

* [npu] refactor device utils

* [gemini] support npu

* [example] llama2+gemini support npu

* [kernel] add arm cpu adam kernel (#5065)

* [kernel] add arm cpu adam

* [optim] update adam optimizer

* [kernel] arm cpu adam remove bf16 support

e5ce4c8e

[misc] remove outdated submodule (#5070) · 8d56c9c3
Hongxin Liu authored Nov 20, 2023

8d56c9c3

[Kernels]added flash-decoidng of triton (#5063) · bce91970

Cuiqing Li (李崔卿) authored Nov 20, 2023



* added flash-decoidng of triton based on lightllm kernel

* add req

* clean

* clean

* delete build.sh

---------
Co-authored-by: cuiqing.li <lixx336@gmail.com>

bce91970