Commits · 6a21f96a87948971e7c2e96f2cf2e563304e0c7a · OpenDAS / ColossalAI

10 Oct, 2023 1 commit

[doc] update advanced tutorials, training gpt with hybrid parallelism (#4866) · 6a21f96a

flybird11111 authored Oct 10, 2023

* [doc]update advanced tutorials, training gpt with hybrid parallelism

* [doc]update advanced tutorials, training gpt with hybrid parallelism

* update vit tutorials

* update vit tutorials

* update vit tutorials

* update vit tutorials

* update en/train_vit_with_hybrid_parallel.py

* fix

* resolve comments

* fix

6a21f96a

07 Oct, 2023 5 commits
- [nfc] fix minor typo in README (#4846) · 8aed02b9
  Blagoy Simandoff authored Oct 07, 2023
  
  8aed02b9
- [NFC] polish code style (#4799) · cd6a962e
  Camille Zhong authored Sep 27, 2023
  
  cd6a962e
- [NFC] polish colossalai/inference/quant/gptq/cai_gptq/__init__.py code style (#4792) · 07ed155e
  Michelle authored Sep 27, 2023
  
  07ed155e
- polish code for gptq (#4793) · eef96e08
  littsk authored Sep 25, 2023
  
  eef96e08
- [checkpointio] hotfix torch 2.0 compatibility (#4824) · cb3a25a0
  Hongxin Liu authored Oct 07, 2023
  
  cb3a25a0
06 Oct, 2023 2 commits
- Merge pull request #4856 from KKZ20/test/model_support_for_low_level_zero · ad23460c
  ppt0011 authored Oct 06, 2023
```
[test] remove the redundant code of model output transformation in torchrec
```
  ad23460c
- Merge pull request #4858 from Shawlleyw/main · 81ee91f2
  ppt0011 authored Oct 06, 2023
```
[doc]: typo in document of booster low_level_zero plugin
```
  81ee91f2
05 Oct, 2023 2 commits
- fix: typo in comment of low_level_zero plugin · c97a3523
  shaoyuw authored Oct 05, 2023
  
  c97a3523
- [test] modify model supporting part of low_level_zero plugin (including correspoding docs) · db40e086
  Zhongkai Zhao authored Oct 05, 2023
  
  db40e086
04 Oct, 2023 2 commits
- [infer] fix test bug (#4838) · d1fcc0fa
  Xu Kai authored Oct 04, 2023
```
* fix test bug

* delete useless code

* fix typo
```
  d1fcc0fa
- [inference]fix import bug and delete down useless init (#4830) · 013a4bed
  Jianghai authored Oct 04, 2023
```
* fix import bug and release useless init

* fix

* fix

* fix
```
  013a4bed
02 Oct, 2023 2 commits

[Infer] Serving example w/ ray-serve (multiple GPU case) (#4841) · 573f2705

Yuanheng Zhao authored Oct 02, 2023

* fix imports

* add ray-serve with Colossal-Infer tp

* trivial: send requests script

* add README

* fix worker port

* fix readme

* use app builder and autoscaling

* trivial: input args

* clean code; revise readme

* testci (skip example test)

* use auto model/tokenizer

* revert imports fix (fixed in other PRs)

573f2705

[Infer] Colossal-Inference serving example w/ TorchServe (single GPU case) (#4771) · 3a74eb4b

Yuanheng Zhao authored Oct 02, 2023

* add Colossal-Inference serving example w/ TorchServe

* add dockerfile

* fix dockerfile

* fix dockerfile: fix commit hash, install curl

* refactor file structure

* revise readme

* trivial

* trivial: dockerfile format

* clean dir; revise readme

* fix comments: fix imports and configs

* fix formats

* remove unused requirements

3a74eb4b

28 Sep, 2023 2 commits
- update Colossal (#4832) · ed06731e
  Tong Li authored Sep 28, 2023
  
  ed06731e
- add autotune (#4822) · c3bef204
  Xu Kai authored Sep 28, 2023
  
  c3bef204
27 Sep, 2023 8 commits

[doc] update slack link (#4823) · 822051d8
binmakeswell authored Sep 27, 2023

822051d8
Update Qwen-7B results (#4821) · 1fa8c5e0
Yuanchen authored Sep 27, 2023
```
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
```
1fa8c5e0

[chat] fix gemini strategy (#4698) · be400a09

flybird11111 authored Sep 27, 2023

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* [chat] fix gemini strategy

* g# This is a combination of 2 commits.

[chat] fix gemini strategy

fox

* [chat] fix gemini strategy

update llama2 example

[chat] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* [fix] fix gemini strategy

* fix

* fix

* fix

* fix

* fix

* Update train_prompts.py

be400a09

fix format (#4815) · bbbcac26
Tong Li authored Sep 27, 2023

bbbcac26
[format] applied code formatting on changed files in pull request 4595 (#4602) · fb46d05c
github-actions[bot] authored Sep 27, 2023
```
Co-authored-by: github-actions <github-actions@github.com>
```
fb46d05c
[hotfix] Correct several erroneous code comments (#4794) · 11f1e426
littsk authored Sep 27, 2023

11f1e426
[hotfix] fix norm type error in zero optimizer (#4795) · 54b3ad89
littsk authored Sep 27, 2023

54b3ad89
[doc] add lazy init docs (#4808) · da15fdb9
Hongxin Liu authored Sep 27, 2023

da15fdb9

26 Sep, 2023 8 commits
- [misc] add last_epoch in CosineAnnealingWarmupLR (#4778) · a2270633
  Yan haixu authored Sep 26, 2023
  
  a2270633
- [hotfix] change llama2 Colossal-LLaMA-2 script filename (#4800) · b6cf0aca
  Chandler-Bing authored Sep 26, 2023
```
change filename:
pretraining.py -> trainin.py
there is no file named pretraing.py. wrong writing
```
  b6cf0aca
- Merge pull request #4805 from TongLi3701/docs/fix · 62b6af10
  Desperado-Jia authored Sep 26, 2023
```
[doc] Update TODO in README of Colossal-LLaMA-2
```
  62b6af10
- update · 8cbce618
  Tong Li authored Sep 26, 2023
  
  8cbce618
- [lazy] support from_pretrained (#4801) · 4965c0da
  Hongxin Liu authored Sep 26, 2023
```
* [lazy] patch from pretrained

* [lazy] fix from pretrained and add tests

* [devops] update ci
```
  4965c0da
- update readme · bd014673
  Tong Li authored Sep 26, 2023
  
  bd014673
- [checkpointio] support unsharded checkpointIO for hybrid parallel (#4774) · 64a08b2d
  Baizhou Zhang authored Sep 26, 2023
```
* support unsharded saving/loading for model

* support optimizer unsharded saving

* update doc

* support unsharded loading for optimizer

* small fix
```
  64a08b2d
- [doc] polish shardformer doc (#4779) · a2db7554
  Baizhou Zhang authored Sep 26, 2023
```
* fix example format in docstring

* polish shardformer doc
```
  a2db7554
25 Sep, 2023 2 commits
- [fix] fix weekly runing example (#4787) · 26cd6d85
  flybird11111 authored Sep 25, 2023
```
* [fix] fix weekly runing example

* [fix] fix weekly runing example
```
  26cd6d85
- [doc] add llama2 domain-specific solution news (#4789) · d512a4d3
  binmakeswell authored Sep 25, 2023
```
* [doc] add llama2 domain-specific solution news
```
  d512a4d3
24 Sep, 2023 2 commits
- [feature] ColossalEval: Evaluation Pipeline for LLMs (#4786) · ce777853
  Yuanchen authored Sep 24, 2023
```
* Add ColossalEval

* Delete evaluate in Chat

---------
Co-authored-by: Xu Yuanchen <yuanchen.xu00@gmail.com>
Co-authored-by: Tong Li <tong.li352711588@gmail.com>
```
  ce777853
- initial commit: add colossal llama 2 (#4784) · 74aa7d96
  Tong Li authored Sep 24, 2023
  
  74aa7d96
22 Sep, 2023 4 commits

[release] update version (#4775) · 4146f1c0
Hongxin Liu authored Sep 22, 2023
```
* [release] update version

* [doc] revert versions
```
4146f1c0

[inference] chatglm2 infer demo (#4724) · ce7ade38

Jianghai authored Sep 22, 2023

* add chatglm2

* add

* gather needed kernels

* fix some bugs

* finish context forward

* finish context stage

* fix

* add

* pause

* add

* fix bugs

* finish chatglm

* fix bug

* change some logic

* fix bugs

* change some logics

* add

* add

* add

* fix

* fix tests

* fix

ce7ade38

[feature] add gptq for inference (#4754) · 946ab56c

Xu Kai authored Sep 22, 2023

* [gptq] add gptq kernel (#4416)

* add gptq

* refactor code

* fix tests

* replace auto-gptq

* rname inferance/quant

* refactor test

* add auto-gptq as an option

* reset requirements

* change assert and check auto-gptq

* add import warnings

* change test flash attn version

* remove example

* change requirements of flash_attn

* modify tests

* [skip ci] change requirements-test

* [gptq] faster gptq cuda kernel (#4494)

* [skip ci] add cuda kernels

* add license

* [skip ci] fix max_input_len

* format files & change test size

* [skip ci]

* [gptq] add gptq tensor parallel (#4538)

* add gptq tensor parallel

* add gptq tp

* delete print

* add test gptq check

* add test auto gptq check

* [gptq] combine gptq and kv cache manager (#4706)

* combine gptq and kv cache manager

* add init bits

* delete useless code

* add model path

* delete usless print and update test

* delete usless import

* move option gptq to shard config

* change replace linear to shardformer

* update bloom policy

* delete useless code

* fix import bug and delete uselss code

* change colossalai/gptq to colossalai/quant/gptq

* update import linear for tests

* delete useless code and mv gptq_kernel to kernel directory

* fix triton kernel

* add triton import

946ab56c

[bug] Fix the version check bug in colossalai run when generating the cmd. (#4713) · 1e0e0808
littsk authored Sep 22, 2023
```
* Fix the version check bug in colossalai run when generating the cmd.

* polish code
```
1e0e0808