- 26 Apr, 2024 1 commit
binmakeswell authored
* [news] llama3 and open-sora v1.1
-
- 25 Apr, 2024 1 commit
Hongxin Liu authored
* [shardformer] fix chatglm policy
* [shardformer] fix chatglm flash attn
* [shardformer] update readme
* [shardformer] fix chatglm init
* [shardformer] fix chatglm test
* [pipeline] fix chatglm merge batch
-
- 23 Apr, 2024 1 commit
binmakeswell authored
* [release] llama3
-
- 08 Apr, 2024 1 commit
Hongxin Liu authored
* [devops] remove post commit ci
* [misc] run pre-commit on all files
* [pre-commit.ci] auto fixes from pre-commit.com hooks (for more information, see https://pre-commit.ci)

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
-
- 25 Mar, 2024 2 commits
binmakeswell authored
* [release] grok-1 inference benchmark
-
Wenhao Chen authored
* fix: simplify merge_batch
* fix: use return_outputs=False to eliminate extra memory consumption
* feat: add return_outputs warning
* style: remove `return_outputs=False` as it is the default value
-
- 22 Mar, 2024 1 commit
binmakeswell authored
* [release] grok-1 inference
-
- 20 Mar, 2024 1 commit
binmakeswell authored
* [doc] update open-sora demo
-
- 18 Mar, 2024 1 commit
binmakeswell authored
* [doc] release Open-Sora 1.0 with model weights
-
- 05 Mar, 2024 3 commits
digger yu authored
-
Hongxin Liu authored
-
binmakeswell authored
* [doc] sora release
-
- 29 Feb, 2024 1 commit
binmakeswell authored
-
- 19 Feb, 2024 2 commits
- 25 Jan, 2024 1 commit
digger yu authored
-
- 09 Jan, 2024 1 commit
Hongxin Liu authored
* update accelerator
* fix timer
* fix amp
* update
* fix
* update bug
* add error raise
* fix autocast
* fix set device
* remove doc accelerator
* update doc
* use nullcontext
* update cpu
* update null context
* change time limit for example
* update
* [npu] polish accelerator code

Co-authored-by: Xuanlei Zhao <xuanlei.zhao@gmail.com>
Co-authored-by: zxl <43881818+oahzxl@users.noreply.github.com>
-
- 08 Jan, 2024 1 commit
binmakeswell authored
* [doc] SwiftInfer release
-
- 07 Jan, 2024 1 commit
binmakeswell authored
* [doc] add Colossal-LLaMA-2-13B
-
- 15 Dec, 2023 1 commit
flybird11111 authored
* fix
* test ci
* fix ci
* update pytorch version in documents
-
- 28 Nov, 2023 2 commits
binmakeswell authored
* [doc] add moe news
-
Wenhao Chen authored
* [shardformer] implement policy for all GPT-J models and test
* [shardformer] support interleaved pipeline parallel for bert finetune
* [shardformer] shardformer support falcon (#4883)
* [shardformer]: fix interleaved pipeline for bert model (#5048)
* [hotfix]: disable seq parallel for gptj and falcon, and polish code (#5093)
* Add Mistral support for Shardformer (#5103)
* [shardformer] add tests to mistral (#5105)

Co-authored-by: Pengtai Xu <henryxu880@gmail.com>
Co-authored-by: ppt0011 <143150326+ppt0011@users.noreply.github.com>
Co-authored-by: flybird11111 <1829166702@qq.com>
Co-authored-by: eric8607242 <e0928021388@gmail.com>
-
- 27 Nov, 2023 1 commit
digger yu authored
-
- 24 Nov, 2023 1 commit
digger yu authored
-
- 22 Nov, 2023 1 commit
digger yu authored
-
- 21 Nov, 2023 1 commit
digger yu authored
-
- 31 Oct, 2023 1 commit
ppt0011 authored
-
- 18 Oct, 2023 1 commit
digger yu authored
-
- 17 Oct, 2023 1 commit
Baizhou Zhang authored
* add test
* fix no_sync bug in low level zero plugin
* fix test
* add argument for grad accum
* add grad accum in backward hook for gemini
* finish implementation, rewrite tests
* fix test
* skip stuck model in low level zero test
* update doc
* optimize communication & fix gradient checkpoint
* modify doc
* cleaning codes
* update cpu adam fp16 case
-
- 10 Oct, 2023 1 commit
flybird11111 authored
* [doc] update advanced tutorials, training gpt with hybrid parallelism
* update vit tutorials
* update en/train_vit_with_hybrid_parallel.py
* fix
* resolve comments
-
- 05 Oct, 2023 1 commit
Zhongkai Zhao authored
-
- 27 Sep, 2023 2 commits
binmakeswell authored
-
Hongxin Liu authored
-
- 26 Sep, 2023 2 commits
Baizhou Zhang authored
* support unsharded saving/loading for model
* support optimizer unsharded saving
* update doc
* support unsharded loading for optimizer
* small fix
-
Baizhou Zhang authored
* fix example format in docstring
* polish shardformer doc
-
- 25 Sep, 2023 1 commit
binmakeswell authored
* [doc] add llama2 domain-specific solution news
-
- 21 Sep, 2023 2 commits
Baizhou Zhang authored
-
Hongxin Liu authored
* [doc] clean up outdated docs
* [doc] fix linking
-
- 20 Sep, 2023 1 commit
Pengtai Xu authored
-
- 19 Sep, 2023 1 commit
Pengtai Xu authored
-