Commits · bbb2c21f16c16c0ab789f046a62f5bd2dfde57c1 · OpenDAS / ColossalAI

25 Apr, 2024 1 commit

[shardformer] fix chatglm implementation (#5644) · bbb2c21f

Hongxin Liu authored Apr 25, 2024

* [shardformer] fix chatglm policy

* [shardformer] fix chatglm flash attn

* [shardformer] update readme

* [shardformer] fix chatglm init

* [shardformer] fix chatglm test

* [pipeline] fix chatglm merge batch

bbb2c21f

25 Mar, 2024 1 commit

[hotfix] set return_outputs=False in examples and polish code (#5404) · bb0a668f

Wenhao Chen authored Mar 25, 2024

* fix: simplify merge_batch

* fix: use return_outputs=False to eliminate extra memory consumption

* feat: add return_outputs warning

* style: remove `return_outputs=False` as it is the default value

bb0a668f

28 Nov, 2023 1 commit

[shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) · 7172459e

Wenhao Chen authored Nov 28, 2023



* [shardformer] implement policy for all GPT-J models and test

* [shardformer] support interleaved pipeline parallel for bert finetune

* [shardformer] shardformer support falcon (#4883)

* [shardformer]: fix interleaved pipeline for bert model (#5048)

* [hotfix]: disable seq parallel for gptj and falcon, and polish code (#5093)

* Add Mistral support for Shardformer (#5103)

* [shardformer] add tests to mistral (#5105)

---------
Co-authored-by: Pengtai Xu <henryxu880@gmail.com>
Co-authored-by: ppt0011 <143150326+ppt0011@users.noreply.github.com>
Co-authored-by: flybird11111 <1829166702@qq.com>
Co-authored-by: eric8607242 <e0928021388@gmail.com>

7172459e

26 Sep, 2023 1 commit
- [doc] polish shardformer doc (#4779) · a2db7554
  Baizhou Zhang authored Sep 26, 2023
```
* fix example format in docstring

* polish shardformer doc
```
  a2db7554
15 Sep, 2023 4 commits

[doc] polish shardformer doc (#4735) · 451c3465
Baizhou Zhang authored Sep 15, 2023
```
* arrange position of chapters

* fix typos in seq parallel doc
```
451c3465
[shardformer] update seq parallel document (#4730) · 6a03c933
Bin Jia authored Sep 15, 2023
```
* update doc of seq parallel

* fix typo
```
6a03c933
[doc] add shardformer support matrix/update tensor parallel documents (#4728) · 50e5602c
Baizhou Zhang authored Sep 15, 2023
```
* add compatibility matrix for shardformer doc

* update tp doc
```
50e5602c

[doc] Add user document for Shardformer (#4702) · f911d5b0

Baizhou Zhang authored Sep 15, 2023

* create shardformer doc files

* add docstring for seq-parallel

* update ShardConfig docstring

* add links to llama example

* add outdated massage

* finish introduction & supporting information

* finish 'how shardformer works'

* finish shardformer.md English doc

* fix doctest fail

* add Chinese document

f911d5b0