Commits · 641b1ee71a19e2337f3363620b228dd355835b04 · OpenDAS / ColossalAI

08 Apr, 2024 1 commit

[devops] remove post commit ci (#5566) · 641b1ee7

Hongxin Liu authored Apr 08, 2024

* [devops] remove post commit ci

* [misc] run pre-commit on all files

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci



---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

641b1ee7

04 Mar, 2024 1 commit

[example]add gpt2 benchmark example script. (#5295) · 29695cf7

flybird11111 authored Mar 04, 2024



* benchmark gpt2

* fix

fix

fix

fix

* [doc] fix typo in Colossal-LLaMA-2/README.md (#5247)

* [workflow] fixed build CI (#5240)

* [workflow] fixed build CI

* polish

* polish

* polish

* polish

* polish

* [ci] fixed booster test (#5251)

* [ci] fixed booster test

* [ci] fixed booster test

* [ci] fixed booster test

* [ci] fixed ddp test (#5254)

* [ci] fixed ddp test

* polish

* fix typo in  applications/ColossalEval/README.md (#5250)

* [ci] fix shardformer tests. (#5255)

* fix ci

fix

* revert: revert p2p

* feat: add enable_metadata_cache option

* revert: enable t5 tests

---------
Co-authored-by: Wenhao Chen <cwher@outlook.com>

* [doc] fix doc typo (#5256)

* [doc] fix annotation display

* [doc] fix llama2 doc

* [hotfix]: add pp sanity check and fix mbs arg (#5268)

* fix: fix misleading mbs arg

* feat: add pp sanity check

* fix: fix 1f1b sanity check

* [workflow] fixed incomplete bash command (#5272)

* [workflow] fixed oom tests (#5275)

* [workflow] fixed oom tests

* polish

* polish

* polish

* [ci] fix test_hybrid_parallel_plugin_checkpoint_io.py (#5276)

* fix ci

fix

* fix test

* revert: revert p2p

* feat: add enable_metadata_cache option

* revert: enable t5 tests

* fix

---------
Co-authored-by: Wenhao Chen <cwher@outlook.com>

* [shardformer] hybridparallelplugin support gradients accumulation. (#5246)

* support gradients acc

fix

fix

fix

fix

fix

fix

fix

fix

fix

fix

fix

fix

fix

* fix

fix

* fix

fix

fix

* [hotfix] Fix ShardFormer test execution path when using sequence parallelism (#5230)

* fix auto loading gpt2 tokenizer (#5279)

* [doc] add llama2-13B disyplay (#5285)

* Update README.md

* fix 13b typo

---------
Co-authored-by: binmakeswell <binmakeswell@gmail.com>

* fix llama pretrain (#5287)

* fix

* fix

* fix

fix

* fix

fix

fix

* fix

fix

* benchmark gpt2

* fix

fix

fix

fix

* [workflow] fixed build CI (#5240)

* [workflow] fixed build CI

* polish

* polish

* polish

* polish

* polish

* [ci] fixed booster test (#5251)

* [ci] fixed booster test

* [ci] fixed booster test

* [ci] fixed booster test

* fix

fix

* fix

fix

fix

* fix

* fix

fix

fix

fix

fix

* fix

* Update shardformer.py

---------
Co-authored-by: digger yu <digger-yu@outlook.com>
Co-authored-by: Frank Lee <somerlee.9@gmail.com>
Co-authored-by: Wenhao Chen <cwher@outlook.com>
Co-authored-by: binmakeswell <binmakeswell@gmail.com>
Co-authored-by: Zhongkai Zhao <kanezz620@gmail.com>
Co-authored-by: Michelle <97082656+MichelleMa8@users.noreply.github.com>
Co-authored-by: Desperado-Jia <502205863@qq.com>

29695cf7

16 Jan, 2024 1 commit
- [workflow] fixed oom tests (#5275) · d69cd2eb
  Frank Lee authored Jan 16, 2024
```
* [workflow] fixed oom tests

* polish

* polish

* polish
```
  d69cd2eb
10 Jan, 2024 1 commit
- [workflow] fixed build CI (#5240) · edf94a35
  Frank Lee authored Jan 10, 2024
```
* [workflow] fixed build CI

* polish

* polish

* polish

* polish

* polish
```
  edf94a35
26 Sep, 2023 1 commit

[lazy] support from_pretrained (#4801) · 4965c0da

Hongxin Liu authored Sep 26, 2023

* [lazy] patch from pretrained

* [lazy] fix from pretrained and add tests

* [devops] update ci

4965c0da

21 Sep, 2023 1 commit

[lazy] support torch 2.0 (#4763) · 3e05c07b

Hongxin Liu authored Sep 21, 2023

* [lazy] support _like methods and clamp

* [lazy] pass transformers models

* [lazy] fix device move and requires grad

* [lazy] fix requires grad and refactor api

* [lazy] fix requires grad

3e05c07b

19 Sep, 2023 1 commit

[misc] update pre-commit and run all files (#4752) · 079bf3cb

Hongxin Liu authored Sep 19, 2023

* [misc] update pre-commit

* [misc] run pre-commit

* [misc] remove useless configuration files

* [misc] ignore cuda for clang-format

079bf3cb

15 Aug, 2023 2 commits
- [test] skip some not compatible models · c3ca53cf
  FoolPlayer authored Aug 02, 2023
  
  c3ca53cf
- [hotfix] fix gemini and zero test (#4333) · 411cf1d2
  Hongxin Liu authored Jul 27, 2023
```
* [hotfix] fix gemini and zero test

* [hotfix] fix lazy init test

* [hotfix] fix lazy init test
```
  411cf1d2
01 Aug, 2023 1 commit

[test] remove useless tests (#4359) · 16bf4c02

Hongxin Liu authored Aug 01, 2023

* [test] remove legacy zero test

* [test] remove lazy distribute test

* [test] remove outdated checkpoint io

16bf4c02

19 Jul, 2023 1 commit

[lazy] support init on cuda (#4269) · fc5cef2c

Hongxin Liu authored Jul 19, 2023

* [lazy] support init on cuda

* [test] update lazy init test

* [test] fix transformer version

fc5cef2c

04 Jul, 2023 3 commits
- [test] fixed tests failed due to dtensor change (#4082) · c4b1b659
  Frank Lee authored Jun 26, 2023
```
* [test] fixed tests failed due to dtensor change

* polish code
```
  c4b1b659
- [shardformer] support module saving and loading (#4062) · 8eb09a4c
  Frank Lee authored Jun 22, 2023
```
* [shardformer] support module saving and loading

* polish code
```
  8eb09a4c
- [shardformer] adapted T5 and LLaMa test to use kit (#4049) · 58df7205
  Frank Lee authored Jun 21, 2023
```
* [shardformer] adapted T5 and LLaMa test to use kit

* polish code
```
  58df7205
09 Jun, 2023 1 commit
- Revert "[sync] sync feature/shardformer with develop" · ddcf58ca
  Frank Lee authored Jun 09, 2023
  
  ddcf58ca
08 Jun, 2023 1 commit
- [dtensor] updated api and doc (#3845) · eb39154d
  Frank Lee authored Jun 08, 2023
  
  eb39154d
05 Jun, 2023 1 commit

[lazy] refactor lazy init (#3891) · dbb32692

Hongxin Liu authored Jun 05, 2023

* [lazy] remove old lazy init

* [lazy] refactor lazy init folder structure

* [lazy] fix lazy tensor deepcopy

* [test] update lazy init test

dbb32692