- 19 Jun, 2023 1 commit
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
- 16 Jun, 2023 2 commits
-
-
Frank Lee authored
-
Baizhou Zhang authored
-
- 15 Jun, 2023 2 commits
-
-
Wenhao Chen authored
* feat: make optimizer optional in Booster.boost * test: skip unet test if diffusers version > 0.10.2
-
Baizhou Zhang authored
-
- 09 Jun, 2023 2 commits
- 08 Jun, 2023 1 commit
-
-
Frank Lee authored
-
- 05 Jun, 2023 2 commits
-
-
Hongxin Liu authored
* [bf16] add bf16 support for fused adam (#3844) * [bf16] fused adam kernel support bf16 * [test] update fused adam kernel test * [test] update fused adam test * [bf16] cpu adam and hybrid adam optimizers support bf16 (#3860) * [bf16] implement mixed precision mixin and add bf16 support for low level zero (#3869) * [bf16] add mixed precision mixin * [bf16] low level zero optim support bf16 * [text] update low level zero test * [text] fix low level zero grad acc test * [bf16] add bf16 support for gemini (#3872) * [bf16] gemini support bf16 * [test] update gemini bf16 test * [doc] update gemini docstring * [bf16] add bf16 support for plugins (#3877) * [bf16] add bf16 support for legacy zero (#3879) * [zero] init context support bf16 * [zero] legacy zero support bf16 * [test] add zero bf16 test * [doc] add bf16 related docstring for legacy zero
-
Hongxin Liu authored
* [lazy] remove old lazy init * [lazy] refactor lazy init folder structure * [lazy] fix lazy tensor deepcopy * [test] update lazy init test
-
- 23 May, 2023 2 commits
-
-
wukong1992 authored
-
Frank Lee authored
-
- 19 May, 2023 1 commit
-
-
Hongxin Liu authored
* [test] refactor torch ddp checkpoint test * [plugin] update low level zero optim checkpoint * [plugin] update gemini optim checkpoint
-
- 18 May, 2023 1 commit
-
-
Hongxin Liu authored
* [plugin] torch ddp plugin add save sharded model * [test] fix torch ddp ckpt io test * [test] fix torch ddp ckpt io test * [test] fix low level zero plugin test * [test] fix low level zero plugin test * [test] add debug info * [test] add debug info * [test] add debug info * [test] add debug info * [test] add debug info * [test] fix low level zero plugin test * [test] fix low level zero plugin test * [test] remove debug info
-
- 15 May, 2023 3 commits
-
-
wukong1992 authored
Co-authored-by:纪少敏 <jishaomin@jishaomindeMBP.lan>
-
Hongxin Liu authored
* [test] fix flop tensor test * [test] fix autochunk test * [test] fix lazyinit test * [devops] update torch version of CI * [devops] enable testmon * [devops] fix ci * [devops] fix ci * [test] fix checkpoint io test * [test] fix cluster test * [test] fix timm test * [devops] fix ci * [devops] fix ci * [devops] fix ci * [devops] fix ci * [devops] force sync to test ci * [test] skip fsdp test
-
wukong1992 authored
Co-authored-by:纪少敏 <jishaomin@jishaomindeMBP.lan>
-
- 11 May, 2023 1 commit
-
-
digger-yu authored
* fix spelling error with examples/comminity/ * fix spelling error with tests/ * fix some spelling error with tests/ colossalai/ etc. * fix spelling error with tests/ etc. date:2023.5.10
-
- 10 May, 2023 2 commits
-
-
digger-yu authored
* fix spelling error with examples/comminity/ * fix spelling error with tests/ * fix some spelling error with tests/ colossalai/ etc.
-
jiangmingyan authored
* [booster] update tests for booster * [booster] update tests for booster * [booster] update tests for booster * [booster] update tests for booster * [booster] update tests for booster * [booster] update booster tutorials#3717, fix recursive check
-
- 09 May, 2023 1 commit
-
-
Hongxin Liu authored
* [booster] fix no_sync method * [booster] add test for ddp no_sync * [booster] fix merge * [booster] update unit test * [booster] update unit test * [booster] update unit test
-
- 08 May, 2023 1 commit
-
-
Hongxin Liu authored
* [booster] add prepare dataloader method for plug * [booster] update examples and docstr
-
- 05 May, 2023 3 commits
-
-
Hongxin Liu authored
* [booster] add dp plugin base * [booster] inherit dp plugin base * [booster] refactor unit tests
-
digger-yu authored
fix spelling error in line23 change "cudnn_determinstic"=True to "cudnn_deterministic=True"
-
jiangmingyan authored
* gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint --------- Co-authored-by:
luchen <luchen@luchendeMBP.lan> Co-authored-by:
luchen <luchen@luchendeMacBook-Pro.local>
-
- 26 Apr, 2023 2 commits
-
-
Hongxin Liu authored
* [gemini] support don't scatter after inference * [chat] update colossalai strategy * [chat] fix opt benchmark * [chat] update opt benchmark * [gemini] optimize inference * [test] add gemini inference test * [chat] fix unit test ci * [chat] fix ci * [chat] fix ci * [chat] skip checkpoint test
-
Hongxin Liu authored
* [booster] add low level zero plugin * [booster] fix gemini plugin test * [booster] fix precision * [booster] add low level zero plugin test * [test] fix booster plugin test oom * [test] fix booster plugin test oom * [test] fix googlenet and inception output trans * [test] fix diffuser clip vision model * [test] fix torchaudio_wav2vec2_base * [test] fix low level zero plugin test
-
- 17 Apr, 2023 1 commit
-
-
Hongxin Liu authored
* [gemini] support state dict shard * [gemini] add test state dict shard * [gemini] polish docstr * [gemini] fix merge * [gemini] polish code
-
- 12 Apr, 2023 1 commit
-
-
Hongxin Liu authored
* [gemini] fix nvme optimizer init * [gemini] gemini supports lazy init * [gemini] add init example * [gemini] add fool model * [zero] update gemini ddp * [zero] update init example * add chunk method * add chunk method * [lazyinit] fix lazy tensor tolist * [gemini] fix buffer materialization * [misc] remove useless file * [booster] update gemini plugin * [test] update gemini plugin test * [test] fix gemini plugin test * [gemini] fix import * [gemini] fix import * [lazyinit] use new metatensor * [lazyinit] use new metatensor * [lazyinit] fix __set__ method
-
- 06 Apr, 2023 3 commits
-
-
jiangmingyan authored
* [checkpoint] support huggingface style sharded checkpoint * [checkpoint] support huggingface style sharded checkpoint * [checkpoint] support huggingface style sharded checkpoint * [checkpoint] support huggingface style sharded checkpoint * [checkpoint] support huggingface style sharded checkpoint --------- Co-authored-by:luchen <luchen@luchendeMBP.lan>
-
Frank Lee authored
* [test] added spawn decorator * polish code * polish code * polish code * polish code * polish code * polish code
-
ver217 authored
-
- 04 Apr, 2023 3 commits
-
-
YuliangLiu0306 authored
* [autoparallel] integrate new analyzer in module level * unify the profiling method * polish * fix no codegen bug * fix pass bug * fix liveness test * polish
-
Frank Lee authored
* [checkpoint] refactored the API and added safetensors support * polish code
-
ver217 authored
* [zero] refactor low-level zero folder structure * [zero] fix legacy zero import path * [zero] fix legacy zero import path * [zero] remove useless import * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor legacy zero import path * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor legacy zero import path * [zero] fix test import path * [zero] fix test * [zero] fix circular import * [zero] update import
-
- 03 Apr, 2023 1 commit
-
-
Frank Lee authored
* [test] fixed gemini plugin test * polish code * polish code
-
- 31 Mar, 2023 2 commits
-
-
ver217 authored
* [booster] add gemini plugin * [booster] update docstr * [booster] gemini plugin add coloparam convertor * [booster] fix coloparam convertor * [booster] fix gemini plugin device * [booster] add gemini plugin test * [booster] gemini plugin ignore sync bn * [booster] skip some model * [booster] skip some model * [booster] modify test world size * [booster] modify test world size * [booster] skip test
-
HELSON authored
* [moe] add checkpoint for moe models * [hotfix] fix bugs in unit test
-
- 30 Mar, 2023 1 commit
-
-
YuliangLiu0306 authored
* [autoparallel] adapt autoparallel with new analyzer * fix all node handler tests * polish * polish
-
- 27 Mar, 2023 1 commit
-
-
Frank Lee authored
* [booster] implemented the torch ddd + resnet example * polish code
-