- 10 May, 2023 2 commits
-
-
digger-yu authored
* fix spelling error with examples/comminity/ * fix spelling error with tests/ * fix some spelling error with tests/ colossalai/ etc.
-
jiangmingyan authored
* [booster] update tests for booster * [booster] update tests for booster * [booster] update tests for booster * [booster] update tests for booster * [booster] update tests for booster * [booster] update booster tutorials#3717, fix recursive check
-
- 09 May, 2023 1 commit
-
-
Hongxin Liu authored
* [booster] fix no_sync method * [booster] add test for ddp no_sync * [booster] fix merge * [booster] update unit test * [booster] update unit test * [booster] update unit test
-
- 08 May, 2023 1 commit
-
-
Hongxin Liu authored
* [booster] add prepare dataloader method for plug * [booster] update examples and docstr
-
- 05 May, 2023 3 commits
-
-
Hongxin Liu authored
* [booster] add dp plugin base * [booster] inherit dp plugin base * [booster] refactor unit tests
-
digger-yu authored
fix spelling error in line23 change "cudnn_determinstic"=True to "cudnn_deterministic=True"
-
jiangmingyan authored
* gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin add shard checkpoint save/load * gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint * [API Refactoring]gemini plugin support shard checkpoint --------- Co-authored-by:
luchen <luchen@luchendeMBP.lan> Co-authored-by:
luchen <luchen@luchendeMacBook-Pro.local>
-
- 26 Apr, 2023 2 commits
-
-
Hongxin Liu authored
* [gemini] support don't scatter after inference * [chat] update colossalai strategy * [chat] fix opt benchmark * [chat] update opt benchmark * [gemini] optimize inference * [test] add gemini inference test * [chat] fix unit test ci * [chat] fix ci * [chat] fix ci * [chat] skip checkpoint test
-
Hongxin Liu authored
* [booster] add low level zero plugin * [booster] fix gemini plugin test * [booster] fix precision * [booster] add low level zero plugin test * [test] fix booster plugin test oom * [test] fix booster plugin test oom * [test] fix googlenet and inception output trans * [test] fix diffuser clip vision model * [test] fix torchaudio_wav2vec2_base * [test] fix low level zero plugin test
-
- 17 Apr, 2023 1 commit
-
-
Hongxin Liu authored
* [gemini] support state dict shard * [gemini] add test state dict shard * [gemini] polish docstr * [gemini] fix merge * [gemini] polish code
-
- 12 Apr, 2023 1 commit
-
-
Hongxin Liu authored
* [gemini] fix nvme optimizer init * [gemini] gemini supports lazy init * [gemini] add init example * [gemini] add fool model * [zero] update gemini ddp * [zero] update init example * add chunk method * add chunk method * [lazyinit] fix lazy tensor tolist * [gemini] fix buffer materialization * [misc] remove useless file * [booster] update gemini plugin * [test] update gemini plugin test * [test] fix gemini plugin test * [gemini] fix import * [gemini] fix import * [lazyinit] use new metatensor * [lazyinit] use new metatensor * [lazyinit] fix __set__ method
-
- 06 Apr, 2023 3 commits
-
-
jiangmingyan authored
* [checkpoint] support huggingface style sharded checkpoint * [checkpoint] support huggingface style sharded checkpoint * [checkpoint] support huggingface style sharded checkpoint * [checkpoint] support huggingface style sharded checkpoint * [checkpoint] support huggingface style sharded checkpoint --------- Co-authored-by:luchen <luchen@luchendeMBP.lan>
-
Frank Lee authored
* [test] added spawn decorator * polish code * polish code * polish code * polish code * polish code * polish code
-
ver217 authored
-
- 04 Apr, 2023 3 commits
-
-
YuliangLiu0306 authored
* [autoparallel] integrate new analyzer in module level * unify the profiling method * polish * fix no codegen bug * fix pass bug * fix liveness test * polish
-
Frank Lee authored
* [checkpoint] refactored the API and added safetensors support * polish code
-
ver217 authored
* [zero] refactor low-level zero folder structure * [zero] fix legacy zero import path * [zero] fix legacy zero import path * [zero] remove useless import * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor legacy zero import path * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor gemini folder structure * [zero] refactor legacy zero import path * [zero] fix test import path * [zero] fix test * [zero] fix circular import * [zero] update import
-
- 03 Apr, 2023 1 commit
-
-
Frank Lee authored
* [test] fixed gemini plugin test * polish code * polish code
-
- 31 Mar, 2023 2 commits
-
-
ver217 authored
* [booster] add gemini plugin * [booster] update docstr * [booster] gemini plugin add coloparam convertor * [booster] fix coloparam convertor * [booster] fix gemini plugin device * [booster] add gemini plugin test * [booster] gemini plugin ignore sync bn * [booster] skip some model * [booster] skip some model * [booster] modify test world size * [booster] modify test world size * [booster] skip test
-
HELSON authored
* [moe] add checkpoint for moe models * [hotfix] fix bugs in unit test
-
- 30 Mar, 2023 1 commit
-
-
YuliangLiu0306 authored
* [autoparallel] adapt autoparallel with new analyzer * fix all node handler tests * polish * polish
-
- 27 Mar, 2023 1 commit
-
-
Frank Lee authored
* [booster] implemented the torch ddd + resnet example * polish code
-
- 24 Mar, 2023 2 commits
-
-
YuliangLiu0306 authored
* [API] implement device mesh manager * polish
-
YuliangLiu0306 authored
* [hotfix] skip torchaudio tracing test * fix lazy init test issue
-
- 23 Mar, 2023 2 commits
-
-
Frank Lee authored
* [api] implemented the checkpoint io module * polish code * polish code
-
ver217 authored
* [lazyinit] lazy tensor add distribute * [lazyinit] refactor distribute * [lazyinit] add test dist lazy init * [lazyinit] add verbose info for dist lazy init * [lazyinit] fix rnn flatten weight op * [lazyinit] polish test * [lazyinit] polish test * [lazyinit] fix lazy tensor data setter * [lazyinit] polish test * [lazyinit] fix clean * [lazyinit] make materialize inplace * [lazyinit] refactor materialize * [lazyinit] refactor test distribute * [lazyinit] fix requires_grad * [lazyinit] fix tolist after materialization * [lazyinit] refactor distribute module * [lazyinit] polish docstr * [lazyinit] polish lazy init context * [lazyinit] temporarily skip test * [lazyinit] polish test * [lazyinit] add docstr
-
- 22 Mar, 2023 2 commits
-
-
YuliangLiu0306 authored
-
YuliangLiu0306 authored
* pass gpt trace and meta_prop * pass t5 trace and meta_prop * [FX] refactor experimental tracer and adapt it with hf models * pass all mainstream model zoo * fix CI * fix CI * fix CI * fix CI * fix CI * fix CI * fix CI * fix CI * skip tests * fix CI * using packaging version * polish
-
- 21 Mar, 2023 2 commits
- 20 Mar, 2023 4 commits
- 17 Mar, 2023 2 commits
-
-
ver217 authored
* [lazyinit] fix shared module * [tests] add lazy init test utils * [tests] add torchvision for lazy init * [lazyinit] fix pre op fn * [lazyinit] handle legacy constructor * [tests] refactor lazy init test models * [tests] refactor lazy init test utils * [lazyinit] fix ops don't support meta * [tests] lazy init test timm models * [lazyinit] fix set data * [lazyinit] handle apex layers * [tests] lazy init test transformers models * [tests] lazy init test torchaudio models * [lazyinit] fix import path * [tests] lazy init test torchrec models * [tests] update torch version in CI * [tests] revert torch version in CI * [tests] skip lazy init test
-
Frank Lee authored
* [booster] implemented mixed precision class * polish code
-
- 15 Mar, 2023 4 commits
-
-
YuliangLiu0306 authored
-
ver217 authored
* [tests] model zoo add torchaudio models * [tests] refactor torchaudio wavernn * [tests] refactor fx torchaudio tests
-
Frank Lee authored
-
Frank Lee authored
* [test] added torchvision models to test model zoo * polish code * polish code * polish code * polish code * polish code * polish code
-