- 21 Mar, 2023 6 commits
-
-
Frank Lee authored
* [booster] added the plugin base and torch ddp plugin * polish code * polish code * polish code
-
NatalieC323 authored
* Update requirements.txt * Update environment.yaml * Update README.md * Update environment.yaml * Update README.md * Update README.md * Delete requirements_colossalai.txt * Update requirements.txt * Update README.md
-
Zihao authored
* add auto-offload feature * polish code * fix syn offload runtime pass bug * add offload example * fix offload testing bug * fix example testing bug
-
YuliangLiu0306 authored
-
YH authored
-
YH authored
-
- 20 Mar, 2023 7 commits
-
-
zbian authored
-
Frank Lee authored
* [test] fixed torchrec registration in model zoo * polish code * polish code * polish code
-
NatalieC323 authored
* Update requirements.txt * Update environment.yaml * Update README.md * Update environment.yaml
-
Frank Lee authored
-
Frank Lee authored
* [test] fixed torchrec model test * polish code * polish code * polish code * polish code * polish code * polish code
-
Saurav Maheshkar authored
* refactor: README-zh-Hans * refactor: REFERENCE * docs: update paths in README
-
BlueRum authored
* add normalize function to value_head in bloom rm * add normalization to value_function in gpt_rm * add normalization to value_head of opt_rm * add Anthropic/hh-rlhf dataset * Update __init__.py * Add LogExpLoss in RM training * Update __init__.py * update rm trainer to use acc as target * update example/train_rm * Update train_rm.sh * code style * Update README.md * Update README.md * add rm test to ci * fix tokenier * fix typo * change batchsize to avoid oom in ci * Update test_ci.sh
-
- 17 Mar, 2023 5 commits
-
-
ver217 authored
-
ver217 authored
* [chatgpt] fix generation early stopping * [chatgpt] fix train prompts example
-
ver217 authored
* [lazyinit] fix shared module * [tests] add lazy init test utils * [tests] add torchvision for lazy init * [lazyinit] fix pre op fn * [lazyinit] handle legacy constructor * [tests] refactor lazy init test models * [tests] refactor lazy init test utils * [lazyinit] fix ops don't support meta * [tests] lazy init test timm models * [lazyinit] fix set data * [lazyinit] handle apex layers * [tests] lazy init test transformers models * [tests] lazy init test torchaudio models * [lazyinit] fix import path * [tests] lazy init test torchrec models * [tests] update torch version in CI * [tests] revert torch version in CI * [tests] skip lazy init test
-
binmakeswell authored
* [doc] update contribution guide * [doc] update contribution guide * [doc] add community contribution guide
-
Frank Lee authored
* [booster] implemented mixed precision class * polish code
-
- 15 Mar, 2023 4 commits
-
-
YuliangLiu0306 authored
-
ver217 authored
* [tests] model zoo add torchaudio models * [tests] refactor torchaudio wavernn * [tests] refactor fx torchaudio tests
-
Frank Lee authored
-
Frank Lee authored
* [test] added torchvision models to test model zoo * polish code * polish code * polish code * polish code * polish code * polish code
-
- 14 Mar, 2023 7 commits
-
-
HELSON authored
* [tests] diffuser models in model zoo * remove useless code * [tests] add diffusers to requirement-test
-
Saurav Maheshkar authored
* feat(docker): Add opencontainers image-spec to `Dockerfile` This PR makes few changes to improve the overall quality of the docker image
🐳 . For reference more annotations can be found [here](https://github.com/opencontainers/image-spec/blob/main/annotations.md) * feat(docker): add inline version declaration * fix(docker): drop `org.opencontainers.image.version` LABEL -
YuliangLiu0306 authored
* [DTensor] refactor dtensor with new components * polish
-
ver217 authored
* [lazyinit] refactor lazy tensor and lazy init ctx * [lazyinit] polish docstr * [lazyinit] polish docstr
-
Frank Lee authored
* [test] added timm models to test model zoo * polish code * polish code * polish code * polish code * polish code
-
BlueRum authored
* [chatgpt]update ci * Update test_ci.sh * Update test_ci.sh * Update test_ci.sh * test * Update train_prompts.py * Update train_dummy.py * add save_path * polish * add save path * polish * add save path * polish * delete bloom-560m test delete bloom-560m test because of oom * add ddp test
-
Frank Lee authored
-
- 13 Mar, 2023 5 commits
-
-
Xuanlei Zhao authored
* refact memory code * dont log free var memory * add memory align * update chunk target * update setting for new memory * finish test * update tracer * update typo * update test * add unet test * add bench * update bench * update bench * init * support vit * move to cpu * add cpu benchmark
-
BlueRum authored
* fix train_dummy * fix train-prompts
-
BlueRum authored
* fix gpt-actor * fix gpt-critic * fix opt-critic
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
Jeff Rasley authored
-
- 12 Mar, 2023 1 commit
-
-
hiko2MSP authored
-
- 11 Mar, 2023 1 commit
-
-
binmakeswell authored
* [doc] add Intel cooperation for biomedicine
-
- 10 Mar, 2023 4 commits
-
-
BlueRum authored
* fix colo-stratergy * polish * fix lora * fix ddp * polish * polish
-
binmakeswell authored
-
Kirthi Shankar Sivamani authored
Signed-off-by:Kirthi Shankar Sivamani <ksivamani@nvidia.com>
-
Frank Lee authored
-