- 23 Mar, 2023 5 commits
-
-
Fazzie-Maqianli authored
-
Fazzie-Maqianli authored
-
Frank Lee authored
* [api] implemented the checkpoint io module * polish code * polish code
-
ver217 authored
* [lazyinit] lazy tensor add distribute * [lazyinit] refactor distribute * [lazyinit] add test dist lazy init * [lazyinit] add verbose info for dist lazy init * [lazyinit] fix rnn flatten weight op * [lazyinit] polish test * [lazyinit] polish test * [lazyinit] fix lazy tensor data setter * [lazyinit] polish test * [lazyinit] fix clean * [lazyinit] make materialize inplace * [lazyinit] refactor materialize * [lazyinit] refactor test distribute * [lazyinit] fix requires_grad * [lazyinit] fix tolist after materialization * [lazyinit] refactor distribute module * [lazyinit] polish docstr * [lazyinit] polish lazy init context * [lazyinit] temporarily skip test * [lazyinit] polish test * [lazyinit] add docstr
-
Yan Fang authored
-
- 22 Mar, 2023 6 commits
-
-
Yuanchen authored
Co-authored-by:Yuanchen Xu <yuanchen.xu00@gmail.com>
-
Fazzie-Maqianli authored
-
Frank Lee authored
* [booster] implemented the cluster module * polish code
-
YuliangLiu0306 authored
-
YuliangLiu0306 authored
* pass gpt trace and meta_prop * pass t5 trace and meta_prop * [FX] refactor experimental tracer and adapt it with hf models * pass all mainstream model zoo * fix CI * fix CI * fix CI * fix CI * fix CI * fix CI * fix CI * fix CI * skip tests * fix CI * using packaging version * polish
-
pgzhang authored
* [chatgpt] add supervised fine-tune code * [chatgpt] delete unused code and modified comment code * [chatgpt] use pytorch distributed sampler instead --------- Co-authored-by:zhangpengpeng <zhangpengpeng@joyy.com>
-
- 21 Mar, 2023 6 commits
-
-
Frank Lee authored
* [booster] added the plugin base and torch ddp plugin * polish code * polish code * polish code
-
NatalieC323 authored
* Update requirements.txt * Update environment.yaml * Update README.md * Update environment.yaml * Update README.md * Update README.md * Delete requirements_colossalai.txt * Update requirements.txt * Update README.md
-
Zihao authored
* add auto-offload feature * polish code * fix syn offload runtime pass bug * add offload example * fix offload testing bug * fix example testing bug
-
YuliangLiu0306 authored
-
YH authored
-
YH authored
-
- 20 Mar, 2023 7 commits
-
-
zbian authored
-
Frank Lee authored
* [test] fixed torchrec registration in model zoo * polish code * polish code * polish code
-
NatalieC323 authored
* Update requirements.txt * Update environment.yaml * Update README.md * Update environment.yaml
-
Frank Lee authored
-
Frank Lee authored
* [test] fixed torchrec model test * polish code * polish code * polish code * polish code * polish code * polish code
-
Saurav Maheshkar authored
* refactor: README-zh-Hans * refactor: REFERENCE * docs: update paths in README
-
BlueRum authored
* add normalize function to value_head in bloom rm * add normalization to value_function in gpt_rm * add normalization to value_head of opt_rm * add Anthropic/hh-rlhf dataset * Update __init__.py * Add LogExpLoss in RM training * Update __init__.py * update rm trainer to use acc as target * update example/train_rm * Update train_rm.sh * code style * Update README.md * Update README.md * add rm test to ci * fix tokenier * fix typo * change batchsize to avoid oom in ci * Update test_ci.sh
-
- 17 Mar, 2023 5 commits
-
-
ver217 authored
-
ver217 authored
* [chatgpt] fix generation early stopping * [chatgpt] fix train prompts example
-
ver217 authored
* [lazyinit] fix shared module * [tests] add lazy init test utils * [tests] add torchvision for lazy init * [lazyinit] fix pre op fn * [lazyinit] handle legacy constructor * [tests] refactor lazy init test models * [tests] refactor lazy init test utils * [lazyinit] fix ops don't support meta * [tests] lazy init test timm models * [lazyinit] fix set data * [lazyinit] handle apex layers * [tests] lazy init test transformers models * [tests] lazy init test torchaudio models * [lazyinit] fix import path * [tests] lazy init test torchrec models * [tests] update torch version in CI * [tests] revert torch version in CI * [tests] skip lazy init test
-
binmakeswell authored
* [doc] update contribution guide * [doc] update contribution guide * [doc] add community contribution guide
-
Frank Lee authored
* [booster] implemented mixed precision class * polish code
-
- 15 Mar, 2023 4 commits
-
-
YuliangLiu0306 authored
-
ver217 authored
* [tests] model zoo add torchaudio models * [tests] refactor torchaudio wavernn * [tests] refactor fx torchaudio tests
-
Frank Lee authored
-
Frank Lee authored
* [test] added torchvision models to test model zoo * polish code * polish code * polish code * polish code * polish code * polish code
-
- 14 Mar, 2023 7 commits
-
-
HELSON authored
* [tests] diffuser models in model zoo * remove useless code * [tests] add diffusers to requirement-test
-
Saurav Maheshkar authored
* feat(docker): Add opencontainers image-spec to `Dockerfile` This PR makes few changes to improve the overall quality of the docker image
🐳 . For reference more annotations can be found [here](https://github.com/opencontainers/image-spec/blob/main/annotations.md) * feat(docker): add inline version declaration * fix(docker): drop `org.opencontainers.image.version` LABEL -
YuliangLiu0306 authored
* [DTensor] refactor dtensor with new components * polish
-
ver217 authored
* [lazyinit] refactor lazy tensor and lazy init ctx * [lazyinit] polish docstr * [lazyinit] polish docstr
-
Frank Lee authored
* [test] added timm models to test model zoo * polish code * polish code * polish code * polish code * polish code
-
BlueRum authored
* [chatgpt]update ci * Update test_ci.sh * Update test_ci.sh * Update test_ci.sh * test * Update train_prompts.py * Update train_dummy.py * add save_path * polish * add save path * polish * add save path * polish * delete bloom-560m test delete bloom-560m test because of oom * add ddp test
-
Frank Lee authored
-