- 12 Aug, 2022 8 commits
-
-
Frank Lee authored
-
Geng Zhang authored
-
Frank Lee authored
* [test] fixed the activation codegen test * polish code
-
YuliangLiu0306 authored
* [tensor] shape consistency output transform path and communication cost * polish code
-
Boyuan Yao authored
* [fx] Use colossalai.utils.checkpoint to replace torch.utils.checkpoint for offload activation and add offload annotation recognition in codegen * [fx] Use colossalai.utils.checkpoint to replace torch.utils.checkpoint for offload activation and add offload annotation recognition in codegen * Modification of test and add TODO in codegen * [fx] Modification of colossal ckpt usage * [fx] add gpc.destroy() to test_codegen
-
Kirigaya Kazuto authored
* support p2p communication with any type of object | pass test * reconstruct pipeline schedule with p2p_v2.py(support communication with List[Any]) | pass test * [communication] add p2p_v2.py to support communication with List[Any] * Delete _pipeline_schedule_v2.py * Delete test_cifar_with_data_pipeline_tensor_v2.py * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule * Delete p2p_v2.py * Delete test_boardcast_send_recv_v2.py * Delete test_object_list_p2p_v2.py * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule * [communication] remove print code * [communication] remove print code * [engin/schedule] shorten the running time of testing file to prevent cancelling in CI
-
Frank Lee authored
* [tensor] added linear implementation for the new sharding spec * polish code
-
https://arxiv.org/abs/1604.06174Super Daniel authored
* [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages * [fx] activation checkpointing using Chen strategies. * [fx] add test for ckpt_solver_chen * mend * [fx] add vanilla activation checkpoint search with test on resnet and densenet * [fx] add vanilla activation checkpoint search with test on resnet and densenet * [fx] add a namespace code for solver_chen. * [fx] fix the false interpretation of algorithm 3 in https://arxiv.org/abs/1604.06174. * [fx] fix lowercase naming conventions.
-
- 11 Aug, 2022 7 commits
-
-
ver217 authored
-
Frank Lee authored
-
HELSON authored
-
Super Daniel authored
* [fx] activation checkpointing using Chen strategies. * [fx] add test for ckpt_solver_chen * [fx] add vanilla activation checkpoint search with test on resnet and densenet * [fx] add vanilla activation checkpoint search with test on resnet and densenet * [fx] add a namespace code for solver_chen.
-
Jiarui Fang authored
-
HELSON authored
-
Jiarui Fang authored
-
- 10 Aug, 2022 7 commits
-
-
HELSON authored
-
Super Daniel authored
[fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages (#1425) * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
-
Jiarui Fang authored
-
Jiarui Fang authored
-
Jiarui Fang authored
-
HELSON authored
-
YuliangLiu0306 authored
* [tensor] add shape consistency feature to supportauto sharding spec transform. * [tensor] remove unused argument in simulator, add doc string for target pair.
-
- 09 Aug, 2022 7 commits
-
-
HELSON authored
-
HELSON authored
-
Jiarui Fang authored
-
ver217 authored
-
Jiarui Fang authored
-
Kirigaya Kazuto authored
* support p2p communication with any type of object | pass test * reconstruct pipeline schedule with p2p_v2.py(support communication with List[Any]) | pass test * [communication] add p2p_v2.py to support communication with List[Any] * Delete _pipeline_schedule_v2.py * Delete test_cifar_with_data_pipeline_tensor_v2.py * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule * [communication] remove print code * [communication] remove print code
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
- 08 Aug, 2022 2 commits
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
YuliangLiu0306 authored
-
- 05 Aug, 2022 1 commit
-
-
ver217 authored
-
- 03 Aug, 2022 1 commit
-
-
github-actions[bot] authored
Co-authored-by:github-actions <github-actions@github.com>
-
- 02 Aug, 2022 5 commits
-
-
YuliangLiu0306 authored
* [device] add DeviceMesh class to support logical device layout * polish code * add doc string
-
ver217 authored
-
HELSON authored
-
Jiarui Fang authored
-
ver217 authored
-
- 01 Aug, 2022 2 commits