1. 26 Oct, 2022 1 commit
  2. 20 Oct, 2022 1 commit
  3. 19 Oct, 2022 1 commit
  4. 18 Oct, 2022 1 commit
    • Super Daniel's avatar
      [fx/meta/rpc] move _meta_registration.py to fx folder / register fx functions... · 393f5940
      Super Daniel authored
      [fx/meta/rpc] move _meta_registration.py to fx folder / register fx functions with compatibility checks / remove color debug (#1710)
      
      * [fx] move meta registration
      
      * [fx] fix tests.
      
      * [fx] fix test.
      
      * [fx] fix.
      
      * [meta] refactor meta registration.py.
      
      * [fx] add compatibility descriptions.
      
      * [fx] polish import.
      
      * [fx] add a decorator.
      
      * [fx] fix tests.
      
      * [fx] remove print.
      
      * [fx] edit raise error.
      
      * [fx] edit raise error.
      
      * [fx] add type hint.
      
      * [fx] fix import in experimental.
      
      * [rpc] remove color debug.
      
      * [meta] fix naming.
      393f5940
  5. 12 Oct, 2022 1 commit
  6. 11 Oct, 2022 1 commit
  7. 03 Oct, 2022 1 commit
    • Boyuan Yao's avatar
      [autoparallel] add rotor C version (#1658) · 1df98d5b
      Boyuan Yao authored
      * [autoparallel] add rotor c version
      
      * [fx] remove metainfoprop in rotor solver
      
      * [autoparallel] modify C
       code format
      
      * [autoparallel] remove build.py
      
      * [autoparallel] fix C extension build
      
      * [autoparallel] add C solver consistency test
      
      * [autoparallel] remove some unused imports
      
      * [autoparallel] refactor rotor solver code
      
      * [autoparallel] replace print with colossalai logger
      
      * [autoparallel] ranks fixed
      1df98d5b
  8. 27 Sep, 2022 2 commits
  9. 23 Sep, 2022 2 commits
  10. 20 Sep, 2022 1 commit
  11. 14 Sep, 2022 2 commits
    • Boyuan Yao's avatar
      [fx] Add offload codegen (#1598) · a7cda6f5
      Boyuan Yao authored
      * [fx] add input activation offload to codegen
      
      * [fx] modify unit test
      
      * [fx] remove two skips in torch11
      
      * [fx] use all_input_nodes instead of _input_nodes
      a7cda6f5
    • Super Daniel's avatar
      [hotfix/rotor] fix variable names (#1597) · c8e9b2ad
      Super Daniel authored
      * [fx] add some comment and docstrings.
      
      * [fx] add dataflow analysis for an autograd graph.
      
      * add intepretation for graph analysis.
      
      * [fx] before doing save_tensor_hooks.
      
      * [fx] provide an accurate estimation of memory except for GPT-2.
      
      * [fx] provide an accurate estimation of memory except for GPT-2.
      
      * [fx] provide an accurate estimation of memory except for GPT-2.
      
      * [fx] a very accurate version on GPT-2.
      
      * [fx] refactor code.
      
      * [fx] remove redundant inplace=True.
      
      * [fx] refactor code.
      
      * [fx] refactor code.
      
      * [fx] refactor code.
      
      * [fx] dive into backward memory.
      
      * [fx] fix variable names in ckpt_solvers and unskip tests.
      
      * [fx] commit my changes.
      
      * [fx] restore skips.
      
      * [fx] restore skips.
      
      * [fx] chaange stage into phase.
      
      * [fx] chaange stage into phase.
      
      * [fx] chaange stage into phase.
      c8e9b2ad
  12. 12 Sep, 2022 1 commit
    • Boyuan Yao's avatar
      [fx] Add nested checkpoint in activation checkpoint codegen (#1585) · f3687e4e
      Boyuan Yao authored
      * [fx] add nested activation_checkpoint codegen
      
      * undo algorithms commits
      
      * solver
      
      * undo some commits
      
      * [fx] torch11 add nested activation checkpoint codegen
      
      * remove some imports
      
      * [fx] add some comments in activation codegen
      
      * [fx] codegen instance error fix
      f3687e4e
  13. 07 Sep, 2022 1 commit
    • Super Daniel's avatar
      [fx] provide a stable but not accurate enough version of profiler. (#1547) · 4f596932
      Super Daniel authored
      * [fx] compute memory stat and flop count for MetaInfoProp.
      
      * [fx] modify node attribute.
      
      * [fx] modify ckpt_chen.
      
      * [fx] fix compatibility.
      
      * [fx] fix import error.
      
      * [fx] skip test for MetaInfoProp.
      
      * [fx] skip test for MetaInfoProp.
      
      * [fx] skip test for MetaInfoProp.
      
      * [fx] skip test for MetaInfoProp.
      
      * [fx] skip if torch 1.11.0.
      
      * [fx] recover MetaInfoProp support for PyTorch 1.11.
      
      * [fx] provide a stable but not accurate enough version of profiler.
      
      * [fx] provide a stable but not accurate enough version of profiler.
      
      * [fx] fix compatibility in tests.
      
      * [fx] fix compatibility in tests.
      
      * [fx] fix compatibility in tests.
      
      * [fx] fix compatibility in tests.
      
      * [fx] fix compatibility in tests.
      
      * [fx] fix compatibility in tests.
      
      * [fx] fix compatibility in tests.
      
      * [fx] fix compatibility in tests.
      
      * [fx] fix compatibility in tests.
      
      * [fx] fix compatibility in tests.
      
      * [fx] fix import error.
      4f596932
  14. 02 Sep, 2022 1 commit
  15. 01 Sep, 2022 2 commits
  16. 31 Aug, 2022 1 commit
    • Boyuan Yao's avatar
      [fx] Fix wrong index in annotation and minimal flops in ckpt solver (#1521) · b231430b
      Boyuan Yao authored
      * [fx] fix wrong variable name in solver rotor
      
      * [fx] fix wrong variable name in solver rotor
      
      * [fx] fix the discretize bug
      
      * [fx] fix the first op in activation checkpoint codegen
      
      * [fx] fix some bugs of ckpt solver
      
      * [fx] modify test_ckpt_torchvision
      
      * [fx] set sequence to __sequence__ attr of GraphModule
      
      * [fx] docstring modification
      
      * [fx] remove performance test
      b231430b
  17. 27 Aug, 2022 1 commit
  18. 26 Aug, 2022 1 commit
    • Boyuan Yao's avatar
      [fx] Add activation checkpoint solver rotor (#1496) · de1e716d
      Boyuan Yao authored
      * [fx] fix defining ckpt functions inside forward
      
      * [fx] Modify activation checkpoint codegen and add ColoGraphModule
      
      * [fx] some modification
      
      * some modifications
      
      * some modifications
      
      * some modifications
      
      * some modifications
      
      * some code modifications
      
      * [automatic_parallel] ckpt solver rotor
      
      * [fx] add ckpt_solver_rotor
      
      * [fx] modification
      
      * code refactor
      
      * code refactor
      de1e716d
  19. 25 Aug, 2022 1 commit
  20. 24 Aug, 2022 1 commit
    • Super Daniel's avatar
      [fx] add profiler for fx nodes. (#1480) · 32efe8e7
      Super Daniel authored
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] merge development into main (#1)
      
      * [fx] activation checkpointing using Chen strategies.
      
      * [fx] add test for ckpt_solver_chen
      
      * [fx] add vanilla activation checkpoint search with test on resnet and densenet
      
      * [fx] add a namespace code for solver_chen.
      
      * [fx] fix the false interpretation of algorithm 3 in https://arxiv.org/abs/1604.06174.
      
      * [fx] fix lowercase naming conventions.
      
      * [fx] simplify test for ckpt.
      
      * [fx] add rules to linearize computation graphs for searching. (#2)
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] merge development into main (#1)
      
      * [fx] activation checkpointing using Chen strategies.
      
      * [fx] add test for ckpt_solver_chen
      
      * [fx] add vanilla activation checkpoint search with test on resnet and densenet
      
      * [fx] add a namespace code for solver_chen.
      
      * [fx] fix the false interpretation of algorithm 3 in https://arxiv.org/abs/1604.06174.
      
      * [fx] fix lowercase naming conventions.
      
      * [fx] simplify test for ckpt.
      
      * [fx] fix test and algorithm bugs in activation checkpointing.
      
      * [fx] polish ckpt_test.
      
      * [fx] add rules to linearize computation graphs for searching.
      
      * [fx] remove chen_sqrt for sake of simplicity
      
      * [fx] remove chen_sqrt for sake of simplicity
      
      * [fx] remove chen_sqrt for sake of simplicity
      
      * [fx] remove chen_sqrt for sake of simplicity
      
      * [fx] fix inconsistencies.
      
      * [fx] fix MetaInfoProp.
      
      * [fx] fix MetaInfoProp.
      
      * [fx] consider MetaInfoProp for inplace operands.
      
      * [fx] consider MetaInfoProp for inplace operands.
      
      * [fx] consider MetaInfoProp for inplace operands.
      
      * [fx] consider MetaInfoProp for inplace operands.
      
      * [fx] consider MetaInfoProp for inplace operands.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] fix error in tests.
      
      * [fx] unfix bug.
      
      * [fx] unfix bug.
      32efe8e7
  21. 22 Aug, 2022 1 commit
    • Boyuan Yao's avatar
      [fx] Fix ckpt functions' definitions in forward (#1476) · 1f2e547f
      Boyuan Yao authored
      * [fx] fix defining ckpt functions inside forward
      
      * [fx] Modify activation checkpoint codegen and add ColoGraphModule
      
      * [fx] some modification
      
      * some modifications
      
      * some modifications
      
      * some modifications
      
      * some modifications
      
      * some code modifications
      1f2e547f
  22. 17 Aug, 2022 2 commits
  23. 15 Aug, 2022 1 commit
    • Super Daniel's avatar
      [fx] fix test and algorithm bugs in activation checkpointing. (#1451) · 0dbd61c2
      Super Daniel authored
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] merge development into main (#1)
      
      * [fx] activation checkpointing using Chen strategies.
      
      * [fx] add test for ckpt_solver_chen
      
      * [fx] add vanilla activation checkpoint search with test on resnet and densenet
      
      * [fx] add a namespace code for solver_chen.
      
      * [fx] fix the false interpretation of algorithm 3 in https://arxiv.org/abs/1604.06174.
      
      * [fx] fix lowercase naming conventions.
      
      * [fx] simplify test for ckpt.
      
      * [fx] fix test and algorithm bugs in activation checkpointing.
      
      * mend
      
      [fx] fix test and algorithm bugs in activation checkpointing.
      
      * mend
      
      [fx] fix test and algorithm bugs in activation checkpointing.
      
      * mend
      
      [fx] fix test and algorithm bugs in activation checkpointing.
      
      * mend
      
      [fx] fix test and algorithm bugs in activation checkpointing.
      
      * [fx] polish ckpt_test.
      
      * [fx] polish ckpt_test.
      
      * [fx] polish ckpt_test.
      0dbd61c2
  24. 12 Aug, 2022 3 commits
    • Frank Lee's avatar
      [test] fixed the activation codegen test (#1447) · 5a52e21f
      Frank Lee authored
      * [test] fixed the activation codegen test
      
      * polish code
      5a52e21f
    • Boyuan Yao's avatar
      [fx] Use colossalai checkpoint and add offload recognition in codegen (#1439) · 5774fe02
      Boyuan Yao authored
      * [fx] Use colossalai.utils.checkpoint to replace torch.utils.checkpoint for offload activation and add offload annotation recognition in codegen
      
      * [fx] Use colossalai.utils.checkpoint to replace torch.utils.checkpoint for offload activation and add offload annotation recognition in codegen
      
      * Modification of test and add TODO in codegen
      
      * [fx] Modification of colossal ckpt usage
      
      * [fx] add gpc.destroy() to test_codegen
      5774fe02
    • Super Daniel's avatar
      [fx] fix the false interpretation of algorithm 3 in https://arxiv.org/abs/1604.06174. (#1446) · d40a9392
      Super Daniel authored
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] activation checkpointing using Chen strategies.
      
      * [fx] add test for ckpt_solver_chen
      
      * mend
      
      * [fx] add vanilla activation checkpoint search with test on resnet and densenet
      
      * [fx] add vanilla activation checkpoint search with test on resnet and densenet
      
      * [fx] add a namespace code for solver_chen.
      
      * [fx] fix the false interpretation of algorithm 3 in https://arxiv.org/abs/1604.06174.
      
      * [fx] fix lowercase naming conventions.
      d40a9392
  25. 11 Aug, 2022 1 commit
  26. 10 Aug, 2022 1 commit
    • Super Daniel's avatar
      [fx] modify the calculation of node_size in MetaInfoProp for activation... · f20cb4e8
      Super Daniel authored
      [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages (#1425)
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      f20cb4e8
  27. 01 Aug, 2022 2 commits
  28. 27 Jul, 2022 1 commit
    • Super Daniel's avatar
      [fx] add torchaudio test (#1369) · be229217
      Super Daniel authored
      * [fx]add torchaudio test
      
      * [fx]add torchaudio test
      
      * [fx] add torchaudio test
      
      * [fx] add torchaudio test
      
      * [fx] add torchaudio test
      
      * [fx] add torchaudio test
      
      * [fx] add torchaudio test
      
      * [fx] add torchaudio test and test patches
      
      * Delete ~
      
      * [fx] add patches and patches test
      
      * [fx] add patches and patches test
      
      * [fx] fix patches
      
      * [fx] fix rnn patches
      
      * [fx] fix rnn patches
      
      * [fx] fix rnn patches
      
      * [fx] fix rnn patches
      
      * [fx] merge upstream
      
      * [fx] fix import errors
      be229217
  29. 26 Jul, 2022 1 commit
  30. 25 Jul, 2022 2 commits
  31. 21 Jul, 2022 1 commit