1. 06 Sep, 2022 1 commit
  2. 05 Sep, 2022 3 commits
  3. 02 Sep, 2022 3 commits
  4. 01 Sep, 2022 6 commits
    • Super Daniel's avatar
      [fx] add test for meta tensor. (#1527) · 7dc53237
      Super Daniel authored
      * [fx] add test for meta tensor.
      
      * [fx] add test for meta tensor.
      
      * [fx] add test for meta tensor.
      
      * [fx] add test for meta tensor.
      
      * [fx] fix error.
      7dc53237
    • YuliangLiu0306's avatar
      4b3d6cae
    • CsRic's avatar
      [embedding] add tablewise sharding for FAW (#1526) · 5156d5b4
      CsRic authored
      5156d5b4
    • Kirigaya Kazuto's avatar
      [pipeline/pipleline_process_group] finish PipelineProcessGroup to manage local... · f1e18362
      Kirigaya Kazuto authored
      [pipeline/pipleline_process_group] finish PipelineProcessGroup to manage local abd global rank in TP,DP and PP (#1508)
      
      * support p2p communication with any type of object | pass test
      
      * reconstruct pipeline schedule with p2p_v2.py(support communication with List[Any]) | pass test
      
      * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule
      
      * [pipeline/rpc] implement a demo for PP with cuda rpc framework
      
      * [pipeline/rpc] support interleaving | fix checkpoint bug | change logic when dispatch data in work_list to ensure steady 1F1B
      
      * [pipeline/rpc] implement distributed optimizer | test with assert_close
      
      * [pipeline/rpc] implement distributed optimizer | test with assert_close
      
      * [pipeline/rpc] update outstanding mechanism | optimize dispatching strategy
      
      * [pipeline/rpc] update outstanding mechanism | optimize dispatching strategy
      
      * [pipeline/rpc] update outstanding mechanism | optimize dispatching strategy
      
      * [pipeline/pipleline_process_group] finish PipelineProcessGroup to manage local abd global rank in TP,DP and PP
      
      * [pipeline/pipleline_process_group] remove comment
      
      * [pipeline/pipleline_process_group] remove comment
      
      * [pipeline/pipleline_process_group] skip process group test
      
      * [pipeline/pipleline_process_group] remove test named function
      f1e18362
    • binmakeswell's avatar
      polish license (#1522) · 8a29ce54
      binmakeswell authored
      8a29ce54
    • Super Daniel's avatar
      [hotfix] avoid conflict of meta registry with torch 1.13.0. (#1530) · 112a1f0a
      Super Daniel authored
      * [hotfix] avoid conflict of meta registry with torch 1.13.0.
      
      * [hotfix] avoid conflict of meta registry with torch 1.13.0.
      112a1f0a
  5. 31 Aug, 2022 4 commits
    • Boyuan Yao's avatar
      [fx] Fix wrong index in annotation and minimal flops in ckpt solver (#1521) · b231430b
      Boyuan Yao authored
      * [fx] fix wrong variable name in solver rotor
      
      * [fx] fix wrong variable name in solver rotor
      
      * [fx] fix the discretize bug
      
      * [fx] fix the first op in activation checkpoint codegen
      
      * [fx] fix some bugs of ckpt solver
      
      * [fx] modify test_ckpt_torchvision
      
      * [fx] set sequence to __sequence__ attr of GraphModule
      
      * [fx] docstring modification
      
      * [fx] remove performance test
      b231430b
    • github-actions[bot]'s avatar
    • Super Daniel's avatar
      [fx] hack __torch_dispatch__ for meta tensor and autograd. (#1515) · 5cc849f6
      Super Daniel authored
      * [fx] hack __torch_dispatch__ for meta tensor and autograd.
      
      * [fx] hack __torch_dispatch__ for meta tensor and autograd.
      
      * [fx] hack __torch_dispatch__ for meta tensor and autograd.
      
      * [fx] hack __torch_dispatch__ for meta tensor and autograd.
      
      * [fx] hack __torch_dispatch__ for meta tensor and autograd.
      
      * [fx] add bad case detections.
      
      * [fx] add bad case detections.
      
      * [fx] rename MetaTensor attributes.
      
      * [fx] fix unexpected error.
      
      * [fx] fix unexpected error.
      
      * [fx] fix unexpected error.
      
      * [fx] fix unexpected error.
      
      * [fx] fix unexpected error.
      
      * [fx] add register backward for native_batch_norm_backward.
      
      * [fx] add more meta backend support for nn.Modules.
      
      * [fx] add meta backend to support timm and torchvision models.
      
      * [fx] add meta hardswish for timm models.
      5cc849f6
    • Jiarui Fang's avatar
      4537d39d
  6. 30 Aug, 2022 5 commits
  7. 29 Aug, 2022 4 commits
  8. 27 Aug, 2022 1 commit
  9. 26 Aug, 2022 10 commits
  10. 25 Aug, 2022 3 commits
    • Super Daniel's avatar
      [fx] add more op patches for profiler and error message for unsupported ops. (#1495) · 09c023be
      Super Daniel authored
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] merge development into main (#1)
      
      * [fx] activation checkpointing using Chen strategies.
      
      * [fx] add test for ckpt_solver_chen
      
      * [fx] add vanilla activation checkpoint search with test on resnet and densenet
      
      * [fx] add a namespace code for solver_chen.
      
      * [fx] fix the false interpretation of algorithm 3 in https://arxiv.org/abs/1604.06174.
      
      * [fx] fix lowercase naming conventions.
      
      * [fx] simplify test for ckpt.
      
      * [fx] add rules to linearize computation graphs for searching. (#2)
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] modify the calculation of node_size in MetaInfoProp for activation checkpointing usages
      
      * [fx] merge development into main (#1)
      
      * [fx] activation checkpointing using Chen strategies.
      
      * [fx] add test for ckpt_solver_chen
      
      * [fx] add vanilla activation checkpoint search with test on resnet and densenet
      
      * [fx] add a namespace code for solver_chen.
      
      * [fx] fix the false interpretation of algorithm 3 in https://arxiv.org/abs/1604.06174.
      
      * [fx] fix lowercase naming conventions.
      
      * [fx] simplify test for ckpt.
      
      * [fx] fix test and algorithm bugs in activation checkpointing.
      
      * [fx] polish ckpt_test.
      
      * [fx] add rules to linearize computation graphs for searching.
      
      * [fx] remove chen_sqrt for sake of simplicity
      
      * [fx] remove chen_sqrt for sake of simplicity
      
      * [fx] remove chen_sqrt for sake of simplicity
      
      * [fx] remove chen_sqrt for sake of simplicity
      
      * [fx] fix inconsistencies.
      
      * [fx] fix MetaInfoProp.
      
      * [fx] fix MetaInfoProp.
      
      * [fx] consider MetaInfoProp for inplace operands.
      
      * [fx] consider MetaInfoProp for inplace operands.
      
      * [fx] consider MetaInfoProp for inplace operands.
      
      * [fx] consider MetaInfoProp for inplace operands.
      
      * [fx] consider MetaInfoProp for inplace operands.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] add profiler for fx nodes.
      
      * [fx] fix error in tests.
      
      * [fx] unfix bug.
      
      * [fx] unfix bug.
      
      * [fx] patch more modules and functions.
      
      * [fx] change name of utils.py to profiler.py
      
      * [fx] add profiler for rnn.
      
      * [fx] add profiler for rnn.
      
      * [fx] polish and add more patch for profiler.
      
      * [fx] polish and add more patch for profiler.
      09c023be
    • YuliangLiu0306's avatar
      [autoparallel] add cost graph class (#1481) · 413c0534
      YuliangLiu0306 authored
      * [autoparallel] add cost graph class
      
      * polish code
      413c0534
    • YuliangLiu0306's avatar
      [tensor]add 1D device mesh (#1492) · 4b03c25f
      YuliangLiu0306 authored
      4b03c25f