1. 26 Aug, 2022 1 commit
    • Kirigaya Kazuto's avatar
      [pipeline/rpc] update outstanding mechanism | optimize dispatching strategy (#1497) · 5a6fd71f
      Kirigaya Kazuto authored
      * support p2p communication with any type of object | pass test
      
      * reconstruct pipeline schedule with p2p_v2.py(support communication with List[Any]) | pass test
      
      * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule
      
      * [pipeline/rpc] implement a demo for PP with cuda rpc framework
      
      * [pipeline/rpc] support interleaving | fix checkpoint bug | change logic when dispatch data in work_list to ensure steady 1F1B
      
      * [pipeline/rpc] implement distributed optimizer | test with assert_close
      
      * [pipeline/rpc] implement distributed optimizer | test with assert_close
      
      * [pipeline/rpc] update outstanding mechanism | optimize dispatching strategy
      
      * [pipeline/rpc] update outstanding mechanism | optimize dispatching strategy
      
      * [pipeline/rpc] update outstanding mechanism | optimize dispatching strategy
      5a6fd71f
  2. 25 Aug, 2022 1 commit
    • Kirigaya Kazuto's avatar
      [pipeline/rpc] implement distributed optimizer | test with assert_close (#1486) · 9145aef2
      Kirigaya Kazuto authored
      * support p2p communication with any type of object | pass test
      
      * reconstruct pipeline schedule with p2p_v2.py(support communication with List[Any]) | pass test
      
      * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule
      
      * [pipeline/rpc] implement a demo for PP with cuda rpc framework
      
      * [pipeline/rpc] support interleaving | fix checkpoint bug | change logic when dispatch data in work_list to ensure steady 1F1B
      
      * [pipeline/rpc] implement distributed optimizer | test with assert_close
      
      * [pipeline/rpc] implement distributed optimizer | test with assert_close
      9145aef2
  3. 24 Aug, 2022 1 commit
    • Kirigaya Kazuto's avatar
      [pipeline/rpc] support interleaving | fix checkpoint bug | change logic when... · a6c87491
      Kirigaya Kazuto authored
      [pipeline/rpc] support interleaving | fix checkpoint bug | change logic when dispatch data in work_list to ensure steady 1F1B (#1483)
      
      * support p2p communication with any type of object | pass test
      
      * reconstruct pipeline schedule with p2p_v2.py(support communication with List[Any]) | pass test
      
      * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule
      
      * [pipeline/rpc] implement a demo for PP with cuda rpc framework
      
      * [pipeline/rpc] support interleaving | fix checkpoint bug | change logic when dispatch data in work_list to ensure steady 1F1B
      a6c87491
  4. 22 Aug, 2022 1 commit
    • Kirigaya Kazuto's avatar
      [pipeline/rpc] implement a demo for PP with cuda rpc framework (#1470) · bb5f5289
      Kirigaya Kazuto authored
      * support p2p communication with any type of object | pass test
      
      * reconstruct pipeline schedule with p2p_v2.py(support communication with List[Any]) | pass test
      
      * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule
      
      * [pipeline/rpc] implement a demo for PP with cuda rpc framework
      
      * Delete p2p_v2.py
      
      * Delete _pipeline_schedule_v2.py
      
      * Delete test_object_list_p2p_v2.py
      
      * Delete test_boardcast_send_recv_v2.py
      
      * Delete test_cifar_with_data_pipeline_tensor_v2.py
      bb5f5289
  5. 10 Jun, 2022 1 commit