"applications/ChatGPT/vscode:/vscode.git/clone" did not exist on "4269196c79ca61bfbe48ce19a79a1788011c073b"
  • Kirigaya Kazuto's avatar
    [pipeline/pipleline_process_group] finish PipelineProcessGroup to manage local... · f1e18362
    Kirigaya Kazuto authored
    [pipeline/pipleline_process_group] finish PipelineProcessGroup to manage local abd global rank in TP,DP and PP (#1508)
    
    * support p2p communication with any type of object | pass test
    
    * reconstruct pipeline schedule with p2p_v2.py(support communication with List[Any]) | pass test
    
    * [engin/schedule] use p2p_v2 to recontruct pipeline_schedule
    
    * [pipeline/rpc] implement a demo for PP with cuda rpc framework
    
    * [pipeline/rpc] support interleaving | fix checkpoint bug | change logic when dispatch data in work_list to ensure steady 1F1B
    
    * [pipeline/rpc] implement distributed optimizer | test with assert_close
    
    * [pipeline/rpc] implement distributed optimizer | test with assert_close
    
    * [pipeline/rpc] update outstanding mechanism | optimize dispatching strategy
    
    * [pipeline/rpc] update outstanding mechanism | optimize dispatching strategy
    
    * [pipeline/rpc] update outstanding mechanism | optimize dispatching strategy
    
    * [pipeline/pipleline_process_group] finish PipelineProcessGroup to manage local abd global rank in TP,DP and PP
    
    * [pipeline/pipleline_process_group] remove comment
    
    * [pipeline/pipleline_process_group] remove comment
    
    * [pipeline/pipleline_process_group] skip process group test
    
    * [pipeline/pipleline_process_group] remove test named function
    f1e18362
rpc_test_utils.py 3.67 KB