"extensions/csrc/cuda/multi_tensor_sgd_kernel.cu" did not exist on "0772828fba9b9dfc07e5e319b324642ecb0455e9"
  1. 15 Apr, 2024 1 commit
  2. 08 Apr, 2024 1 commit
  3. 07 Apr, 2024 1 commit
  4. 01 Apr, 2024 1 commit
    • Wenhao Chen's avatar
      [shardformer, pipeline] add `gradient_checkpointing_ratio` and heterogenous... · e614aa34
      Wenhao Chen authored
      [shardformer, pipeline] add `gradient_checkpointing_ratio` and heterogenous shard policy for llama (#5508)
      
      * feat: add `GradientCheckpointConfig` and `PipelineGradientCheckpointConfig`
      
      * feat: apply `GradientCheckpointConfig` to policy and llama_forward
      
      * feat: move `distribute_layer` and `get_stage_index` to PipelineStageManager
      
      * fix: add optional args for `distribute_layer` and `get_stage_index`
      
      * fix: fix changed API calls
      
      * test: update llama tests
      
      * style: polish `GradientCheckpointConfig`
      
      * fix: fix pipeline utils tests
      e614aa34
  5. 27 Mar, 2024 1 commit
  6. 25 Mar, 2024 1 commit
  7. 07 Feb, 2024 6 commits