1. 29 May, 2023 1 commit
  2. 30 Mar, 2023 1 commit
  3. 25 May, 2022 1 commit
  4. 01 Sep, 2020 1 commit
    • Samyamr/grad acc stage2 (#338) · 7240abf3
      Samyam Rajbhandari authored
      
      
      * Adding gradient accumulation support for ZeRO Stage 2. Changing all Megatron-LM tests to also test gradient accumulation
      
      * Gradient Accumulation support for Stage 2. Model tests added to test the feature
      
      * formatting
      
      * Update deepspeed_light.py
      
      removing comment
      
      * Update ds_config_func_bs8_zero1.json
      
      reverting this file back. It's not needed for this PR
      
      * defining baseline prefix
      Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  5. 03 Feb, 2020 1 commit