1. 01 Sep, 2020 1 commit
    • Samyam Rajbhandari's avatar
      Samyamr/grad acc stage2 (#338) · 7240abf3
      Samyam Rajbhandari authored
      
      
      * Adding gradient accumulation support for ZeRO Stage 2. Changing all Megatron-LM tests to also test gradient accumulation
      
      * Gradient Accumulation support for Stage 2. Model tests added to test the feature
      
      * formatting
      
      * Update deepspeed_light.py
      
      removing comment
      
      * Update ds_config_func_bs8_zero1.json
      
      reverting this file back. Its not needed for this PR
      
      * defining baseline prefix
      Co-authored-by: default avatarJeff Rasley <jerasley@microsoft.com>
      7240abf3
  2. 14 Jul, 2020 1 commit
  3. 27 May, 2020 1 commit
  4. 20 May, 2020 1 commit
  5. 19 May, 2020 1 commit
  6. 10 Mar, 2020 1 commit
  7. 22 Feb, 2020 1 commit
  8. 12 Feb, 2020 1 commit
  9. 07 Feb, 2020 1 commit
    • Samyam Rajbhandari's avatar
      Samyamr/batchconfig (#33) · 5a0abc65
      Samyam Rajbhandari authored
      * simplifying the batch config, using a single assert to test for validity and allowing for specifying only the micro batch size
      
      * Simplifying Batch Config, Adding ability to specify batch using just micro_batch, and adding a bunch of unit tests
      
      * ran formatting
      
      * Typo fixes and added the config file
      
      * reformatting
      
      * path fixes
      
      * removing print statements
      5a0abc65
  10. 04 Feb, 2020 1 commit
  11. 03 Feb, 2020 2 commits