1. 01 Sep, 2020 1 commit
    • Samyamr/grad acc stage2 (#338) · 7240abf3
      Samyam Rajbhandari authored
      
      
      * Adding gradient accumulation support for ZeRO Stage 2. Changing all Megatron-LM tests to also test gradient accumulation
      
      * Gradient Accumulation support for Stage 2. Model tests added to test the feature
      
      * formatting
      
      * Update deepspeed_light.py
      
      removing comment
      
      * Update ds_config_func_bs8_zero1.json
      
      reverting this file back. It's not needed for this PR
      
      * defining baseline prefix
      Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
  2. 31 Aug, 2020 1 commit
  3. 28 Aug, 2020 1 commit
  4. 27 Aug, 2020 1 commit
  5. 18 Aug, 2020 1 commit
  6. 14 Aug, 2020 1 commit
  7. 13 Aug, 2020 2 commits
  8. 12 Aug, 2020 2 commits
  9. 10 Aug, 2020 1 commit
  10. 08 Aug, 2020 1 commit
  11. 07 Aug, 2020 2 commits
  12. 01 Aug, 2020 1 commit
  13. 28 Jul, 2020 2 commits
  14. 27 Jul, 2020 1 commit
  15. 25 Jul, 2020 1 commit
  16. 24 Jul, 2020 2 commits
  17. 23 Jul, 2020 1 commit
  18. 22 Jul, 2020 2 commits
  19. 21 Jul, 2020 1 commit
  20. 15 Jul, 2020 2 commits
  21. 14 Jul, 2020 1 commit
  22. 13 Jul, 2020 1 commit
  23. 11 Jul, 2020 1 commit
  24. 06 Jul, 2020 1 commit
  25. 29 Jun, 2020 1 commit
  26. 25 Jun, 2020 1 commit
  27. 24 Jun, 2020 1 commit
  28. 23 Jun, 2020 1 commit
  29. 20 Jun, 2020 4 commits
  30. 17 Jun, 2020 1 commit