1. 19 Nov, 2020 1 commit
    • ZeRO-1 tune max-elems + bug fix (#532) · 08c96a1b
      Jeff Rasley authored
      * zero-1 memory fix
      
      * auto-tune max elems per comm to reduce padding/comm intervals
      
      * clean-up and added previously missing reduction options
      
      * fix testing backing to work with torch1.7
  2. 18 Nov, 2020 1 commit
  3. 12 Nov, 2020 1 commit
  4. 10 Nov, 2020 1 commit
  5. 30 Oct, 2020 1 commit
  6. 07 Oct, 2020 2 commits
  7. 29 Sep, 2020 1 commit
  8. 25 Sep, 2020 1 commit
  9. 22 Sep, 2020 1 commit
  10. 21 Sep, 2020 1 commit
  11. 18 Sep, 2020 3 commits
  12. 16 Sep, 2020 1 commit
  13. 15 Sep, 2020 1 commit
  14. 11 Sep, 2020 2 commits
  15. 10 Sep, 2020 3 commits
  16. 03 Sep, 2020 1 commit
  17. 02 Sep, 2020 2 commits
  18. 10 Aug, 2020 1 commit
  19. 15 Jul, 2020 2 commits
  20. 11 Jul, 2020 1 commit
  21. 06 Jul, 2020 1 commit
  22. 23 Jun, 2020 1 commit
  23. 30 May, 2020 1 commit
  24. 29 May, 2020 1 commit
  25. 27 May, 2020 1 commit
  26. 19 May, 2020 1 commit
  27. 11 May, 2020 1 commit
  28. 06 May, 2020 1 commit
  29. 24 Apr, 2020 1 commit
  30. 27 Mar, 2020 2 commits
    • Support multi-output models (#170) · 53c73fe3
      Olatunji Ruwase authored
      * Push to remote
      
      * Correctly handle multi output models by doing loss scaling in backward()
      Unit tests for multi output models
      
      * Fix formatting issues
      
      * Formatting issues fix
      
      * Fix formatting
      
      * Update DeepSpeedExamples submodule
      Enable Megatron model tests
    • Add "zero_allow_untested_optimizer" option in conf file (#173) · 43f27332
      Calogero Zarbo authored
      * added zero_allow_untested_optimizer flag helpers
      
      * add zero_allow_untested_optimizer config constants
      
      * zero_allow_untested_optimizer logic with assertion
      
      * Added unit test and CustomOptimizer helper class
  31. 25 Mar, 2020 1 commit