1. 21 Nov, 2020 1 commit
  2. 20 Nov, 2020 1 commit
  3. 19 Nov, 2020 1 commit
    • Jeff Rasley's avatar
      ZeRO-1 tune max-elems + bug fix (#532) · 08c96a1b
      Jeff Rasley authored
      * zero-1 memory fix
      
      * auto-tune max elems per comm to reduce padding/comm intervals
      
      * clean-up and added previously missing reduction options
      
      * fix testing backing to work with torch1.7
      08c96a1b
  4. 18 Nov, 2020 1 commit
  5. 12 Nov, 2020 1 commit
  6. 10 Nov, 2020 1 commit
  7. 30 Oct, 2020 1 commit
  8. 07 Oct, 2020 2 commits
  9. 29 Sep, 2020 1 commit
  10. 25 Sep, 2020 1 commit
  11. 22 Sep, 2020 1 commit
  12. 21 Sep, 2020 1 commit
  13. 18 Sep, 2020 3 commits
  14. 16 Sep, 2020 1 commit
  15. 15 Sep, 2020 1 commit
  16. 11 Sep, 2020 2 commits
  17. 10 Sep, 2020 3 commits
  18. 03 Sep, 2020 1 commit
  19. 02 Sep, 2020 2 commits
  20. 10 Aug, 2020 1 commit
  21. 15 Jul, 2020 2 commits
  22. 11 Jul, 2020 1 commit
  23. 06 Jul, 2020 1 commit
  24. 23 Jun, 2020 1 commit
  25. 30 May, 2020 1 commit
  26. 29 May, 2020 1 commit
  27. 27 May, 2020 1 commit
  28. 19 May, 2020 1 commit
  29. 11 May, 2020 1 commit
  30. 06 May, 2020 1 commit
  31. 24 Apr, 2020 1 commit
  32. 27 Mar, 2020 1 commit
    • Olatunji Ruwase's avatar
      Support multi-output models (#170) · 53c73fe3
      Olatunji Ruwase authored
      * Push to remote
      
      * Correctly handle multi output models by doing loss scaling in backward()
      Unit tests for multi output models
      
      * Fix formatting issues
      
      * Formatting issues fix
      
      * Fix formatting
      
      * Update DeepSpeedExamples submodule
      Enable Megatron model tests
      53c73fe3