1. 06 Apr, 2020 1 commit
  2. 03 Apr, 2020 1 commit
  3. 28 Mar, 2020 1 commit
  4. 27 Mar, 2020 2 commits
    • Support multi-output models (#170) · 53c73fe3
      Olatunji Ruwase authored
      * Push to remote
      
      * Correctly handle multi-output models by doing loss scaling in backward()
      Add unit tests for multi-output models
      
      * Fix formatting issues
      
      * Formatting issues fix
      
      * Fix formatting
      
      * Update DeepSpeedExamples submodule
      Enable Megatron model tests
      53c73fe3
    • Add "zero_allow_untested_optimizer" option in conf file (#173) · 43f27332
      Calogero Zarbo authored
      * added zero_allow_untested_optimizer flag helpers
      
      * add zero_allow_untested_optimizer config constants
      
      * zero_allow_untested_optimizer logic with assertion
      
      * Added unit test and CustomOptimizer helper class
      43f27332
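
The opt-in behaviour this commit describes can be pictured with a minimal sketch. Only the `zero_allow_untested_optimizer` key comes from the commit message; the `zero_optimization` key, the `TESTED_OPTIMIZERS` set, and the `check_zero_optimizer` helper are hypothetical names chosen for illustration and are not taken from the DeepSpeed source.

```python
# Hypothetical sketch: gate untested optimizers behind an explicit opt-in flag.
TESTED_OPTIMIZERS = {"Adam"}  # assumed for illustration only


def check_zero_optimizer(config: dict, optimizer_name: str) -> None:
    """Assert that ZeRO is paired with a tested optimizer unless the user opts in."""
    if not config.get("zero_optimization", False):
        return  # ZeRO is off, so there is nothing to check
    allow_untested = config.get("zero_allow_untested_optimizer", False)
    assert optimizer_name in TESTED_OPTIMIZERS or allow_untested, (
        f"{optimizer_name} is untested with ZeRO; set "
        '"zero_allow_untested_optimizer": true to use it anyway'
    )


config = {"zero_optimization": True, "zero_allow_untested_optimizer": True}
check_zero_optimizer(config, "CustomOptimizer")  # passes because of the opt-in
```

Without the flag, the same call would fail the assertion, which makes the choice to use an untested optimizer explicit in the config file.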
  5. 26 Mar, 2020 1 commit
  6. 25 Mar, 2020 1 commit
  7. 23 Mar, 2020 1 commit
  8. 22 Mar, 2020 2 commits
  9. 18 Mar, 2020 4 commits
  10. 17 Mar, 2020 5 commits
  11. 12 Mar, 2020 1 commit
  12. 11 Mar, 2020 2 commits
  13. 10 Mar, 2020 4 commits
  14. 09 Mar, 2020 1 commit
  15. 07 Mar, 2020 1 commit
  16. 03 Mar, 2020 1 commit
  17. 27 Feb, 2020 4 commits
  18. 26 Feb, 2020 1 commit
    • Init distributed torch only if needed (#108) · 5aa58b38
      Jeff Rasley authored
      * add auto-detect to torch dist init
      
      * update tests to infer distributed init status
      
      * prevent crash if dist_init_required is True but already initialized
      
      * only init if safe to do so (forgot to add this file in prev commit)
      5aa58b38
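
The behaviour listed in this commit (auto-detect, and no crash when initialization is requested twice) can be sketched roughly as follows. The function name `maybe_init_distributed` and the `FakeDist` stand-in are hypothetical; the stand-in mirrors the `is_initialized()` and `init_process_group()` calls of `torch.distributed` so the sketch runs without PyTorch. This is an illustration of the idea, not the code from the commit.

```python
def maybe_init_distributed(dist, dist_init_required=None, backend="nccl"):
    """Initialize the process group only when it is needed and safe.

    `dist` is any object exposing is_initialized() and init_process_group(),
    mirroring the torch.distributed module. Returns True if this call
    initialized the process group.
    """
    already_initialized = dist.is_initialized()
    if dist_init_required is None:
        # Auto-detect: initialize only if nobody has done so yet.
        dist_init_required = not already_initialized
    if dist_init_required and not already_initialized:
        dist.init_process_group(backend=backend)
        return True
    # Either init was not requested, or it was requested but the group
    # already exists; initializing again would fail, so skip it.
    return False


class FakeDist:
    """Stand-in for torch.distributed so the sketch runs without PyTorch."""

    def __init__(self, initialized=False):
        self._initialized = initialized

    def is_initialized(self):
        return self._initialized

    def init_process_group(self, backend):
        if self._initialized:
            raise RuntimeError("process group already initialized")
        self._initialized = True


dist = FakeDist()
print(maybe_init_distributed(dist))        # first call initializes
print(maybe_init_distributed(dist, True))  # already initialized, so skipped
```

Passing the module in as an argument keeps the decision logic testable on a machine with no GPUs or process group.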
  19. 25 Feb, 2020 2 commits
    • Option to keep VM without incurring charges with shutdown command (#106) · 6efee45c
      zenlytix authored
      * Update scripts to handle cases where you have other VMs in your sub
      
      * Support subs with other VMs and fix for PDSH permission error
      
      * Minor fix to support subs with other VMs
      
      * Added shutdown with or without delete VM option
      
      In Azure, deallocating a VM is like shutting the machine down (and prevents billing). A deallocated VM can be restarted. To drop the VM entirely, delete it. This command with the "-d" option fully deletes the VM; without any argument it just deallocates (shuts down) the VM.
      6efee45c
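
The deallocate-versus-delete choice described above can be sketched as a small command builder. The script from the commit is not shown here, so this assumes it wraps the Azure CLI commands `az vm deallocate` and `az vm delete`; the function name `shutdown_command` and the resource group and VM names are hypothetical. The sketch only builds the command line and does not run it.

```python
def shutdown_command(resource_group: str, vm_name: str, delete: bool = False) -> list:
    """Build the Azure CLI argv for shutting down (or deleting) one VM."""
    if delete:
        # Mirrors the "-d" option: remove the VM entirely.
        return ["az", "vm", "delete", "--resource-group", resource_group,
                "--name", vm_name, "--yes"]
    # Default: deallocate, which shuts the VM down but keeps it restartable.
    return ["az", "vm", "deallocate", "--resource-group", resource_group,
            "--name", vm_name]


print(" ".join(shutdown_command("my-rg", "worker-0")))
print(" ".join(shutdown_command("my-rg", "worker-0", delete=True)))
```

Making deallocation the default means the destructive path has to be requested explicitly.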
    • Fixes to support subscriptions with other VMs (#105) · cd0d6f3c
      zenlytix authored
      * Update scripts to handle cases where you have other VMs in your sub
      
      * Support subs with other VMs and fix for PDSH permission error
      
      * Minor fix to support subs with other VMs
      cd0d6f3c
  20. 24 Feb, 2020 3 commits
  21. 22 Feb, 2020 1 commit