1. 27 Mar, 2020 2 commits
    • Olatunji Ruwase's avatar
      Support multi-output models (#170) · 53c73fe3
      Olatunji Ruwase authored
      * Push to remote
      
      * Correctly handle multi output models by doing loss scaling in backward()
      Unit tests for multi output models
      
      * Fix formatting issues
      
      * Formatting issues fix
      
      * Fix formatting
      
      * Update DeepSpeedExamples submodule
      Enable Megatron model tests
      53c73fe3
    • Calogero Zarbo's avatar
      Add "zero_allow_untested_optimizer" option in conf file (#173) · 43f27332
      Calogero Zarbo authored
      * added zero_allow_untested_optimizer flag helpers
      
      * add zero_allow_untested_optimizer config constants
      
      * zero_allow_untested_optimizer logic with assertion
      
      * Added unit test and CustomOptimizer helper class
      43f27332
  2. 25 Mar, 2020 1 commit
  3. 10 Mar, 2020 2 commits
  4. 27 Feb, 2020 1 commit
  5. 26 Feb, 2020 1 commit
    • Jeff Rasley's avatar
      Init distributed torch only if needed (#108) · 5aa58b38
      Jeff Rasley authored
      * add auto-detect to torch dist init
      
      * update tests to infer distributed init status
      
      * prevent crash if dist_init_required is True but already initiliazed
      
      * only init if safe to do so (forgot to add this file in prev commit)
      5aa58b38
  6. 22 Feb, 2020 1 commit
  7. 20 Feb, 2020 1 commit
  8. 15 Feb, 2020 1 commit
  9. 14 Feb, 2020 1 commit
    • Shaden Smith's avatar
      Porting BingBertSquad test (#70) · 37ff62cc
      Shaden Smith authored
      * Porting BingBertSquad test
      
      * Updating default paths.
      
      * Enable model tests.
      
      * Updating DeepSpeedExamples submodule
      
      * Adding BingBertSquad's log uploads.
      
      * Messed up the submodule again :-)
      37ff62cc
  10. 12 Feb, 2020 1 commit
  11. 10 Feb, 2020 1 commit
  12. 07 Feb, 2020 1 commit
    • Samyam Rajbhandari's avatar
      Samyamr/batchconfig (#33) · 5a0abc65
      Samyam Rajbhandari authored
      * simplifying the batch config, using a single assert to test for validity and allowing for specifying only the micro batch size
      
      * Simplifying Batch Config, Adding ability to specify batch using just micro_batch, and adding a bunch of unit tests
      
      * ran formatting
      
      * Typo fixes and added the config file
      
      * reformatting
      
      * path fixes
      
      * removing print statements
      5a0abc65
  13. 06 Feb, 2020 2 commits
  14. 05 Feb, 2020 1 commit
  15. 04 Feb, 2020 4 commits
  16. 03 Feb, 2020 3 commits
  17. 01 Feb, 2020 1 commit