- 27 Jan, 2021 1 commit
-
-
Jeff Rasley authored
-
- 26 Jan, 2021 1 commit
-
-
Ying Xiong authored
* fix wrong idx bug in invertible LayerNormBackward1 this index bug cause wrong scale grad * fix unexpected deletion * fix idx for LayerNormBackward1_fused_add * move pos defination in LayerNormBackward1 kernels * fix format error Co-authored-by:Reza Yazdani <reyazda@microsoft.com>
-
- 25 Jan, 2021 1 commit
-
-
sdtblck authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
- 20 Jan, 2021 3 commits
-
-
Leo Gao authored
* Fix ZeRO 2 + Pipelining
-
Shaden Smith authored
-
Stas Bekman authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
- 19 Jan, 2021 1 commit
-
-
Jeff Rasley authored
* Update README.md * Update index.md
-
- 15 Jan, 2021 5 commits
-
-
Stas Bekman authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Stas Bekman authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Stas Bekman authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
Olatunji Ruwase authored
-
- 14 Jan, 2021 1 commit
-
-
Jeff Rasley authored
-
- 13 Jan, 2021 2 commits
-
-
Reza Yazdani authored
* move workspace memory-allocation to PyTorch * refine the code based on the comments * remove unnecessary options * remove bsz from set_seq_len function
-
Cheng Li authored
Co-authored-by:
Cheng Li <pistasable@gmail.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com>
-
- 12 Jan, 2021 1 commit
-
-
Shaden Smith authored
Special thanks to @g-karthik for tracking this issue down.
-
- 08 Jan, 2021 5 commits
-
-
Olatunji Ruwase authored
* Add Linear warmup+decay lr schedule Update lr schedule unit tests * LR scheduler unit tests for LR Range Test and 1Cycle * Disable yapf to preserve parameterizaton * Disable test_pipe.py for CI debugging * Disable test_lr_scheduler for CI debugging * Disable test_lr_scheduler for CI debugging * Enable all unit tests for CI debugging Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
Ammar Ahmad Awan authored
* Remove a very verbose print statement. * Update engine.py
-
Jeff Rasley authored
-
Stas Bekman authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
- 07 Jan, 2021 3 commits
-
-
Jeff Rasley authored
Co-authored-by:Olatunji Ruwase <olruwase@microsoft.com>
-
dependabot[bot] authored
Bumps [nokogiri](https://github.com/sparklemotion/nokogiri) from 1.10.10 to 1.11.0. - [Release notes](https://github.com/sparklemotion/nokogiri/releases) - [Changelog](https://github.com/sparklemotion/nokogiri/blob/master/CHANGELOG.md) - [Commits](https://github.com/sparklemotion/nokogiri/compare/v1.10.10...v1.11.0 ) Signed-off-by:
dependabot[bot] <support@github.com> Co-authored-by:
dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com>
-
Xingjian Shi authored
-
- 06 Jan, 2021 1 commit
-
-
Jeff Rasley authored
Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com>
-
- 05 Jan, 2021 4 commits
-
-
Olatunji Ruwase authored
-
brett koonce authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Ammar Ahmad Awan authored
-
gcooper-isi authored
Allow DeepSpeed models to be initialized with optimizer=None Co-authored-by:Shaden Smith <Shaden.Smith@microsoft.com>
-
- 04 Jan, 2021 2 commits
-
-
Olatunji Ruwase authored
-
Jeff Rasley authored
-
- 23 Dec, 2020 1 commit
-
-
Jeff Rasley authored
Co-authored-by:Samyam Rajbhandari <samyamr@microsoft.com>
-
- 18 Dec, 2020 1 commit
-
-
Jeff Rasley authored
-
- 17 Dec, 2020 1 commit
-
-
Reza Yazdani authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
- 15 Dec, 2020 2 commits
-
-
Jeff Rasley authored
Co-authored-by:Shaden Smith <Shaden.Smith@microsoft.com>
-
Stas Bekman authored
* [doc] xref to hostfile discussion wasn't clear where to find what was meant by `hostfile` - so adding a link to where it's discussed. * remove whitespace
-
- 14 Dec, 2020 1 commit
-
-
Stas Bekman authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
- 11 Dec, 2020 3 commits
-
-
Jeff Rasley authored
* Update launch.py * formatting
-
carefree0910 authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Stas Bekman authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-