- 08 Jun, 2020 1 commit
-
-
eltonzheng authored
-
- 05 Jun, 2020 1 commit
-
-
Olatunji Ruwase authored
* Debugging * Fix step() bug; Make step timing optional * Remove unnecessary changes * Format fixes * Replace list with scalar variable * Remove redundant code * Fix typo
-
- 04 Jun, 2020 4 commits
-
-
Jeff Rasley authored
-
Vidush Vishwanath authored
-
Chunyang Wen authored
* Add log util * replace all occurrences of print and logging * address format * disable propagate to avoid duplicate log
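A minimal sketch of the kind of log utility this commit describes, using only Python's standard `logging` module. The function name `create_logger`, the logger name, and the format string are hypothetical and are not taken from the repository; the point is only to show how disabling `propagate` avoids duplicate log lines when the root logger also has a handler.

```python
import logging
import sys


def create_logger(name="ExampleLogger", level=logging.INFO, stream=None):
    """Return a named logger that emits each record exactly once.

    All names here are illustrative assumptions, not the project's API.
    """
    logger = logging.getLogger(name)
    logger.setLevel(level)
    # Do not pass records up to ancestor loggers; otherwise a handler on the
    # root logger would emit the same message a second time.
    logger.propagate = False
    # Attach a handler only once, even if this function is called repeatedly.
    if not logger.handlers:
        target = stream if stream is not None else sys.stdout
        handler = logging.StreamHandler(target)
        handler.setFormatter(
            logging.Formatter("[%(asctime)s] [%(levelname)s] %(message)s"))
        logger.addHandler(handler)
    return logger


if __name__ == "__main__":
    logger = create_logger()
    logger.info("replaces a bare print() call")
```

Calls to `print` can then be swapped for `logger.info(...)` or `logger.warning(...)`, and the output stays single even when an application configures the root logger separately.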
-
Shaden Smith authored
* links and formatting
-
- 03 Jun, 2020 2 commits
-
-
Ammar Ahmad Awan authored
Co-authored-by: Ammar Ahmad Awan <ammar.awan@microsoft.com>
-
eltonzheng authored
-
- 30 May, 2020 1 commit
-
-
Jeff Rasley authored
-
- 29 May, 2020 8 commits
-
-
Samyam Rajbhandari authored
-
Shaden Smith authored
-
Jeff Rasley authored
-
Shaden Smith authored
-
Shaden Smith authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
* Transformer kernels release
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: Elton Zheng <eltonz@microsoft.com>
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
Co-authored-by: RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Co-authored-by: Tunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 28 May, 2020 2 commits
-
-
Chunyang Wen authored
* fix: typo in code docs * more pythonic code
-
Chunyang Wen authored
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 27 May, 2020 5 commits
-
-
Jeff Rasley authored
-
Jeff Rasley authored
* add support for predivide as a flag * add predivide json config, remove allgather_disable (as it's no longer used)
-
Samyam Rajbhandari authored
Contiguous Gradients should be set to false by default. It's not useful unless the model is very large.
-
Samyam Rajbhandari authored
* Fix for CPU memory bloating issue caused by PyTorch backward graph creation in allgather. Fixed by calling detach on tensors before calling all_gather
-
Jeff Rasley authored
* updates to support fp32 grad clipping and disable max_grad_norm
-
- 26 May, 2020 1 commit
-
-
Shaden Smith authored
-
- 25 May, 2020 1 commit
-
-
Chunyang Wen authored
-
- 21 May, 2020 2 commits
-
-
Shaden Smith authored
-
Shaden Smith authored
-
- 20 May, 2020 1 commit
-
-
Jeff Rasley authored
-
- 19 May, 2020 6 commits
-
-
Shaden Smith authored
-
Shaden Smith authored
-
Shaden Smith authored
* BERT title
-
Shaden Smith authored
-
Shaden Smith authored
-
Jeff Rasley authored
Updates for ZeRO stage 2 + ZeRO stage 1 w. RS
Co-authored-by: Tunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: Elton Zheng <eltonz@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: yuxionghe <yuxhe@microsoft.com>
Co-authored-by: Arash Ashari <arashari@microsoft.com>
-
- 18 May, 2020 1 commit
-
-
Arash Ashari authored
* adding BingSquad e2e test * updating the draft test; bring final step under try section * finalizing test for base deepspeed and deepspeed with ZeRO * applying the comment (thanks Jeff); fixed formatting
-
- 15 May, 2020 1 commit
-
-
Shaden Smith authored
-
- 13 May, 2020 1 commit
-
-
Jeff Rasley authored
-
- 12 May, 2020 1 commit
-
-
Shaden Smith authored
-
- 11 May, 2020 1 commit
-
-
Olatunji Ruwase authored
* Support dynamic loss scale args in fp16 optimizers * Update names
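For readers unfamiliar with the term, dynamic loss scaling in general can be sketched as below. This is a generic illustration of the technique, not the code from this commit: the class name and the arguments `init_scale`, `scale_factor`, `scale_window`, and `min_scale` are assumptions chosen for the example and may not match the names the fp16 optimizers accept.

```python
class DynamicLossScaler:
    """Generic dynamic loss scaler (illustrative; all names are hypothetical)."""

    def __init__(self, init_scale=2.0 ** 16, scale_factor=2.0,
                 scale_window=1000, min_scale=1.0):
        self.scale = init_scale
        self.scale_factor = scale_factor
        self.scale_window = scale_window
        self.min_scale = min_scale
        self.good_steps = 0  # consecutive steps without overflow

    def update(self, overflow):
        """Adjust the scale after one step; `overflow` means inf/NaN grads."""
        if overflow:
            # Back off, but never drop below the configured minimum.
            self.scale = max(self.scale / self.scale_factor, self.min_scale)
            self.good_steps = 0
        else:
            self.good_steps += 1
            # After a full window of clean steps, try a larger scale.
            if self.good_steps % self.scale_window == 0:
                self.scale *= self.scale_factor
        return self.scale
```

In a training loop, the loss would be multiplied by `scaler.scale` before the backward pass, the gradients divided by the same value before the optimizer step, and the step skipped whenever an overflow is detected.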
-