- 15 Jul, 2020 2 commits
-
-
Jeff Rasley authored
* empty grad fix
* add unit tests for empty grad
-
Olatunji Ruwase authored
-
- 14 Jul, 2020 1 commit
-
-
Olatunji Ruwase authored
* Support saving and loading ZeRO checkpoints on different data parallelism degree.
* Fix formatting
* Support checkpoint with varying GPU count in ZeRO stage 1
* Fix formatting
* Formatting fixes
* Update model tests
* Remove pprint
* Minor fix
* Fix formatting
* Update model tests
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 13 Jul, 2020 1 commit
-
-
Jeff Rasley authored
* add amp docs
-
- 11 Jul, 2020 1 commit
-
-
Jeff Rasley authored
* add amp support for deepspeed (non-ZeRO)
* tests for amp mode
-
- 06 Jul, 2020 1 commit
-
-
Olatunji Ruwase authored
* Load non-DeepSpeed checkpoints into ZeRO optimizer
* Handle parameters smaller than DP
* Formatting fixes
* Handle empty partitions
* Fix perf bug
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 29 Jun, 2020 1 commit
-
-
Jeff Rasley authored
-
- 25 Jun, 2020 1 commit
-
-
Jeff Rasley authored
-
- 24 Jun, 2020 1 commit
-
-
Conglong Li authored
* syntax/typo fix
* add README for documentation
* fix links
* update navigation
* typo fix
* docs readme fix
-
- 23 Jun, 2020 1 commit
-
-
Olatunji Ruwase authored
* Load non-DeepSpeed checkpoints into ZeRO optimizer
* Handle parameters smaller than DP
* Formatting fixes
-
- 20 Jun, 2020 4 commits
-
-
Shaden Smith authored
-
Samyam Rajbhandari authored
* Removing handle_overflow debugging code in deepspeed_utils.py
* Removing handle_overflow debugging code in deepspeed_zero_optimizer.py
Removing unnecessary overflow handle code. Not sure why it was there in the first place.
-
Shaden Smith authored
This reverts commit 54c0267e.
-
Tunji Ruwase authored
-
- 17 Jun, 2020 3 commits
-
-
Shaden Smith authored
-
Jeff Rasley authored
-
Shaden Smith authored
-
- 16 Jun, 2020 1 commit
-
-
RezaYazdaniAminabadi authored
* add the fine-tuning results
* updating tutorial and blog-post
* updated the tutorials and links
-
- 12 Jun, 2020 1 commit
-
-
Chunyang Wen authored
-
- 09 Jun, 2020 1 commit
-
-
Jeff Rasley authored
* install update: no-sudo + clean build files
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
-
- 08 Jun, 2020 1 commit
-
-
eltonzheng authored
-
- 05 Jun, 2020 1 commit
-
-
Olatunji Ruwase authored
* Debugging
* Fix step() bug; Make step timing optional
* Remove unnecessary changes
* Format fixes
* Replace list with scalar variable
* Remove redundant code
* Fix typo
-
- 04 Jun, 2020 4 commits
-
-
Jeff Rasley authored
-
Vidush Vishwanath authored
-
Chunyang Wen authored
* Add log util
* replace all occurrences of print and logging
* address format
* disable propagate to avoid duplicate log
-
Shaden Smith authored
* links and formatting
-
- 03 Jun, 2020 2 commits
-
-
Ammar Ahmad Awan authored
Co-authored-by: Ammar Ahmad Awan <ammar.awan@microsoft.com>
-
eltonzheng authored
-
- 30 May, 2020 1 commit
-
-
Jeff Rasley authored
-
- 29 May, 2020 8 commits
-
-
Samyam Rajbhandari authored
-
Shaden Smith authored
-
Jeff Rasley authored
-
Shaden Smith authored
-
Shaden Smith authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
* Transformer kernels release
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: Elton Zheng <eltonz@microsoft.com>
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
Co-authored-by: RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
Co-authored-by: Tunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 28 May, 2020 2 commits
-
-
Chunyang Wen authored
* fix: typo in code docs
* more pythonic code
-
Chunyang Wen authored
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 27 May, 2020 1 commit
-
-
Jeff Rasley authored
-