"vscode:/vscode.git/clone" did not exist on "6398fbc391519ce033d97d23925fd13ec4ed85a7"
- 01 Dec, 2020 2 commits
-
-
Reza Yazdani authored
* tracking optimizer step in cpu-adam when loading checkpoint * add warning/error message for updating optimizer step count * resolve build issue * supporting state update from the python side * track step from python in all cases * remove comma
-
Reza Yazdani authored
* supporting different hidden dimensions * add support for larger hidden dimensions (greater than 8K) * remove empty line * add loop unrolling factor for dropout kernels * update different kernels based on the reviews Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
- 28 Nov, 2020 1 commit
-
-
Stas Bekman authored
This PR: * fixes a misspelled method name * also `( () )` doesn't read too well, until one reads the code and understands that it's not a formatting bug. I proposed to simply say that it's a callable object.
-
- 25 Nov, 2020 4 commits
-
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Shaden Smith authored
-
- 23 Nov, 2020 1 commit
-
-
Jeff Rasley authored
-
- 25 Nov, 2020 2 commits
-
-
Jeff Rasley authored
-
Ammar Ahmad Awan authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
- 24 Nov, 2020 4 commits
-
-
Olatunji Ruwase authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
- 23 Nov, 2020 2 commits
-
-
Samyam Rajbhandari authored
In the absence of a model parallel group, model_parallel_allreduce should not do any reduction. This commit fixes the bug which was doing a model parallel allreduce across world group when model parallel group is None
-
Samyam Rajbhandari authored
-
- 21 Nov, 2020 1 commit
-
-
Olatunji Ruwase authored
-
- 20 Nov, 2020 1 commit
-
-
Olatunji Ruwase authored
* Use zero-tensors for missing gradients to avoid size mismatch * Unit test for unbalanced gradients in ZeRO * Formatting fixes
-
- 19 Nov, 2020 7 commits
-
-
Jeff Rasley authored
-
Ammar Ahmad Awan authored
* Use AML method to set env vars instead of using mpi4py. Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Seunghwan Hong authored
* Add guard to not using `torch.version.cuda` above no-CUDA environment. * Fix several typos on setup.py. Signed-off-by:
Seunghwan Hong <seunghwan@scatterlab.co.kr> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
* zero-1 memory fix * auto-tune max elems per comm to reduce padding/comm intervals * clean-up and added previously missing reduction options * fix testing backing to work with torch1.7
-
Jeff Rasley authored
-
- 18 Nov, 2020 2 commits
-
-
Jeff Rasley authored
-
Olatunji Ruwase authored
* Fix layout bug in ZeRO Stage 1 checkpoint logic Add elastic checkpoint option for ZeRO stage 1, default to True * Format fixes
-
- 14 Nov, 2020 2 commits
-
-
Jeff Rasley authored
-
Jeff Rasley authored
* remove cpu-feature * remove psutils requirement
-
- 12 Nov, 2020 3 commits
-
-
Shaden Smith authored
* Adds torch install requirement to documentation. * build ops documentation
-
Jeff Rasley authored
* on cpu box error gracefully if cuda home doesn't exist * gaurd against torch import issue * fix sytax error * fix import
-
Jeff Rasley authored
Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com>
-
- 11 Nov, 2020 2 commits
-
-
Samyam Rajbhandari authored
* Update zero.md Update to ZeRO tutorial to specify the use of activation checkpointing * Update zero-offload.md Use activation checkpointing with ZeRO-Offload Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
- 10 Nov, 2020 2 commits
-
-
Olatunji Ruwase authored
* Progressive layer dropping docs (#499) * test * Adding tutorial and news page for pld * updating the tutorial and posts of PLD * update the finetune tutorial * Update PLD tutorial (#512) * Update installation instructions * Format fix * ZeRO tutorial * Format fixes * ZeRO-Offload * ZeRO and ZeRO-Offload tutorials * Update navigation page * Format fixes * Add yuxhe feedback * Fix blog post link * Fix OneBit-Adam link Tweak scheduler example * Fix date link * Add DeepSpeed_Adam * Add PLD tutorial to navigation Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> * updating the pld docs * DeepSpeed implementation of PLD (#508) * DeepSpeed implementation of PLD * Format fixes * Formatting fixes * Fix broken url * Address PR feedback * Bump DSE Co-authored-by:
Minjia Zhang <33713995+minjiaz@users.noreply.github.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> Co-authored-by:
Minjia Zhang <minjiaz@microsoft.com>
-
Minjia Zhang authored
-
- 09 Nov, 2020 2 commits
-
-
Olatunji Ruwase authored
* PLD documentation * Formatting fixes * Fix url bug
-
Olatunji Ruwase authored
* PLD documentation * Formatting fixes
-
- 05 Nov, 2020 1 commit
-
-
Reza Yazdani authored
* fixing cpu-adam * fixing copy with optimizer for data and model parallelism * fixing cpu-adam * fix cpu-adam * fix cpu-adam
-
- 30 Oct, 2020 1 commit
-
-
Reza Yazdani authored
-