- 21 Sep, 2020 1 commit
-
-
RezaYazdaniAminabadi authored
-
- 18 Sep, 2020 3 commits
-
-
Shaden Smith authored
-
Jeff Rasley authored
This reverts commit 01b6e27e.
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
-
Shaden Smith authored
* Activation checkpointing bugfix and unit tests.
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 17 Sep, 2020 3 commits
-
-
Jeff Rasley authored
-
Gowtham Prudhvi authored
-
Shaden Smith authored
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 16 Sep, 2020 4 commits
-
-
Haibin Lin authored
* Update stage2.py
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Shaden Smith authored
* Switches fused_optimizer overflow calculation
-
Olatunji Ruwase authored
* Update installation instructions
* Format fix
* ZeRO tutorial
* Format fixes
* ZeRO-Offload
* ZeRO and ZeRO-Offload tutorials
* Update navigation page
* Format fixes
* Add yuxhe feedback
* Fix blog post link
* Fix OneBit-Adam link
* Tweak scheduler example
* Fix date link
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Olatunji Ruwase authored
Update lr schedule unit tests
-
- 15 Sep, 2020 3 commits
-
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
* add pytest skips around tests that require certain ops to be installed
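A minimal sketch of the kind of guard this commit message describes, assuming the optional op is an importable module. The module name `hypothetical_sparse_op` and the helper `module_available` are made up for illustration and are not taken from the repository. The sketch uses the standard library's `unittest.skipUnless`, which pytest honors when collecting tests; `pytest.mark.skipif` is the pytest-native equivalent.

```python
import importlib.util
import unittest


def module_available(name):
    """Return True if `name` can be located without importing it fully."""
    try:
        return importlib.util.find_spec(name) is not None
    except (ImportError, ValueError):
        # A missing parent package or a malformed name counts as "not installed".
        return False


# Hypothetical module name, used only to show the pattern.
@unittest.skipUnless(
    module_available("hypothetical_sparse_op"),
    "hypothetical_sparse_op is not installed",
)
class SparseOpTest(unittest.TestCase):
    def test_op_imports(self):
        # Runs only when the optional op is present.
        import hypothetical_sparse_op  # noqa: F401
```

Checking availability with `find_spec` keeps test collection from failing on an `ImportError` when the op was never built.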
-
- 13 Sep, 2020 1 commit
-
-
Shaden Smith authored
-
- 11 Sep, 2020 2 commits
-
-
Jeff Rasley authored
This reverts commit e549be60.
-
RezaYazdaniAminabadi authored
* supporting different intermediate sizes other than 4*hidden_dim
* run precommit
* uncomment the unit tests
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
- 10 Sep, 2020 22 commits
-
-
Shaden Smith authored
-
Jeff Rasley authored
-
Olatunji Ruwase authored
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Shaden Smith authored
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
Jeff Rasley authored
Co-authored-by: Shaden Smith <ShadenTSmith@gmail.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
-
Arash Ashari authored
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Minjia Zhang authored
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Arash Ashari authored
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
Olatunji Ruwase authored
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
-
Jeff Rasley authored
Fixes a datatype issue with softmax where the number of blocks passed to the Triton kernel was a torch.Tensor but should have been a Python integer. In some environments (e.g., conda), Triton did not know how to serialize the input and crashed in our tests. Switching to the datatype Triton expects seems to have solved the issue.
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
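The conversion this message describes can be sketched as below. This is an illustration under assumptions, not the code from the commit: `to_python_int` is a hypothetical helper, and `FakeScalar` is a stand-in for a zero-dimensional tensor so the example runs without torch installed. It relies only on the fact that scalar tensors (and NumPy scalars) expose `.item()`, which returns a plain Python number.

```python
import operator


def to_python_int(value):
    """Return `value` as a built-in int.

    Accepts plain ints and scalar-like objects that expose `.item()`,
    as zero-dimensional torch tensors and NumPy scalars do.
    Raises TypeError for non-integer values such as floats.
    """
    item = getattr(value, "item", None)
    if callable(item):
        value = item()  # unwrap the scalar into a Python number
    # operator.index rejects floats instead of silently truncating them.
    return int(operator.index(value))


class FakeScalar:
    """Stand-in for a zero-dimensional tensor (illustration only)."""

    def __init__(self, v):
        self.v = v

    def item(self):
        return self.v


# Hypothetical call site: normalize the block count before passing it on.
num_blocks = to_python_int(FakeScalar(8))
```

Normalizing at the call boundary means the receiving code only ever sees a built-in `int`, whatever type the count had upstream.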
-
Ammar Ahmad Awan authored
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Shaden Smith authored
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
* ZeRO-Offload (squash) (#381)
Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>
Co-authored-by: Reza Yazdani <reyazda@microsoft.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Jie <37380896+jren73@users.noreply.github.com>
Co-authored-by: Arash Ashari <arashari@microsoft.com>
Co-authored-by: Samyam Rajbhandari <samyamr@microsoft.com>
Co-authored-by: Shaden Smith <Shaden.Smith@microsoft.com>
Co-authored-by: arashashari <arashashari@ArashMSLaptop.redmond.corp.microsoft.com>
Co-authored-by: RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com>
-
- 09 Sep, 2020 1 commit
-
-
Jeff Rasley authored
-