- 10 Sep, 2020 11 commits
-
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Shaden Smith authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com>
-
Arash Ashari authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Minjia Zhang authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Arash Ashari authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Olatunji Ruwase authored
Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com>
-
Ammar Ahmad Awan authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Shaden Smith authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Jeff Rasley authored
* ZeRO-Offload (squash) (#381) Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> Co-authored-by:
Jie <37380896+jren73@users.noreply.github.com> Co-authored-by:
Arash Ashari <arashari@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
arashashari <arashashari@ArashMSLaptop.redmond.corp.microsoft.com> Co-authored-by:
RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com>
-
- 09 Sep, 2020 3 commits
-
-
Arash Ashari authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
Ammar Ahmad Awan authored
* 1-bit adam (#353) Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> Co-authored-by:
Your Name <you@example.com> Co-authored-by:
tanghl1994 <htang14@ur.rochester.edu> Co-authored-by:
Hank <tanghl1994@gmail.com> Co-authored-by:
root <root@node2x12b.cs.rochester.edu> Co-authored-by:
Ammar Ahmad Awan <awan.ammar@microsoft.com>
-
Arash Ashari authored
-
- 06 Sep, 2020 2 commits
-
-
Arash Ashari authored
* adding BingSqaud e2e test * updating the draft test; bring final step under try section * finalizinf test for base deepspeed and deepspeed with ZeRO * applying the comment (thanks Jeff); fixed formatting * update Sparse Attention Tutorial * fixed few issues and applied comments for better organization and readability * updated sparse attention tutorial with making how to use section incremental; applying more comments Co-authored-by:arashashari <arashashari@ArashMSLaptop.redmond.corp.microsoft.com>
-
Olatunji Ruwase authored
-
- 04 Sep, 2020 1 commit
-
-
Shaden Smith authored
-
- 03 Sep, 2020 1 commit
-
-
Arash Ashari authored
* adding link to Sparse Attention in Navigation page
-
- 02 Sep, 2020 1 commit
-
-
Jeff Rasley authored
* Sparse attn + ops/runtime refactor + v0.3.0 Co-authored-by:
Arash Ashari <arashari@microsoft.com> Co-authored-by:
Arash Ashari <arashari@microsoft.com>
-
- 08 Aug, 2020 1 commit
-
-
Shaden Smith authored
-
- 07 Aug, 2020 1 commit
-
-
Jeff Rasley authored
Add webinar on-demand links and update readme
-
- 28 Jul, 2020 1 commit
-
-
Emmanuel Kahembwe authored
-
- 25 Jul, 2020 1 commit
-
-
Shaden Smith authored
-
- 13 Jul, 2020 1 commit
-
-
Jeff Rasley authored
* add amp docs
-
- 24 Jun, 2020 1 commit
-
-
Conglong Li authored
* syntax/typo fix * add README for documentation * fix links * update navigation * typo fix * docs readme fix
-
- 20 Jun, 2020 1 commit
-
-
Shaden Smith authored
-
- 17 Jun, 2020 2 commits
-
-
Shaden Smith authored
-
Shaden Smith authored
-
- 16 Jun, 2020 1 commit
-
-
RezaYazdaniAminabadi authored
* add the fine-tuning results * updating tutorial and blog-post * updated the tutorials and links
-
- 04 Jun, 2020 1 commit
-
-
Shaden Smith authored
* links and formatting
-
- 03 Jun, 2020 1 commit
-
-
Ammar Ahmad Awan authored
Co-authored-by:Ammar Ahmad Awan <ammar.awan@microsoft.com>
-
- 29 May, 2020 8 commits
-
-
Samyam Rajbhandari authored
-
Shaden Smith authored
-
Jeff Rasley authored
-
Shaden Smith authored
-
Shaden Smith authored
-
Jeff Rasley authored
-
Jeff Rasley authored
-
Jeff Rasley authored
* Transformer kernels release Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
Elton Zheng <eltonz@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by:
Tunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
Elton Zheng <eltonz@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by:
Tunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com>
-
- 27 May, 2020 1 commit
-
-
Jeff Rasley authored
* add support for predivide as a flag * add predivide json config, remove allgather_disable (as it's not currently used anymore)
-