- 29 May, 2023 2 commits
- 26 Apr, 2023 1 commit
-
-
aiss authored
-
- 30 Mar, 2023 1 commit
-
-
aiss authored
-
- 10 Aug, 2022 1 commit
-
-
aiss authored
-
- 14 Jun, 2022 1 commit
-
-
aiss authored
-
- 11 Jun, 2022 2 commits
- 25 May, 2022 1 commit
-
-
aiss authored
-
- 08 Mar, 2021 1 commit
-
-
Samyam Rajbhandari authored
* Squash stage3 v1 (#146) Co-authored-by:
Samyam <samyamr@microsoft.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
eltonzheng <eltonz@microsoft.com> * Fix correctness bug (#147) * formatting fix (#150) * stage3 bugfix (API) update and simplified FP16 Z3 tests (#151) * fp16 Z3 API update and bugfix * revert debug change * ZeRO-3 detach and race condition bugfixes (#149) * trying out ZeRO-3 race condition fix * CUDA sync instead of stream * reduction stream sync * remove commented code * Fix optimizer state_dict KeyError (#148) Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> * fix for smaller SGS sizes, ensures each grad is backed by unique tensors (#152) * Simplifying the logic for getting averaged gradients (#153) * skip for now * Z3 Docs redux (#154) * removing some TODOs and commented code (#155) * New Z3 defaults (#156) Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> * formatting * megatron external params Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
eltonzheng <eltonz@microsoft.com>
-
- 23 Dec, 2020 1 commit
-
-
Jeff Rasley authored
Co-authored-by:Samyam Rajbhandari <samyamr@microsoft.com>
-
- 03 Dec, 2020 1 commit
-
-
Jeff Rasley authored
-
- 25 Nov, 2020 1 commit
-
-
Shaden Smith authored
-
- 19 Nov, 2020 1 commit
-
-
Seunghwan Hong authored
* Add guard to not using `torch.version.cuda` above no-CUDA environment. * Fix several typos on setup.py. Signed-off-by:
Seunghwan Hong <seunghwan@scatterlab.co.kr> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com>
-
- 12 Nov, 2020 2 commits
-
-
Jeff Rasley authored
* on cpu box error gracefully if cuda home doesn't exist * gaurd against torch import issue * fix sytax error * fix import
-
Jeff Rasley authored
Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com>
-
- 14 Oct, 2020 1 commit
-
-
Jeff Rasley authored
add compute cap of 6.0, support p100
-
- 12 Oct, 2020 2 commits
-
-
Jeff Rasley authored
-
Jeff Rasley authored
add compute cap of 6.0 to transformer kernels
-
- 29 Sep, 2020 1 commit
-
-
Olatunji Ruwase authored
* Disable default installation of CPU Adam * Handle cpufeature import/use errors separately
-
- 17 Sep, 2020 1 commit
-
-
Shaden Smith authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
- 10 Sep, 2020 2 commits
-
-
Jeff Rasley authored
-
Jeff Rasley authored
* ZeRO-Offload (squash) (#381) Co-authored-by:
Olatunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> Co-authored-by:
Jie <37380896+jren73@users.noreply.github.com> Co-authored-by:
Arash Ashari <arashari@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
arashashari <arashashari@ArashMSLaptop.redmond.corp.microsoft.com> Co-authored-by:
RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com>
-
- 09 Sep, 2020 1 commit
-
-
Ammar Ahmad Awan authored
* 1-bit adam (#353) Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> Co-authored-by:
Your Name <you@example.com> Co-authored-by:
tanghl1994 <htang14@ur.rochester.edu> Co-authored-by:
Hank <tanghl1994@gmail.com> Co-authored-by:
root <root@node2x12b.cs.rochester.edu> Co-authored-by:
Ammar Ahmad Awan <awan.ammar@microsoft.com>
-
- 02 Sep, 2020 1 commit
-
-
Jeff Rasley authored
* Sparse attn + ops/runtime refactor + v0.3.0 Co-authored-by:
Arash Ashari <arashari@microsoft.com> Co-authored-by:
Arash Ashari <arashari@microsoft.com>
-
- 22 Jul, 2020 1 commit
-
-
Jeff Rasley authored
-
- 29 May, 2020 1 commit
-
-
Jeff Rasley authored
* Transformer kernels release Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
Elton Zheng <eltonz@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by:
Tunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
Jeff Rasley <jerasley@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
Elton Zheng <eltonz@microsoft.com> Co-authored-by:
Reza Yazdani <reyazda@microsoft.com> Co-authored-by:
RezaYazdaniAminabadi <44502768+RezaYazdaniAminabadi@users.noreply.github.com> Co-authored-by:
Tunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com>
-
- 12 Mar, 2020 1 commit
-
-
Jeff Rasley authored
* add support for torch 1.3+ builds inside a docker build environment * remove apex imports
-
- 10 Feb, 2020 1 commit
-
-
Shaden Smith authored
-
- 09 Feb, 2020 1 commit
-
-
Jeff Rasley authored
-
- 03 Feb, 2020 1 commit
-
-
Jeff Rasley authored
-
- 31 Jan, 2020 1 commit
-
-
Jeff Rasley authored
-