"src/vscode:/vscode.git/clone" did not exist on "1b42732ced07861b810f77ecf3fc8ce63ce465e8"
- 29 Jan, 2021 1 commit
-
-
Jeff Rasley authored
-
- 18 Dec, 2020 1 commit
-
-
Jeff Rasley authored
-
- 19 Nov, 2020 1 commit
-
-
Jeff Rasley authored
* zero-1 memory fix * auto-tune max elems per comm to reduce padding/comm intervals * clean-up and added previously missing reduction options * fix testing backing to work with torch1.7
-
- 10 Sep, 2020 1 commit
-
-
Shaden Smith authored
Co-authored-by:Jeff Rasley <jerasley@microsoft.com>
-
- 19 May, 2020 1 commit
-
-
Jeff Rasley authored
Updates for ZeRO stage 2 + ZeRO stage 1 w. RS Co-authored-by:
Tunji Ruwase <olruwase@microsoft.com> Co-authored-by:
Samyam Rajbhandari <samyamr@microsoft.com> Co-authored-by:
Shaden Smith <ShadenTSmith@gmail.com> Co-authored-by:
Elton Zheng <eltonz@microsoft.com> Co-authored-by:
Shaden Smith <Shaden.Smith@microsoft.com> Co-authored-by:
yuxionghe <yuxhe@microsoft.com> Co-authored-by:
Arash Ashari <arashari@microsoft.com>
-
- 05 Feb, 2020 1 commit
-
-
Shaden Smith authored
* Enables NCCL backend in @distributed_test * Adds pytest-forked to avoid CUDA re-initialization issue. * paste typo * transcription typo
-
- 04 Feb, 2020 2 commits
-
-
Jeff Rasley authored
* add allreduce test * comment out set rank to cuda for now * switched back to gloo
-
Shaden Smith authored
* Adds distributed_test decorator and some unit tests. * Setting NCCL backend. * Parametrizes test. * rank -> local_rank * Temporarily disable CUDA initialization.
-