- 27 Jan, 2022 1 commit
-
-
Stas Bekman authored
The paper has this info, so proposing to copy it next to the table. Otherwise it's hard to guess whether you used 40GB A100s or 80GB ones (and secondary, n_gpus per node). Thank you!
-
- 30 Nov, 2021 1 commit
-
-
Jared Casper authored
-
- 18 Oct, 2021 1 commit
-
-
Jared Casper authored
-
- 15 Oct, 2021 1 commit
-
-
mshoeybi authored
-
- 08 Oct, 2021 1 commit
-
-
Jared Casper authored
-
- 29 Sep, 2021 1 commit
-
-
Jared Casper authored
-
- 19 Aug, 2021 2 commits
- 16 Jul, 2021 1 commit
-
-
Haibin Lin authored
-
- 12 Jul, 2021 2 commits
-
-
Deepak Narayanan authored
-
Deepak Narayanan authored
-
- 18 May, 2021 2 commits
-
-
Stas Bekman authored
-
Devrim authored
-
- 15 May, 2021 1 commit
-
-
Jared Casper authored
-
- 14 May, 2021 3 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
mohammad authored
-
- 12 Apr, 2021 1 commit
-
-
mohammad authored
-
- 08 Apr, 2021 1 commit
-
-
Mohammad Shoeybi authored
-
- 27 Jan, 2021 1 commit
-
-
Jared Casper authored
-
- 22 Jan, 2021 1 commit
-
-
Jared Casper authored
-
- 12 Jan, 2021 2 commits
-
-
Jared Casper authored
-
Mohammad Shoeybi authored
-
- 19 Dec, 2020 1 commit
-
-
Jared Casper authored
-
- 13 Nov, 2020 1 commit
-
-
Deepak Narayanan authored
-
- 12 Nov, 2020 3 commits
-
-
Deepak Narayanan authored
-
Deepak Narayanan authored
-
Deepak Narayanan authored
Also includes following changes for inter-layer model-parallel implementation: - Refactoring of model implementations - Training loop changes to support inter-layer communication using `ring_exchange` - New groups for inter-layer communication - Checkpoint changes - Command line arguments
-
- 11 Sep, 2020 1 commit
-
-
Jared Casper authored
-
- 23 Jul, 2020 1 commit
-
-
Neel Kant authored
-
- 07 Jul, 2020 1 commit
-
-
Neel Kant authored
-
- 29 Jun, 2020 1 commit
-
-
Neel Kant authored
-
- 24 Jun, 2020 2 commits
- 15 May, 2020 1 commit
-
-
Evelina Bakhturina authored
Signed-off-by:Evelina Bakhturina <ebakhturina@nvidia.com>
-
- 14 May, 2020 4 commits
-
-
Evelina Bakhturina authored
Signed-off-by:Evelina Bakhturina <ebakhturina@nvidia.com>
-
Evelina Bakhturina authored
Signed-off-by:Evelina Bakhturina <ebakhturina@nvidia.com>
-
Evelina Bakhturina authored
Signed-off-by:Evelina Bakhturina <ebakhturina@nvidia.com>
-
Evelina Bakhturina authored
Signed-off-by:Evelina Bakhturina <ebakhturina@nvidia.com>
-
- 05 May, 2020 1 commit
-
-
David E. Weekly authored
Fixed some small typos in the README
-