- 27 Mar, 2024 1 commit
-
-
liangjing authored
-
- 06 Apr, 2023 1 commit
-
-
Jared Casper authored
-
- 22 Feb, 2023 1 commit
-
-
Lawrence McAfee authored
-
- 24 Sep, 2022 1 commit
-
-
Jared Casper authored
-
- 23 Sep, 2022 2 commits
-
-
Jared Casper authored
Also merged in some changed from apex
-
Jared Casper authored
The LICENSE file says everything is 3-clause BSD, which is what we want, but at some point the Apache license was added to the top of some files and that proliferated. This commit removes the Apache license from any files that we own the copyright to. Also updates the copyright year and removes the unnessary coding=utf-8 line.
-
- 21 Jul, 2022 1 commit
-
-
Jared Casper authored
Remove deprecated destination argument to state_dict functions and make all arguments keyword to avoid warnings.
-
- 20 May, 2022 2 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 18 May, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 28 Apr, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 02 Apr, 2021 1 commit
-
-
Jared Casper authored
-
- 24 Mar, 2021 1 commit
-
-
Vijay Korthikanti authored
-
- 19 Mar, 2021 1 commit
-
-
Mohammad Shoeybi authored
-
- 08 Mar, 2021 1 commit
-
-
Mohammad Shoeybi authored
-
- 25 Jan, 2021 1 commit
-
-
Jared Casper authored
-
- 22 Jan, 2021 2 commits
-
-
Jared Casper authored
-
Vijay Korthikanti authored
-
- 13 Jan, 2021 1 commit
-
-
Vijay Korthikanti authored
-
- 08 Jan, 2021 1 commit
-
-
Vijay Korthikanti authored
-
- 27 Dec, 2020 1 commit
-
-
mohammad authored
-
- 22 Dec, 2020 1 commit
-
-
mohammad authored
Add the option for fp32 residual connection (fp32 residual connection machinery still needs to be added)
-
- 12 Nov, 2020 6 commits
-
-
Deepak Narayanan authored
Small bugfix in bert_model.py: make sure word_embeddings is initialized before instantiating lm_head
-
Deepak Narayanan authored
-
Deepak Narayanan authored
-
Deepak Narayanan authored
-
Deepak Narayanan authored
-
Deepak Narayanan authored
Also includes following changes for inter-layer model-parallel implementation: - Refactoring of model implementations - Training loop changes to support inter-layer communication using `ring_exchange` - New groups for inter-layer communication - Checkpoint changes - Command line arguments
-
- 18 Sep, 2020 1 commit
-
-
root authored
-
- 28 Jul, 2020 1 commit
-
-
Boris Fomitchev authored
Signed-off-by:Boris Fomitchev <bfomitchev@nvidia.com>
-
- 26 Jul, 2020 1 commit
-
-
Boris Fomitchev authored
-
- 24 Jul, 2020 1 commit
-
-
Boris Fomitchev authored
Signed-off-by:Boris Fomitchev <bfomitchev@nvidia.com>
-
- 07 Jul, 2020 1 commit
-
-
Neel Kant authored
-
- 22 Jun, 2020 1 commit
-
-
Neel Kant authored
-
- 05 Jun, 2020 3 commits
- 05 May, 2020 2 commits
- 03 May, 2020 1 commit
-
-
Neel Kant authored
-