- 23 Sep, 2022 1 commit
-
-
Jared Casper authored
The LICENSE file says everything is 3-clause BSD, which is what we want, but at some point the Apache license was added to the top of some files and that proliferated. This commit removes the Apache license from any files that we own the copyright to. Also updates the copyright year and removes the unnessary coding=utf-8 line.
-
- 24 May, 2022 2 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 18 May, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 17 May, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 09 May, 2022 1 commit
-
-
Sangkug Lym authored
change dummy_handler to nullcontext
-
- 28 Apr, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 30 Mar, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 09 Mar, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 07 Mar, 2022 2 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 03 Mar, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 02 Mar, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 19 Feb, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 18 Feb, 2022 1 commit
-
-
Sangkug Lym authored
-
- 16 Feb, 2022 1 commit
-
-
Sangkug Lym authored
remove redundant linear layer class definition add fuse_gradient_accumulation attribute to weights for simple targetting reflect feedback and clean up the codes arg change
-
- 03 Sep, 2021 1 commit
-
-
slym authored
-
- 02 Sep, 2021 4 commits
-
-
slym authored
-
slym authored
-
Sangkug Lym authored
allreduce overlap with wgrad gemm change custom delay to dummy add
-
Jared Casper authored
-
- 31 Aug, 2021 1 commit
-
-
vycezhong authored
-
- 02 Jul, 2021 1 commit
-
-
hwijeen authored
-
- 22 Mar, 2021 1 commit
-
-
Jared Casper authored
-
- 26 Jan, 2021 1 commit
-
-
mohammad authored
-
- 25 Jan, 2021 1 commit
-
-
Jared Casper authored
-
- 22 Jan, 2021 1 commit
-
-
Jared Casper authored
-
- 25 Dec, 2020 1 commit
-
-
mohammad authored
-
- 22 Dec, 2020 1 commit
-
-
mohammad authored
Add the option for fp32 residual connection (fp32 residual connection machinery still needs to be added)
-
- 12 Nov, 2020 2 commits
-
-
Deepak Narayanan authored
-
Deepak Narayanan authored
Also includes following changes for inter-layer model-parallel implementation: - Refactoring of model implementations - Training loop changes to support inter-layer communication using `ring_exchange` - New groups for inter-layer communication - Checkpoint changes - Command line arguments
-
- 11 Sep, 2020 1 commit
-
-
Vijay Korthikanti authored
-
- 03 Sep, 2020 1 commit
-
-
mohammad authored
-
- 27 Aug, 2020 1 commit
-
-
Boris Fomitchev authored
Signed-off-by:Boris Fomitchev <bfomitchev@nvidia.com>
-
- 26 Aug, 2020 1 commit
-
-
Boris Fomitchev authored
-
- 03 Aug, 2020 1 commit
-
-
mohammad authored
-
- 31 Jul, 2020 1 commit
-
-
Boris Fomitchev authored
Signed-off-by:Boris Fomitchev <bfomitchev@nvidia.com>
-
- 30 Jul, 2020 1 commit
-
-
Boris Fomitchev authored
Signed-off-by:Boris Fomitchev <bfomitchev@nvidia.com>
-
- 26 Jul, 2020 1 commit
-
-
Boris Fomitchev authored
-
- 24 Jul, 2020 1 commit
-
-
Boris Fomitchev authored
Signed-off-by:Boris Fomitchev <bfomitchev@nvidia.com>
-