- 07 Oct, 2022 1 commit
-
-
shanmugamr authored
-
- 06 Oct, 2022 6 commits
-
-
shanmugamr authored
-
shanmugamr authored
-
-
shanmugamr authored
-
Jared Casper authored
Setter for pipeline parallel split rank, remove print See merge request ADLR/megatron-lm!454
-
Eric Harper authored
-
- 05 Oct, 2022 1 commit
-
-
shanmugamr authored
-
- 04 Oct, 2022 2 commits
-
-
shanmugamr authored
-
shanmugamr authored
-
- 27 Sep, 2022 4 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
- 26 Sep, 2022 3 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
- 24 Sep, 2022 3 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
- 23 Sep, 2022 5 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
Also merged in some changed from apex
-
Jared Casper authored
Clean up licensing. See merge request ADLR/megatron-lm!451
-
Jared Casper authored
The LICENSE file says everything is 3-clause BSD, which is what we want, but at some point the Apache license was added to the top of some files and that proliferated. This commit removes the Apache license from any files that we own the copyright to. Also updates the copyright year and removes the unnessary coding=utf-8 line.
-
- 12 Sep, 2022 2 commits
-
-
Jared Casper authored
Update state_dict arguments for recent PyTorch versions. See merge request ADLR/megatron-lm!432
-
Jared Casper authored
Memory safety checks were incorrect for the tokens_to_generate=0 case See merge request ADLR/megatron-lm!447
-
- 02 Sep, 2022 1 commit
-
-
rprenger authored
-
- 16 Aug, 2022 4 commits
-
-
Jared Casper authored
fixed grad scalar warning for bf16 See merge request ADLR/megatron-lm!442
-
Mohammad Shoeybi authored
-
Jared Casper authored
fixed grad scalar warning so it only prints it for fp16 See merge request ADLR/megatron-lm!441
-
mshoeybi authored
-
- 10 Aug, 2022 2 commits
-
-
Jared Casper authored
Timing levels See merge request ADLR/megatron-lm!436
-
Mohammad Shoeybi authored
-
- 06 Aug, 2022 2 commits
-
-
Jared Casper authored
fix a bug for size mismatch See merge request ADLR/megatron-lm!438
-
Peng Xu authored
-
- 29 Jul, 2022 1 commit
-
-
Jared Casper authored
support for all mask in fused kernel + avoiding inplace operation in bwd pass See merge request ADLR/megatron-lm!435
-
- 28 Jul, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 27 Jul, 2022 1 commit
-
-
Jared Casper authored
added a flag to be able to switch between pytorch and ring exchange p2p See merge request ADLR/megatron-lm!434
-
- 26 Jul, 2022 1 commit
-
-
Mohammad Shoeybi authored
-