- 13 Oct, 2022 2 commits
-
-
Jared Casper authored
Adding proper test cases See merge request ADLR/megatron-lm!460
-
Shanmugam Ramasamy authored
-
- 07 Oct, 2022 6 commits
-
-
shanmugamr authored
-
shanmugamr authored
-
shanmugamr authored
-
shanmugamr authored
'
-
shanmugamr authored
'
-
shanmugamr authored
-
- 06 Oct, 2022 10 commits
-
-
shanmugamr authored
-
shanmugamr authored
-
shanmugamr authored
-
shanmugamr authored
-
shanmugamr authored
-
shanmugamr authored
-
-
shanmugamr authored
-
Jared Casper authored
Setter for pipeline parallel split rank, remove print See merge request ADLR/megatron-lm!454
-
Eric Harper authored
-
- 05 Oct, 2022 1 commit
-
-
shanmugamr authored
-
- 04 Oct, 2022 2 commits
-
-
shanmugamr authored
-
shanmugamr authored
-
- 27 Sep, 2022 4 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
- 26 Sep, 2022 3 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
- 24 Sep, 2022 3 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
- 23 Sep, 2022 5 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
Also merged in some changed from apex
-
Jared Casper authored
Clean up licensing. See merge request ADLR/megatron-lm!451
-
Jared Casper authored
The LICENSE file says everything is 3-clause BSD, which is what we want, but at some point the Apache license was added to the top of some files and that proliferated. This commit removes the Apache license from any files that we own the copyright to. Also updates the copyright year and removes the unnessary coding=utf-8 line.
-
- 12 Sep, 2022 2 commits
-
-
Jared Casper authored
Update state_dict arguments for recent PyTorch versions. See merge request ADLR/megatron-lm!432
-
Jared Casper authored
Memory safety checks were incorrect for the tokens_to_generate=0 case See merge request ADLR/megatron-lm!447
-
- 02 Sep, 2022 1 commit
-
-
rprenger authored
-
- 16 Aug, 2022 1 commit
-
-
Jared Casper authored
fixed grad scalar warning for bf16 See merge request ADLR/megatron-lm!442
-