- 17 May, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 16 May, 2022 6 commits
-
-
Vijay Korthikanti authored
-
Jared Casper authored
Fourth phase vision merge: classification and segmentation tasks See merge request ADLR/megatron-lm!400
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 13 May, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 10 May, 2022 1 commit
-
-
Jared Casper authored
jit functions warmpups to (1) match fprop and recompute results and (2) remove grad_enable See merge request ADLR/megatron-lm!404
-
- 09 May, 2022 1 commit
-
-
Sangkug Lym authored
change dummy_handler to nullcontext
-
- 28 Apr, 2022 3 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 25 Apr, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 30 Mar, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 25 Mar, 2022 1 commit
-
-
Sangkug Lym authored
-
- 15 Mar, 2022 3 commits
-
-
Jared Casper authored
Bugfix: Pipeline output deallocation w/ persistent layer norm See merge request ADLR/megatron-lm!402
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
- 11 Mar, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 09 Mar, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 07 Mar, 2022 3 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 03 Mar, 2022 2 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 02 Mar, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 01 Mar, 2022 3 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 24 Feb, 2022 2 commits
-
-
Lawrence McAfee authored
Switch Transformer See merge request ADLR/megatron-lm!373
-
rprenger authored
-
- 23 Feb, 2022 1 commit
-
-
Jared Casper authored
Gradient accumulation fusion See merge request ADLR/megatron-lm!394
-
- 19 Feb, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 18 Feb, 2022 1 commit
-
-
Sangkug Lym authored
-
- 17 Feb, 2022 5 commits
-
-
Jared Casper authored
changed all_gather to _all_gather_base in distributed checkpointing See merge request ADLR/megatron-lm!395
-
rprenger authored
-
mshoeybi authored
-
mshoeybi authored
-
Lawrence McAfee authored
-