- 22 Jan, 2021 1 commit
-
Vijay Korthikanti authored
-
- 20 Jan, 2021 1 commit
-
Jared Casper authored
GPT2->GPT in zero-shot evaluation scripts. See merge request ADLR/megatron-lm!210
-
- 17 Jan, 2021 1 commit
-
Deepak Narayanan authored
-
- 13 Jan, 2021 2 commits
-
Jared Casper authored
Transformer decoder. See merge request ADLR/megatron-lm!205
-
Vijay Korthikanti authored
-
- 12 Jan, 2021 8 commits
-
Vijay Korthikanti authored
-
Jared Casper authored
TOC fix. See merge request ADLR/megatron-lm!208
-
Jared Casper authored
-
Vijay Korthikanti authored
-
Jared Casper authored
Readme update + change gpt2 to gpt. See merge request ADLR/megatron-lm!206
-
Mohammad Shoeybi authored
-
Jared Casper authored
fixed validation loss reporting in tensorboard. See merge request ADLR/megatron-lm!207
-
mohammad authored
-
- 09 Jan, 2021 2 commits
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 07 Jan, 2021 2 commits
-
Mohammad Shoeybi authored
Change some arguments to default to on. See merge request ADLR/megatron-lm!203
-
Jared Casper authored
-
- 06 Jan, 2021 4 commits
-
Jared Casper authored
Optimizer refactor. See merge request ADLR/megatron-lm!197
-
Jared Casper authored
added backward compatibility to be able to load checkpoints. See merge request ADLR/megatron-lm!200
-
mohammad authored
-
mohammad authored
-
- 05 Jan, 2021 13 commits
-
mohammad authored
-
mohammad authored
-
Mohammad Shoeybi authored
Only create task heads on last pipeline stage. See merge request ADLR/megatron-lm!202
-
Mohammad Shoeybi authored
Use new api to get loss scale when finetuning. See merge request ADLR/megatron-lm!201
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
Optimize refactor reload model params. See merge request ADLR/megatron-lm!199
-
Jared Casper authored
Use batched send and recv instead of torch.distributed.ring_exchange(). See merge request ADLR/megatron-lm!198
-
mohammad authored
-
mohammad authored
-
mohammad authored
-
mohammad authored
-
mohammad authored
-
- 04 Jan, 2021 1 commit
-
Deepak Narayanan authored
-
- 31 Dec, 2020 1 commit
-
mohammad authored
-
- 30 Dec, 2020 4 commits