- 27 Jan, 2021 6 commits
-
Jared Casper authored
added grad and params norm to logging and tensorboard. See merge request ADLR/megatron-lm!214
-
mohammad authored
-
mohammad authored
-
Jared Casper authored
vit model does not get imported automatically anymore. See merge request ADLR/megatron-lm!216
-
mohammad authored
-
mohammad authored
-
- 26 Jan, 2021 9 commits
-
Jared Casper authored
call helpers makefile every run so we recompile if the code has changed. See merge request ADLR/megatron-lm!215
-
Jared Casper authored
Vision transformer. See merge request ADLR/megatron-lm!204
-
Vijay Korthikanti authored
-
mohammad authored
-
mohammad authored
-
mohammad authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Jared Casper authored
attention_mask_func cleanup. See merge request ADLR/megatron-lm!212
-
- 25 Jan, 2021 6 commits
-
Jared Casper authored
Fixing merge_mp_partitions. See merge request ADLR/megatron-lm!213
-
Jared Casper authored
-
Jared Casper authored
Adding option to remove the binary head for BERT. See merge request ADLR/megatron-lm!211
-
Mohammad Shoeybi authored
-
Jared Casper authored
-
Jared Casper authored
-
- 24 Jan, 2021 1 commit
-
mohammad authored
-
- 22 Jan, 2021 3 commits
-
Jared Casper authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 20 Jan, 2021 1 commit
-
Jared Casper authored
GPT2->GPT in zero-shot evaluation scripts. See merge request ADLR/megatron-lm!210
-
- 17 Jan, 2021 1 commit
-
Deepak Narayanan authored
-
- 14 Jan, 2021 1 commit
-
Vijay Korthikanti authored
-
- 13 Jan, 2021 2 commits
-
Jared Casper authored
Transformer decoder. See merge request ADLR/megatron-lm!205
-
Vijay Korthikanti authored
-
- 12 Jan, 2021 9 commits
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Jared Casper authored
TOC fix. See merge request ADLR/megatron-lm!208
-
Jared Casper authored
-
Vijay Korthikanti authored
-
Jared Casper authored
Readme update + change gpt2 to gpt. See merge request ADLR/megatron-lm!206
-
Mohammad Shoeybi authored
-
Jared Casper authored
fixed validation loss reporting in tensorboard. See merge request ADLR/megatron-lm!207
-
mohammad authored
-
- 09 Jan, 2021 1 commit
-
Vijay Korthikanti authored
-