- 22 Apr, 2021 3 commits
-
-
Mohammad Shoeybi authored
Integrate code from t5_main into existing code. See merge request ADLR/megatron-lm!261
-
Jared Casper authored
lm evaluation. See merge request ADLR/megatron-lm!262
-
Jared Casper authored
Training data and task deduplication. See merge request ADLR/megatron-lm!252
-
- 21 Apr, 2021 4 commits
-
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
- 20 Apr, 2021 3 commits
-
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
- 19 Apr, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 16 Apr, 2021 2 commits
-
-
Mostofa Patwary authored
-
Jared Casper authored
-
- 12 Apr, 2021 2 commits
-
-
Jared Casper authored
added link to the pipeline paper. See merge request ADLR/megatron-lm!259
-
mohammad authored
-
- 08 Apr, 2021 2 commits
-
-
Deepak Narayanan authored
Release fixes. See merge request ADLR/megatron-lm!258
-
Mohammad Shoeybi authored
-
- 03 Apr, 2021 3 commits
-
-
Mohammad Shoeybi authored
Small bugfix to make sure refactored code works with interleaved schedule. See merge request ADLR/megatron-lm!256
-
Deepak Narayanan authored
-
Mohammad Shoeybi authored
Pipeline refactor. See merge request ADLR/megatron-lm!254
-
- 02 Apr, 2021 3 commits
-
-
Jared Casper authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
- 01 Apr, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 31 Mar, 2021 2 commits
-
-
Mohammad Shoeybi authored
removed the checks for bfloat jitting. See merge request ADLR/megatron-lm!255
-
mshoeybi authored
-
- 30 Mar, 2021 1 commit
-
-
Mostofa Patwary authored
-
- 26 Mar, 2021 1 commit
-
-
mpatwary authored
-
- 24 Mar, 2021 3 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Vijay Korthikanti authored
-
- 23 Mar, 2021 1 commit
-
-
Jared Casper authored
Make sure pipeline-model-parallel size is greater than 2 for interleaved schedule. See merge request ADLR/megatron-lm!253
-
- 20 Mar, 2021 1 commit
-
-
Deepak Narayanan authored
-
- 19 Mar, 2021 5 commits
-
-
Mostofa Patwary authored
-
Jared Casper authored
ICT zeroshot evaluation. See merge request ADLR/megatron-lm!248
-
Mostofa Patwary authored
-
Jared Casper authored
Bfloat fused softmax + fused layer norm. See merge request ADLR/megatron-lm!251
-
Mohammad Shoeybi authored
-
- 18 Mar, 2021 2 commits
-
-
Jared Casper authored
refactored the fused kernels build. See merge request ADLR/megatron-lm!250
-
Mohammad Shoeybi authored
-