- 12 Apr, 2021 3 commits
-
-
Jared Casper authored
-
Jared Casper authored
added link to the pipeline paper See merge request ADLR/megatron-lm!259
-
mohammad authored
-
- 08 Apr, 2021 2 commits
-
-
Deepak Narayanan authored
Release fixes See merge request ADLR/megatron-lm!258
-
Mohammad Shoeybi authored
-
- 03 Apr, 2021 3 commits
-
-
Mohammad Shoeybi authored
Small bugfix to make sure refactored code works with interleaved schedule See merge request ADLR/megatron-lm!256
-
Deepak Narayanan authored
-
Mohammad Shoeybi authored
Pipeline refactor See merge request ADLR/megatron-lm!254
-
- 02 Apr, 2021 1 commit
-
-
Jared Casper authored
-
- 31 Mar, 2021 2 commits
-
-
Mohammad Shoeybi authored
removed the checks for bfloat jitting See merge request ADLR/megatron-lm!255
-
mshoeybi authored
-
- 24 Mar, 2021 3 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Vijay Korthikanti authored
-
- 23 Mar, 2021 1 commit
-
-
Jared Casper authored
Make sure pipeline-model-parallel size is greater than 2 for interleaved schedule See merge request ADLR/megatron-lm!253
-
- 22 Mar, 2021 1 commit
-
-
Jared Casper authored
-
- 20 Mar, 2021 1 commit
-
-
Deepak Narayanan authored
-
- 19 Mar, 2021 4 commits
-
-
Jared Casper authored
ICT zeroshot evaluation See merge request ADLR/megatron-lm!248
-
Mostofa Patwary authored
-
Jared Casper authored
Bfloat fused softmax + fused layer norm See merge request ADLR/megatron-lm!251
-
Mohammad Shoeybi authored
-
- 18 Mar, 2021 2 commits
-
-
Jared Casper authored
refactored the fused kernels build See merge request ADLR/megatron-lm!250
-
Mohammad Shoeybi authored
-
- 17 Mar, 2021 2 commits
-
-
Jared Casper authored
softmax data load/store optimization See merge request ADLR/megatron-lm!249
-
Vijay Korthikanti authored
-
- 16 Mar, 2021 2 commits
-
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 08 Mar, 2021 2 commits
-
-
Jared Casper authored
Bfloat with fp32 grad acc See merge request ADLR/megatron-lm!247
-
Mohammad Shoeybi authored
-
- 04 Mar, 2021 6 commits
-
-
Jared Casper authored
Rc debug underflow See merge request ADLR/megatron-lm!246
-
Rewon Child authored
-
Rewon Child authored
-
Rewon Child authored
-
Rewon Child authored
-
Rewon Child authored
-
- 03 Mar, 2021 5 commits
-
-
Jared Casper authored
Retrieval index building See merge request ADLR/megatron-lm!239
-
Jared Casper authored
Get PyTorch batched communication API working for interleaved schedule See merge request ADLR/megatron-lm!242
-
Jared Casper authored
Remove pipeline stall timing See merge request ADLR/megatron-lm!244
-
Deepak Narayanan authored
-
Deepak Narayanan authored
-