- 16 Mar, 2021 1 commit
-
-
Vijay Korthikanti authored
-
- 08 Mar, 2021 2 commits
-
-
Jared Casper authored
Bfloat with fp32 grad acc See merge request ADLR/megatron-lm!247
-
Mohammad Shoeybi authored
-
- 04 Mar, 2021 6 commits
-
-
Jared Casper authored
Rc debug underflow See merge request ADLR/megatron-lm!246
-
Rewon Child authored
-
Rewon Child authored
-
Rewon Child authored
-
Rewon Child authored
-
Rewon Child authored
-
- 03 Mar, 2021 6 commits
-
-
Jared Casper authored
Retrieval index building See merge request ADLR/megatron-lm!239
-
Jared Casper authored
Get PyTorch batched communication API working for interleaved schedule See merge request ADLR/megatron-lm!242
-
Jared Casper authored
Remove pipeline stall timing See merge request ADLR/megatron-lm!244
-
Deepak Narayanan authored
-
Deepak Narayanan authored
-
Deepak Narayanan authored
-
- 27 Feb, 2021 1 commit
-
-
Jared Casper authored
Fix deadlock when get_num_microbatches() < pipeline-parallel size See merge request ADLR/megatron-lm!243
-
- 26 Feb, 2021 3 commits
-
-
Deepak Narayanan authored
Fix deadlock when get_num_microbatches() < pipeline-parallel size (don't try to measure pipeline stall)
-
Jared Casper authored
Support Torch DDP for single-stage, num_microbatches() > 1 See merge request ADLR/megatron-lm!240
-
Deepak Narayanan authored
-
- 25 Feb, 2021 3 commits
-
-
Mohammad Shoeybi authored
Don't import deprecated model from realm_model which is broken. See merge request ADLR/megatron-lm!241
-
Jared Casper authored
-
Jared Casper authored
fix warning condition See merge request ADLR/megatron-lm!238
-
- 23 Feb, 2021 6 commits
-
-
Rewon Child authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
Vijay Korthikanti authored
-
Jared Casper authored
Storing and loading fingerprints in deduplication See merge request ADLR/megatron-lm!236
-
- 22 Feb, 2021 3 commits
-
-
Mostofa Patwary authored
-
Jared Casper authored
Fix interleaved schedule assertion See merge request ADLR/megatron-lm!237
-
Deepak Narayanan authored
-
- 19 Feb, 2021 5 commits
-
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
- 18 Feb, 2021 4 commits
-
-
Jared Casper authored
Bug Jaccard similarity and filtering n-grams See merge request ADLR/megatron-lm!234
-
Jared Casper authored
ICT Retriever See merge request ADLR/megatron-lm!235
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-