- 27 Aug, 2021 2 commits
-
-
Ryan Prenger authored
-
Jared Casper authored
Revisited distributing checkpointed activations along the tensor parallel ranks See merge request ADLR/megatron-lm!311
-
- 24 Aug, 2021 1 commit
-
-
Jared Casper authored
Fused softmax checks and additions from Github (#133) See merge request ADLR/megatron-lm!312
-
- 23 Aug, 2021 7 commits
-
-
mshoeybi authored
-
hyunwoongko authored
-
hyunwoongko authored
-
hyunwoongko authored
-
hyunwoongko authored
-
hyunwoongko authored
-
mshoeybi authored
-
- 21 Aug, 2021 5 commits
- 19 Aug, 2021 9 commits
-
-
Jared Casper authored
Checkpoint a set number of individual Transformer layers See merge request ADLR/megatron-lm!301
-
Jared Casper authored
Memory optimizations for contiguous buffers See merge request ADLR/megatron-lm!310
-
slym authored
-
slym authored
-
slym authored
consider the case of pipeline-model prallelism clean up arugments argument naming cleanup update readme and examples
-
mshoeybi authored
-
mshoeybi authored
-
mshoeybi authored
-
mshoeybi authored
-
- 18 Aug, 2021 3 commits
-
-
mshoeybi authored
-
Jared Casper authored
bug fix for previous MR (Flag to call empty_cache() each iteration, to reduce fragmentation) See merge request ADLR/megatron-lm!309
-
Lawrence McAfee authored
-
- 17 Aug, 2021 13 commits
-
-
Jared Casper authored
minor changes from github issues See merge request ADLR/megatron-lm!308
-
mshoeybi authored
-
Jared Casper authored
Flag to call empty_cache() each iteration, to reduce fragmentation See merge request ADLR/megatron-lm!306
-
Jared Casper authored
simplified the iteration read check across ranks See merge request ADLR/megatron-lm!307
-
Lawrence McAfee authored
-
mshoeybi authored
-
Jared Casper authored
some small PRs from github See merge request ADLR/megatron-lm!305
-
Jared Casper authored
added across ranks sync for checkpoint iteration load and couple of other fixes See merge request ADLR/megatron-lm!304
-
-
mshoeybi authored
Merge branch 'update-sample-bert-pretrain-args' of https://github.com/roclark/Megatron-LM into github_small_prs
-
-
-
-