- 01 Sep, 2021 1 commit
-
-
rprenger authored
-
- 31 Aug, 2021 2 commits
- 27 Aug, 2021 7 commits
-
-
Jared Casper authored
Use nvfuser at pytorch >= 1.10
See merge request ADLR/megatron-lm!314
-
Sangkug Lym authored
-
Jared Casper authored
Adding API server
See merge request ADLR/megatron-lm!294
-
Ryan Prenger authored
-
rprenger authored
-
rprenger authored
-
Jared Casper authored
Revisited distributing checkpointed activations along the tensor parallel ranks
See merge request ADLR/megatron-lm!311
-
- 26 Aug, 2021 4 commits
- 24 Aug, 2021 1 commit
-
-
Jared Casper authored
Fused softmax checks and additions from GitHub (#133)
See merge request ADLR/megatron-lm!312
-
- 23 Aug, 2021 7 commits
-
-
mshoeybi authored
-
hyunwoongko authored
-
hyunwoongko authored
-
hyunwoongko authored
-
hyunwoongko authored
-
hyunwoongko authored
-
mshoeybi authored
-
- 21 Aug, 2021 5 commits
- 19 Aug, 2021 9 commits
-
-
Jared Casper authored
Checkpoint a set number of individual Transformer layers
See merge request ADLR/megatron-lm!301
-
Jared Casper authored
Memory optimizations for contiguous buffers
See merge request ADLR/megatron-lm!310
-
slym authored
-
slym authored
-
slym authored
consider the case of pipeline-model parallelism; clean up arguments; argument naming cleanup; update readme and examples
-
mshoeybi authored
-
mshoeybi authored
-
mshoeybi authored
-
mshoeybi authored
-
- 18 Aug, 2021 3 commits
-
-
mshoeybi authored
-
Jared Casper authored
bug fix for previous MR (Flag to call empty_cache() each iteration, to reduce fragmentation)
See merge request ADLR/megatron-lm!309
-
Lawrence McAfee authored
-
- 17 Aug, 2021 1 commit
-
-
Jared Casper authored
minor changes from GitHub issues
See merge request ADLR/megatron-lm!308
-