- 08 Sep, 2021 1 commit
-
-
rprenger authored
-
- 07 Sep, 2021 2 commits
-
-
Jared Casper authored
use low-priority stream for nccl overlapping all-reduce and gemm See merge request ADLR/megatron-lm!319
-
Sangkug Lym authored
-
- 03 Sep, 2021 2 commits
-
-
Jared Casper authored
allreduce overlap with wgrad gemm See merge request ADLR/megatron-lm!316
-
slym authored
-
- 02 Sep, 2021 4 commits
-
-
slym authored
-
slym authored
-
Sangkug Lym authored
allreduce overlap with wgrad gemm change custom delay to dummy add
-
Jared Casper authored
Letting server return the log-probabilities of the context and generated text See merge request ADLR/megatron-lm!317
-
- 01 Sep, 2021 2 commits
- 31 Aug, 2021 2 commits
- 27 Aug, 2021 7 commits
-
-
Jared Casper authored
Use nvfuser at pytorch >= 1.10 See merge request ADLR/megatron-lm!314
-
Sangkug Lym authored
-
Jared Casper authored
Adding API server See merge request ADLR/megatron-lm!294
-
Ryan Prenger authored
-
rprenger authored
-
rprenger authored
-
Jared Casper authored
Revisited distributing checkpointed activations along the tensor parallel ranks See merge request ADLR/megatron-lm!311
-
- 26 Aug, 2021 4 commits
- 24 Aug, 2021 1 commit
-
-
Jared Casper authored
Fused softmax checks and additions from Github (#133) See merge request ADLR/megatron-lm!312
-
- 23 Aug, 2021 7 commits
-
-
mshoeybi authored
-
hyunwoongko authored
-
hyunwoongko authored
-
hyunwoongko authored
-
hyunwoongko authored
-
hyunwoongko authored
-
mshoeybi authored
-
- 21 Aug, 2021 5 commits
- 19 Aug, 2021 3 commits
-
-
Jared Casper authored
Checkpoint a set number of individual Transformer layers See merge request ADLR/megatron-lm!301
-
Jared Casper authored
Memory optimizations for contiguous buffers See merge request ADLR/megatron-lm!310
-
slym authored
-