- 02 Dec, 2021 2 commits
- 28 Nov, 2021 1 commit
-
-
zihanl authored
-
- 27 Nov, 2021 3 commits
- 26 Nov, 2021 3 commits
- 24 Nov, 2021 4 commits
-
-
zihanl authored
-
Jared Casper authored
Fixed distributed checkpoint activations to work on all layers See merge request ADLR/megatron-lm!365
-
Jared Casper authored
Add support for signal-based dynamic checkpointing See merge request ADLR/megatron-lm!361
-
Szymon Migacz authored
-
- 23 Nov, 2021 1 commit
-
-
Lawrence McAfee authored
-
- 22 Nov, 2021 5 commits
-
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
zihanl authored
-
zihanl authored
-
- 21 Nov, 2021 1 commit
-
-
zihanl authored
-
- 18 Nov, 2021 2 commits
-
-
Jared Casper authored
add a kernel import guard for persistent layer norm See merge request ADLR/megatron-lm!363
-
Sangkug Lym authored
-
- 17 Nov, 2021 1 commit
-
-
Jared Casper authored
Persistent layer norm See merge request ADLR/megatron-lm!351
-
- 11 Nov, 2021 1 commit
-
-
Sangkug Lym authored
fix the guard to fall back to the baseline fused layer norm kernel Persisten ln: move the guard for supported hidden sizes to layer norm module
-
- 05 Nov, 2021 1 commit
-
-
Jared Casper authored
Fix finetuning tasks after T5 pipeline merge. See merge request ADLR/megatron-lm!343
-
- 29 Oct, 2021 2 commits
-
-
Jared Casper authored
made model stateless with respect to inference See merge request ADLR/megatron-lm!348
-
mshoeybi authored
-
- 25 Oct, 2021 1 commit
-
-
Jared Casper authored
This adds a function for the case where the user only wants the log-probabilities "tokens_to_generate=0". See merge request ADLR/megatron-lm!345
-
- 22 Oct, 2021 1 commit
-
-
rprenger authored
-
- 20 Oct, 2021 1 commit
-
-
rprenger authored
-
- 18 Oct, 2021 3 commits
-
-
rprenger authored
-
Mohammad Shoeybi authored
Clarify README regarding benchmarks. See merge request ADLR/megatron-lm!344
-
Jared Casper authored
-
- 16 Oct, 2021 3 commits
- 15 Oct, 2021 4 commits
-
-
Jared Casper authored
Inference refactoring See merge request ADLR/megatron-lm!339
-
Jared Casper authored
-
rprenger authored
-
mshoeybi authored
-