- 27 Jul, 2022 1 commit
-
-
Jared Casper authored
added a flag to be able to switch between pytorch and ring exchange p2p See merge request ADLR/megatron-lm!434
-
- 26 Jul, 2022 3 commits
-
-
Mohammad Shoeybi authored
-
Jared Casper authored
Remove old merge tool. See merge request ADLR/megatron-lm!433
-
Jared Casper authored
-
- 20 Jul, 2022 2 commits
-
-
Jared Casper authored
Adding checks for total number of tokens to keep server from crashing See merge request ADLR/megatron-lm!428
-
Jared Casper authored
Implements the top_p decay and top_p bound parameters so from the Factual Sampling work See merge request ADLR/megatron-lm!423
-
- 19 Jul, 2022 5 commits
-
-
Jared Casper authored
Distributed optimizer readme section. See merge request ADLR/megatron-lm!429
-
Jared Casper authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
- 18 Jul, 2022 4 commits
-
-
rprenger authored
-
rprenger authored
-
Jared Casper authored
Checkpoint conversion utility See merge request ADLR/megatron-lm!289
-
Jared Casper authored
Fix bugs for beam search when using pipeline parallelization See merge request ADLR/megatron-lm!426
-
- 15 Jul, 2022 3 commits
- 12 Jul, 2022 1 commit
-
-
Jared Casper authored
-
- 24 Jun, 2022 6 commits
-
-
Jared Casper authored
-
Jared Casper authored
Update ci image. See merge request ADLR/megatron-lm!425
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
- 21 Jun, 2022 1 commit
-
-
rprenger authored
-
- 08 Jun, 2022 7 commits
-
-
rprenger authored
-
Jared Casper authored
Distributed optimizer See merge request ADLR/megatron-lm!408
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
- 01 Jun, 2022 1 commit
-
-
Lawrence McAfee authored
-
- 26 May, 2022 3 commits
-
-
Jared Casper authored
bug fix for get_data_parallel_src_rank See merge request ADLR/megatron-lm!420
-
Vijay Korthikanti authored
-
Jared Casper authored
Beam search See merge request ADLR/megatron-lm!396
-
- 25 May, 2022 2 commits
-
-
Peng Xu authored
-
Jared Casper authored
preallocating global buffer to avoid memory fragmentation See merge request ADLR/megatron-lm!419
-
- 24 May, 2022 1 commit
-
-
Vijay Korthikanti authored
-