- 18 Jul, 2022 1 commit
-
-
Jared Casper authored
Fix bugs for beam search when using pipeline parallelization See merge request ADLR/megatron-lm!426
-
- 15 Jul, 2022 3 commits
- 24 Jun, 2022 2 commits
-
-
Jared Casper authored
Update ci image. See merge request ADLR/megatron-lm!425
-
Jared Casper authored
-
- 08 Jun, 2022 6 commits
-
-
Jared Casper authored
Distributed optimizer See merge request ADLR/megatron-lm!408
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
- 01 Jun, 2022 1 commit
-
-
Lawrence McAfee authored
-
- 26 May, 2022 3 commits
-
-
Jared Casper authored
bug fix for get_data_parallel_src_rank See merge request ADLR/megatron-lm!420
-
Vijay Korthikanti authored
-
Jared Casper authored
Beam search See merge request ADLR/megatron-lm!396
-
- 25 May, 2022 2 commits
-
-
Peng Xu authored
-
Jared Casper authored
preallocating global buffer to avoid memory fragmentation See merge request ADLR/megatron-lm!419
-
- 24 May, 2022 9 commits
-
-
Vijay Korthikanti authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Vijay Korthikanti authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
-
Lawrence McAfee authored
merge sequence parallelism's layernorm all-reduce into distributed optimizer.
-
Jared Casper authored
fix for sequence parallelism in bert pooling See merge request ADLR/megatron-lm!418
-
Vijay Korthikanti authored
-
- 20 May, 2022 4 commits
-
-
Jared Casper authored
Update readme with sequence parallelism and selective activation recompute. See merge request ADLR/megatron-lm!417
-
Jared Casper authored
Sequence parallelism + attention checkpoint See merge request ADLR/megatron-lm!413
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
- 19 May, 2022 4 commits
-
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
- 18 May, 2022 1 commit
-
-
Vijay Korthikanti authored
-
- 17 May, 2022 4 commits
-
-
Jared Casper authored
Data preprocessing testing changes + fixes See merge request ADLR/megatron-lm!416
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-
Vijay Korthikanti authored
-