- 29 Nov, 2022 3 commits
-
-
Jared Casper authored
Fix DropPath for hidden shape [s, b, h] See merge request ADLR/megatron-lm!485
-
Jared Casper authored
bug fixes in partitioned data preprocessor See merge request ADLR/megatron-lm!486
-
Mike Chrzanowski authored
-
- 24 Nov, 2022 2 commits
-
-
Boxin Wang authored
Open sourcing lm detoxification code See merge request ADLR/megatron-lm!471
-
Boxin Wang authored
-
- 22 Nov, 2022 1 commit
-
-
Yu Yao authored
-
- 17 Nov, 2022 2 commits
-
-
Mike Chrzanowski authored
Data Preprocessing Optimizations See merge request ADLR/megatron-lm!483
-
Vijay Korthikanti authored
-
- 11 Nov, 2022 2 commits
-
-
Jared Casper authored
Support for variable sequence lengths across micro-batches See merge request ADLR/megatron-lm!472
-
Vijay Korthikanti authored
-
- 10 Nov, 2022 2 commits
-
-
Jared Casper authored
ViT Backbone Tensor Shape Fix See merge request ADLR/megatron-lm!479
-
Yu Yao authored
-
- 08 Nov, 2022 2 commits
-
-
Jared Casper authored
Fix merge error. See merge request ADLR/megatron-lm!478
-
Jared Casper authored
-
- 02 Nov, 2022 2 commits
-
-
Jared Casper authored
Move most of mpu functionality into a new "Megatron core" See merge request ADLR/megatron-lm!462
-
Jared Casper authored
Sending in prompts with the wrong type hangs the server. This is a check to make sure it's a list See merge request ADLR/megatron-lm!473
-
- 27 Oct, 2022 1 commit
-
-
rprenger authored
-
- 20 Oct, 2022 2 commits
-
-
Jared Casper authored
Disable newline after colon See merge request ADLR/megatron-lm!469
-
Peng Xu authored
-
- 14 Oct, 2022 5 commits
-
-
Jared Casper authored
inverse_square_root learning param schedule See merge request ADLR/megatron-lm!466
-
Jared Casper authored
Remove noop used to try to force scheduling and check for environment variable instead. See merge request ADLR/megatron-lm!463
-
Jared Casper authored
-
Vijay Korthikanti authored
-
Jared Casper authored
Core merge main See merge request ADLR/megatron-lm!464
-
- 13 Oct, 2022 3 commits
-
-
Jared Casper authored
-
Jared Casper authored
Adding proper test cases See merge request ADLR/megatron-lm!460
-
Shanmugam Ramasamy authored
-
- 11 Oct, 2022 3 commits
-
-
Jared Casper authored
Validation dataset update 1 See merge request ADLR/megatron-lm!455
-
Jared Casper authored
different encoder/decoder num-layers support See merge request ADLR/megatron-lm!453
-
Vijay Korthikanti authored
-
- 10 Oct, 2022 1 commit
-
-
Anmol Gupta authored
-
- 07 Oct, 2022 6 commits
-
-
shanmugamr authored
-
shanmugamr authored
-
shanmugamr authored
-
shanmugamr authored
'
-
shanmugamr authored
'
-
shanmugamr authored
-
- 06 Oct, 2022 3 commits
-
-
shanmugamr authored
-
shanmugamr authored
-
shanmugamr authored
-