- 06 Apr, 2023 14 commits
-
-
Jared Casper authored
Add support for GPTSentencePieceTokenizer and related fixes. See merge request ADLR/megatron-lm!561
-
Jared Casper authored
Untie Embeddings. See merge request ADLR/megatron-lm!558
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
layernorm1p added. See merge request ADLR/megatron-lm!557
-
Jared Casper authored
merging rope to main. See merge request ADLR/megatron-lm!556
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
Add swiglu and squared relu activations and ability to disable bias. See merge request ADLR/megatron-lm!553
-
Shanmugam Ramasamy authored
Update .gitlab-ci.yml. See merge request ADLR/megatron-lm!560
-
- 05 Apr, 2023 4 commits
-
-
Shanmugam Ramasamy authored
-
MaximumEntropy authored
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
- 04 Apr, 2023 5 commits
-
-
Sandeep Subramanian authored
-
Shanmugam Ramasamy authored
Better output. See merge request ADLR/megatron-lm!559
-
Shanmugam Ramasamy authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
- 03 Apr, 2023 14 commits
-
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
Jimmy Zhang authored
-
MaximumEntropy authored
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
-
MaximumEntropy authored
Merge branch 'untie_embeddings' of https://gitlab-master.nvidia.com/ADLR/megatron-lm into untie_embeddings
-
MaximumEntropy authored
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
-
Jimmy Zhang authored
-
Mostofa Patwary authored
-
Sandeep Subramanian authored
-
Jimmy Zhang authored
-
MaximumEntropy authored
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
-
MaximumEntropy authored
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
-
MaximumEntropy authored
Signed-off-by: MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
-
Jared Casper authored
-
- 02 Apr, 2023 1 commit
-
-
Mostofa Patwary authored
-
- 01 Apr, 2023 2 commits
-
-
Jared Casper authored
-
Jared Casper authored
Fix bug in core pipeline schedule. See merge request ADLR/megatron-lm!552
-