- 19 Apr, 2023 2 commits
-
-
Jared Casper authored
Some quick fixes to checkpoint_util. See merge request ADLR/megatron-lm!568
-
Jared Casper authored
-
- 14 Apr, 2023 1 commit
-
-
Jared Casper authored
Update tests with new gold numbers See merge request ADLR/megatron-lm!565
-
- 13 Apr, 2023 1 commit
-
-
Jared Casper authored
Recent commit changed the bias dropout add jitted function which slightly changes numbers, manually tested to be accurate.
-
- 09 Apr, 2023 1 commit
-
-
Jared Casper authored
Add parallel_state helper functions See merge request ADLR/megatron-lm!564
-
- 07 Apr, 2023 4 commits
-
-
Abhinav Khattar authored
Signed-off-by:Abhinav Khattar <aklife97@gmail.com>
-
Jared Casper authored
Some quick fixes. See merge request ADLR/megatron-lm!563
-
Jared Casper authored
-
Jared Casper authored
Flash Attention inference fix See merge request ADLR/megatron-lm!562
-
- 06 Apr, 2023 17 commits
-
-
Jimmy Zhang authored
-
Jimmy Zhang authored
-
Jimmy Zhang authored
-
Jared Casper authored
Add support for GPTSentencePieceTokenizer and related fixes. See merge request ADLR/megatron-lm!561
-
Jared Casper authored
Untie Embeddings See merge request ADLR/megatron-lm!558
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
layernorm1p added See merge request ADLR/megatron-lm!557
-
Jared Casper authored
merging rope to main See merge request ADLR/megatron-lm!556
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
-
Jared Casper authored
Add swiglu and squared relu activations and ability to disable bias. See merge request ADLR/megatron-lm!553
-
Shanmugam Ramasamy authored
Update .gitlab-ci.yml See merge request ADLR/megatron-lm!560
-
- 05 Apr, 2023 4 commits
-
-
Shanmugam Ramasamy authored
-
MaximumEntropy authored
Signed-off-by:MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
- 04 Apr, 2023 5 commits
-
-
Sandeep Subramanian authored
-
Shanmugam Ramasamy authored
Better output See merge request ADLR/megatron-lm!559
-
Shanmugam Ramasamy authored
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
- 03 Apr, 2023 5 commits
-
-
Mostofa Patwary authored
-
Mostofa Patwary authored
-
Jimmy Zhang authored
-
MaximumEntropy authored
Signed-off-by:MaximumEntropy <sandeep.subramanian.1@umontreal.ca>
-
MaximumEntropy authored
Merge branch 'untie_embeddings' of https://gitlab-master.nvidia.com/ADLR/megatron-lm into untie_embeddings
-