"docs/zh_CN/TrainingService/PaiMode.rst" did not exist on "abc221589c65d75b494407c60a81ca87c3020463"
Jared Casper authored
Add SwiGLU and squared ReLU activations and the ability to disable bias. See merge request ADLR/megatron-lm!553
f6d36d03