Merge branch 'activations-bias' into 'main'
Add SwiGLU and squared ReLU activations, and the ability to disable bias. See merge request ADLR/megatron-lm!553
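For readers unfamiliar with the two activations named above, the following is a minimal pure-Python sketch of their usual definitions, plus a linear layer whose bias can be omitted. It is not the code from this merge request: the function names (`squared_relu`, `silu`, `swiglu`, `linear`) and the list-based representation are assumptions chosen for illustration, and the convention that SwiGLU splits its input into a gate half followed by a value half is likewise assumed.

```python
import math


def squared_relu(x):
    """Squared ReLU applied elementwise: max(v, 0) ** 2."""
    return [max(v, 0.0) ** 2 for v in x]


def silu(v):
    """SiLU (swish) of a scalar: v * sigmoid(v)."""
    return v / (1.0 + math.exp(-v))


def swiglu(x):
    """Split x into two halves (gate, value) and return silu(gate) * value.

    Assumption: the first half is the gate and the second half is the value.
    The output has half as many features as the input.
    """
    if len(x) % 2 != 0:
        raise ValueError("swiglu expects an even number of features")
    half = len(x) // 2
    gate, value = x[:half], x[half:]
    return [silu(g) * v for g, v in zip(gate, value)]


def linear(x, weight, bias=None):
    """Compute W x, adding b only when a bias is given.

    Passing bias=None models a layer with the bias disabled.
    """
    out = [sum(w * v for w, v in zip(row, x)) for row in weight]
    if bias is not None:
        out = [o + b for o, b in zip(out, bias)]
    return out
```

Because SwiGLU halves the feature dimension, a layer feeding it would typically need to produce twice as many features as the layer that consumes its output expects.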