[gpt-neox] Add attention_bias config to support model trained without attention biases (#28126)
* add attention_bias hparam for a model trained without attention biases
* fix argument documentation error
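
A minimal sketch of what such a flag controls, in plain Python so it runs without the library. `ToyNeoXConfig` and `attention_param_shapes` are hypothetical names, not the library's classes. The parameter names `query_key_value` and `dense`, and the default of `True`, are assumptions chosen to illustrate the idea of keeping biases unless a checkpoint was trained without them.

```python
from dataclasses import dataclass


@dataclass
class ToyNeoXConfig:
    """Hypothetical stand-in for a model config; not the library class."""
    hidden_size: int = 64
    # Assumed default: keep biases so existing checkpoints are unaffected.
    attention_bias: bool = True


def attention_param_shapes(config):
    """Return the parameter shapes of a toy attention block.

    The two projections always have weights; bias vectors are only
    created when config.attention_bias is True.
    """
    h = config.hidden_size
    shapes = {
        "query_key_value.weight": (3 * h, h),  # fused Q, K, V projection
        "dense.weight": (h, h),                # output projection
    }
    if config.attention_bias:
        shapes["query_key_value.bias"] = (3 * h,)
        shapes["dense.bias"] = (h,)
    return shapes


with_bias = attention_param_shapes(ToyNeoXConfig())
without_bias = attention_param_shapes(ToyNeoXConfig(attention_bias=False))
```

With `attention_bias=False` the block has only the two weight matrices, so a checkpoint saved without attention biases would have no missing bias keys to fill in at load time.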