"...git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "4965aee064f7aaf380269e88b2dd650867fb2199"
Correct TF formatting to exclude LayerNorms from weight decay (#4448)
* Exclude LayerNorms from weight decay * Include both formats of layer norm
Showing
Please register or sign in to comment