"git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "6f152572cd0fe5c84127a1bc25668e5fce918744"
Correct TF formatting to exclude LayerNorms from weight decay (#4448)
* Exclude LayerNorms from weight decay * Include both formats of layer norm
Showing
Please register or sign in to comment