"git@developer.sourcefind.cn:chenpangpang/transformers.git" did not exist on "053efc5d2d2e87833e9b7290a0dd83fa77cd6ae8"
Fix T5 incorrect weight decay in Trainer and official summarization example (#18002)
* Add ALL_LAYERNORM_LAYERS for LayerNorm * fix bug of appending layer norm
Showing
Please register or sign in to comment