Fix T5 incorrect weight decay in Trainer and official summarization example (#18002)
* Add ALL_LAYERNORM_LAYERS for LayerNorm * fix bug of appending layer norm
Showing
Please register or sign in to comment
* Add ALL_LAYERNORM_LAYERS for LayerNorm * fix bug of appending layer norm