Add norm_first to Transformer Scaffold; Add an option in gated_feedword to...
Add norm_first to Transformer Scaffold; Add an option in gated_feedword to disable the output layer_norm. PiperOrigin-RevId: 333591020
Showing
Please register or sign in to comment