Transformer Encoder: when embedding width differs from hidden size, add a projection to hidden size.

PiperOrigin-RevId: 312708922
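
The change described above can be illustrated with a minimal, dependency-free sketch. This is not the code from this commit: the function names `make_projection` and `project`, the initializer scale, and the list-of-lists tensor representation are all assumptions chosen for illustration. The idea shown is only that a dense projection is created when the embedding width and hidden size differ, and that embeddings pass through unchanged otherwise.

```python
import random


def make_projection(embedding_width, hidden_size, seed=0):
    """Build a dense projection, or return None when no projection is needed.

    Hypothetical helper: the weights have shape
    (embedding_width, hidden_size) and the bias has shape (hidden_size,).
    """
    if embedding_width == hidden_size:
        return None
    rng = random.Random(seed)
    # Small random init; the 0.02 scale is an assumption, not from the commit.
    weights = [
        [rng.gauss(0.0, 0.02) for _ in range(hidden_size)]
        for _ in range(embedding_width)
    ]
    bias = [0.0] * hidden_size
    return weights, bias


def project(embeddings, projection):
    """Map per-token embeddings to the hidden size expected by the encoder.

    `embeddings` is a list of token vectors, each of length embedding_width.
    """
    if projection is None:
        # Widths already match, so the embeddings are used as-is.
        return embeddings
    weights, bias = projection
    return [
        [
            sum(x * w_row[j] for x, w_row in zip(token, weights)) + bias[j]
            for j in range(len(bias))
        ]
        for token in embeddings
    ]


# Usage: 3 tokens with embedding width 4, projected to hidden size 6.
tokens = [[1.0, 0.0, 0.5, -0.5] for _ in range(3)]
hidden = project(tokens, make_projection(embedding_width=4, hidden_size=6))
```

With a narrower embedding table, the embedding parameters scale with the embedding width rather than the hidden size, while the transformer layers still receive inputs of the hidden size.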