Add general attention classes
The modifications I introduced in a previous commit broke Bert's internal API. I reverted those changes and added more general classes to handle the encoder-decoder attention case. There may be a more elegant way to preserve backward compatibility (I am not comfortable with the current state of the code), but I cannot see it right now.