fix use of mems in Transformer-XL (#4826)
Fixed duplicated memory use in Transformer-XL generation, which led to bad predictions and poor performance.
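
A minimal sketch of the kind of duplication described above, not the actual diff from this commit. It assumes the problem arises when the full token sequence is fed to the model again at each generation step even though earlier tokens are already represented in `mems`. The helper name `select_step_inputs` and the plain-list representation of token ids are hypothetical and used only for illustration.

```python
from typing import Any, Dict, List, Optional


def select_step_inputs(
    input_ids: List[List[int]], mems: Optional[list]
) -> Dict[str, Any]:
    """Hypothetical helper: pick the inputs for one generation step.

    input_ids: one list of token ids per batch element (full sequence so far).
    mems: cached memory from previous steps, or None on the first step.
    """
    if mems:
        # Earlier tokens are assumed to be covered by mems already, so feed
        # only the newest token to avoid processing the context twice.
        return {"input_ids": [row[-1:] for row in input_ids], "mems": mems}
    # No memory yet: the model needs the whole prompt.
    return {"input_ids": input_ids}


# First step: no mems, the whole prompt is passed through.
first = select_step_inputs([[11, 12, 13]], None)

# Later step: mems exist, only the last token is passed.
later = select_step_inputs([[11, 12, 13, 14]], ["cached-state"])
```

Under that assumption, passing the full sequence together with `mems` would make the model attend to the same context twice, which matches the "bad predictions and performance" symptom in the description.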