Adding LM Head to Transfo-XL and first step to fixing problem with Adaptive Embeddings in TransfoXL (#3286)

* first commit
* work in progress
* make language generation task pass
* update to working version for LM
* delete print
* remove dead code
* make style