"test/git@developer.sourcefind.cn:OpenDAS/torchaudio.git" did not exist on "6e0af71353a88b1d6e378b4d97a0b7a608600169"
Xlnet outputs (#5883)
Slightly breaking change, changes functionality for `use_cache` in XLNet: if use_cache is True and mem_len is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time `use_cache` is overriden and always True.
Showing
Please register or sign in to comment