src/transformers/configuration_utils.py · 4b506a37e3e0ff679235961ba14dd9397843ef3a · chenpangpang / transformers


Teven authored Jul 18, 2020

Slightly breaking change: this changes the behavior of `use_cache` in XLNet. If `use_cache` is True and `mem_len` is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time, `use_cache` is overridden and is always True.
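The rule described in this commit message can be sketched in plain Python. This is only an illustration of the stated conditions, not the code from the commit: the helper names `effective_use_cache` and `returns_mems_as_past` are hypothetical, and the sketch says nothing about what happens when `mem_len` is a positive number, since the message does not describe that case.

```python
from typing import Optional


def effective_use_cache(use_cache: bool, training: bool) -> bool:
    """Hypothetical helper: per the commit message, `use_cache` is
    overridden and always True at training time."""
    return True if training else use_cache


def returns_mems_as_past(
    use_cache: bool, mem_len: Optional[int], training: bool = False
) -> bool:
    """Hypothetical helper: True when the model would behave like GPT-2,
    i.e. `use_cache` is in effect and `mem_len` is 0 or None."""
    cache_on = effective_use_cache(use_cache, training)
    no_mem_len = mem_len is None or mem_len == 0
    return cache_on and no_mem_len


if __name__ == "__main__":
    # Base model config case: mem_len is None and use_cache is True.
    print(returns_mems_as_past(use_cache=True, mem_len=None))
    # Caching disabled at inference time.
    print(returns_mems_as_past(use_cache=False, mem_len=None))
```

The override for training is kept in its own function so that the two conditions from the message (training override, `mem_len` check) stay visibly separate.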


configuration_utils.py 25.3 KB
