Unverified Commit fcdb85e9 authored by Sylvain Gugger's avatar Sylvain Gugger Committed by GitHub
Browse files

Fix reference to XLNet (#11846)

parent f5806041
...@@ -202,7 +202,7 @@ class TrainingArguments: ...@@ -202,7 +202,7 @@ class TrainingArguments:
Number of subprocesses to use for data loading (PyTorch only). 0 means that the data will be loaded in the Number of subprocesses to use for data loading (PyTorch only). 0 means that the data will be loaded in the
main process. main process.
past_index (:obj:`int`, `optional`, defaults to -1): past_index (:obj:`int`, `optional`, defaults to -1):
Some models like :doc:`TransformerXL <../model_doc/transformerxl>` or :doc`XLNet <../model_doc/xlnet>` can Some models like :doc:`TransformerXL <../model_doc/transformerxl>` or :doc:`XLNet <../model_doc/xlnet>` can
make use of the past hidden states for their predictions. If this argument is set to a positive int, the make use of the past hidden states for their predictions. If this argument is set to a positive int, the
``Trainer`` will use the corresponding output (usually index 2) as the past state and feed it to the model ``Trainer`` will use the corresponding output (usually index 2) as the past state and feed it to the model
at the next training step under the keyword argument ``mems``. at the next training step under the keyword argument ``mems``.
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment