XLNet
----------------------------------------------------

The XLNet model was proposed in `XLNet: Generalized Autoregressive Pretraining for Language Understanding`_
by Zhilin Yang*, Zihang Dai*, Yiming Yang, Jaime Carbonell, Ruslan Salakhutdinov, Quoc V. Le.
XLNet is an extension of the Transformer-XL model, pre-trained using an autoregressive method
to learn bidirectional contexts by maximizing the expected likelihood over all permutations
of the input sequence factorization order.

The specific attention pattern can be controlled at training and test time using the `perm_mask` input.
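
As a rough illustration of how a factorization order translates into such a mask, the sketch below
builds a mask for a single sequence in plain Python. The helper name `perm_mask_from_order` is
hypothetical and not part of the library; it assumes the convention that a value of 1.0 at `[i][j]`
means position `i` does not attend to position `j`, and that a position only sees positions that come
strictly earlier in the factorization order.

.. code-block:: python

    def perm_mask_from_order(order):
        """Build a seq_len x seq_len mask from a factorization order.

        `order` lists the positions in the order in which they are predicted.
        mask[i][j] == 1.0 means position i may NOT attend to position j.
        """
        seq_len = len(order)
        # rank[p] is the step at which position p appears in the order.
        rank = [0] * seq_len
        for step, position in enumerate(order):
            rank[position] = step
        # Position i sees position j only if j comes strictly earlier in the order.
        return [
            [0.0 if rank[j] < rank[i] else 1.0 for j in range(seq_len)]
            for i in range(seq_len)
        ]


    # Hypothetical order: predict position 2 first, then 0, then 1.
    mask = perm_mask_from_order([2, 0, 1])
    for row in mask:
        print(row)

In practice the nested lists would be converted to a float tensor with a leading batch dimension
before being passed as `perm_mask`.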

Due to the difficulty of training a fully auto-regressive model over various factorization orders,
XLNet is pretrained using only a subset of the output tokens as targets, which are selected
with the `target_mapping` input.
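
A minimal sketch of what such a selection could look like for one sequence is given below. The helper
`build_target_mapping` is a hypothetical name used only for illustration; it assumes each prediction
target is encoded as a one-hot row over the sequence positions, so that a value of 1.0 at `[i][j]`
means the `i`-th prediction is made for token `j`.

.. code-block:: python

    def build_target_mapping(seq_len, target_positions):
        """Return one one-hot row per prediction target.

        mapping[i][j] == 1.0 means the i-th prediction is made for token j.
        """
        mapping = []
        for position in target_positions:
            row = [0.0] * seq_len
            row[position] = 1.0
            mapping.append(row)
        return mapping


    # Hypothetical choice: predict only positions 1 and 4 of a 5-token input.
    mapping = build_target_mapping(seq_len=5, target_positions=[1, 4])
    for row in mapping:
        print(row)

The result has one row per target, so the number of predictions can be much smaller than the sequence length.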

To use XLNet for sequential decoding (i.e. not in a fully bi-directional setting), use the `perm_mask` and
`target_mapping` inputs to control the attention span and outputs (see examples in `examples/run_generation.py`).
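
For next-token prediction, one common arrangement is to append a placeholder token to the prompt, hide it
from every position, and request a single prediction at that last position. The sketch below builds the two
inputs for a batch of one in plain Python; the function name `next_token_inputs` is hypothetical, and the
mask convention (1.0 means "do not attend") is an assumption stated in the comments.

.. code-block:: python

    def next_token_inputs(seq_len):
        """Build masks for predicting the final position of one sequence.

        Returns (perm_mask, target_mapping) with a leading batch dimension of 1.
        """
        last = seq_len - 1
        # Assumed convention: 1.0 at [i][j] means position i does not attend to j.
        # Every position is blocked from the last token, which is only a placeholder.
        perm_mask = [
            [1.0 if j == last else 0.0 for j in range(seq_len)]
            for _ in range(seq_len)
        ]
        # A single prediction, made for the last position.
        target_mapping = [[1.0 if j == last else 0.0 for j in range(seq_len)]]
        return [perm_mask], [target_mapping]


    # Hypothetical input: three prompt tokens plus one placeholder token.
    perm_mask, target_mapping = next_token_inputs(4)
    print(perm_mask[0][0])       # mask row for position 0
    print(target_mapping[0][0])  # one-hot row for the single prediction

The nested lists have shapes `(1, seq_len, seq_len)` and `(1, 1, seq_len)`; they would typically be converted
to float tensors and passed as `perm_mask` and `target_mapping` to a model with a language modeling head,
repeating the step with the chosen token appended to decode further.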


``XLNetConfig``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.XLNetConfig
    :members:


``XLNetTokenizer``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.XLNetTokenizer
    :members:


``XLNetModel``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.XLNetModel
    :members:


``XLNetLMHeadModel``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.XLNetLMHeadModel
    :members:


``XLNetForSequenceClassification``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.XLNetForSequenceClassification
    :members:


``XLNetForTokenClassification``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.XLNetForTokenClassification
    :members:


``XLNetForMultipleChoice``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.XLNetForMultipleChoice
    :members:


``XLNetForQuestionAnsweringSimple``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.XLNetForQuestionAnsweringSimple
    :members:


``XLNetForQuestionAnswering``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.XLNetForQuestionAnswering
    :members:


``TFXLNetModel``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFXLNetModel
    :members:


``TFXLNetLMHeadModel``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFXLNetLMHeadModel
    :members:


``TFXLNetForSequenceClassification``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFXLNetForSequenceClassification
    :members:


``TFXLNetForQuestionAnsweringSimple``
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFXLNetForQuestionAnsweringSimple
    :members: