BERT
----------------------------------------------------

Overview
~~~~~~~~~~~~~~~~~~~~~

The BERT model was proposed in `BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding <https://arxiv.org/abs/1810.04805>`__
by Jacob Devlin, Ming-Wei Chang, Kenton Lee and Kristina Toutanova. It is a bidirectional transformer
pre-trained with a combination of the masked language modeling and next sentence prediction objectives
on a large corpus comprising the Toronto Book Corpus and Wikipedia.

The abstract from the paper is the following:

*We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations
from Transformers. Unlike recent language representation models, BERT is designed to pre-train deep bidirectional
representations from unlabeled text by jointly conditioning on both left and right context in all layers. As a result,
the pre-trained BERT model can be fine-tuned with just one additional output layer to create state-of-the-art models
for a wide range of tasks, such as question answering and language inference, without substantial task-specific
architecture modifications.*

*BERT is conceptually simple and empirically powerful. It obtains new state-of-the-art results on eleven natural
language processing tasks, including pushing the GLUE score to 80.5% (7.7% point absolute improvement), MultiNLI
accuracy to 86.7% (4.6% absolute improvement), SQuAD v1.1 question answering Test F1 to 93.2 (1.5 point absolute
improvement) and SQuAD v2.0 Test F1 to 83.1 (5.1 point absolute improvement).*

Tips:

- BERT uses absolute position embeddings, so it is usually advisable to pad the inputs on
  the right rather than the left.
- BERT was trained with the masked language modeling (MLM) and next sentence prediction (NSP) objectives. It is efficient at predicting masked
  tokens and at NLU in general, but is not optimal for text generation.
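
The right-padding advice in the first tip can be illustrated without loading a model. The sketch below is plain
Python, not the library's padding code; the helper ``pad_right``, the pad id ``0`` and the token ids are assumptions
chosen for illustration. Since absolute positions are counted from the left, right-padding keeps every real token at
the position it would occupy in the unpadded sequence, whereas left-padding would shift it.

```python
from typing import List, Tuple

PAD_ID = 0  # assumed pad token id, for illustration only


def pad_right(
    batch: List[List[int]], pad_id: int = PAD_ID
) -> Tuple[List[List[int]], List[List[int]]]:
    """Pad every sequence on the right to the length of the longest one.

    Returns the padded ids and an attention mask (1 = real token, 0 = padding).
    """
    max_len = max(len(seq) for seq in batch)
    input_ids, attention_mask = [], []
    for seq in batch:
        n_pad = max_len - len(seq)
        input_ids.append(seq + [pad_id] * n_pad)
        attention_mask.append([1] * len(seq) + [0] * n_pad)
    return input_ids, attention_mask


# Hypothetical token ids for two sequences of different lengths.
batch = [[101, 7592, 2088, 102], [101, 7592, 102]]
input_ids, attention_mask = pad_right(batch)

# Absolute positions of the real tokens: they start at 0 in every row,
# exactly as they would without any padding.
positions = [[i for i, m in enumerate(mask) if m == 1] for mask in attention_mask]

print(input_ids)
print(attention_mask)
print(positions)
```

In the library itself, tokenizers expose a ``padding_side`` attribute; for BERT it would normally be left at
``"right"``.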

The original code can be found `here <https://github.com/google-research/bert>`_.

BertConfig
~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertConfig
    :members:


BertTokenizer
~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertTokenizer
    :members: build_inputs_with_special_tokens, get_special_tokens_mask,
        create_token_type_ids_from_sequences, save_vocabulary
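
The sequence-building methods listed above follow BERT's input layout: a single sequence is wrapped as
``[CLS] A [SEP]`` and a pair as ``[CLS] A [SEP] B [SEP]``, with token type id 0 covering the first segment (including
``[CLS]`` and its ``[SEP]``) and 1 covering the second. The sketch below re-creates that layout in plain Python to
make it concrete. It is not the library's code: the function names are hypothetical, and the ids ``101`` and ``102``
for ``[CLS]`` and ``[SEP]`` are assumptions that depend on the vocabulary in use.

```python
from typing import List, Optional

CLS_ID = 101  # assumed id of [CLS]; depends on the vocabulary
SEP_ID = 102  # assumed id of [SEP]; depends on the vocabulary


def build_inputs(ids_a: List[int], ids_b: Optional[List[int]] = None) -> List[int]:
    """[CLS] A [SEP] for one sequence, [CLS] A [SEP] B [SEP] for a pair."""
    out = [CLS_ID] + ids_a + [SEP_ID]
    if ids_b is not None:
        out += ids_b + [SEP_ID]
    return out


def token_type_ids(ids_a: List[int], ids_b: Optional[List[int]] = None) -> List[int]:
    """0 for [CLS] A [SEP], 1 for B [SEP]."""
    first = [0] * (len(ids_a) + 2)
    if ids_b is None:
        return first
    return first + [1] * (len(ids_b) + 1)


def special_tokens_mask(ids_a: List[int], ids_b: Optional[List[int]] = None) -> List[int]:
    """1 where a special token was added, 0 for tokens of the sequences."""
    mask = [1] + [0] * len(ids_a) + [1]
    if ids_b is not None:
        mask += [0] * len(ids_b) + [1]
    return mask


# Hypothetical token ids for a sentence pair.
a, b = [7592, 2088], [2129, 2024, 2017]
pair_ids = build_inputs(a, b)
pair_types = token_type_ids(a, b)
pair_mask = special_tokens_mask(a, b)

print(pair_ids)
print(pair_types)
print(pair_mask)
```

All three lists have the same length, so they can be zipped position by position when inspecting an encoded pair.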


BertTokenizerFast
~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertTokenizerFast
    :members:


Bert specific outputs
~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.modeling_bert.BertForPreTrainingOutput
    :members:

.. autoclass:: transformers.modeling_tf_bert.TFBertForPreTrainingOutput
    :members:


BertModel
~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertModel
    :members:


BertForPreTraining
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertForPreTraining
    :members:


BertLMHeadModel
~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertLMHeadModel
    :members:


BertForMaskedLM
~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertForMaskedLM
    :members:


BertForNextSentencePrediction
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertForNextSentencePrediction
    :members:


BertForSequenceClassification
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertForSequenceClassification
    :members:


BertForMultipleChoice
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertForMultipleChoice
    :members:


BertForTokenClassification
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertForTokenClassification
    :members:


BertForQuestionAnswering
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.BertForQuestionAnswering
    :members:


TFBertModel
~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFBertModel
    :members:


TFBertForPreTraining
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFBertForPreTraining
    :members:


TFBertLMHeadModel
~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFBertLMHeadModel
    :members:


TFBertForMaskedLM
~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFBertForMaskedLM
    :members:


TFBertForNextSentencePrediction
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFBertForNextSentencePrediction
    :members:


TFBertForSequenceClassification
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFBertForSequenceClassification
    :members:


TFBertForMultipleChoice
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFBertForMultipleChoice
    :members:


TFBertForTokenClassification
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFBertForTokenClassification
    :members:


TFBertForQuestionAnswering
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFBertForQuestionAnswering
    :members: