camembert.rst 3.75 KB
Newer Older
Lysandre's avatar
Lysandre committed
1
2
3
CamemBERT
----------------------------------------------------

Sylvain Gugger's avatar
Sylvain Gugger committed
4
5
6
Overview
~~~~~~~~~~~~~~~~~~~~~

Lysandre's avatar
Lysandre committed
7
The CamemBERT model was proposed in `CamemBERT: a Tasty French Language Model <https://arxiv.org/abs/1911.03894>`__
Lysandre's avatar
Lysandre committed
8
by Louis Martin, Benjamin Muller, Pedro Javier Ortiz Su谩rez, Yoann Dupont, Laurent Romary, 脡ric Villemonte de la
Lysandre's avatar
Lysandre committed
9
10
Clergerie, Djam茅 Seddah, and Beno卯t Sagot. It is based on Facebook's RoBERTa model released in 2019. It is a model
trained on 138GB of French text.
Lysandre's avatar
Lysandre committed
11

Lysandre's avatar
Lysandre committed
12
The abstract from the paper is the following:
Lysandre's avatar
Lysandre committed
13

Lysandre's avatar
Lysandre committed
14
15
16
17
18
19
20
21
*Pretrained language models are now ubiquitous in Natural Language Processing. Despite their success,
most available models have either been trained on English data or on the concatenation of data in multiple
languages. This makes practical use of such models --in all languages except English-- very limited. Aiming
to address this issue for French, we release CamemBERT, a French version of the Bi-directional Encoders for
Transformers (BERT). We measure the performance of CamemBERT compared to multilingual models in multiple
downstream tasks, namely part-of-speech tagging, dependency parsing, named-entity recognition, and natural
language inference. CamemBERT improves the state of the art for most of the tasks considered. We release the
pretrained model for CamemBERT hoping to foster research and downstream applications for French NLP.*
Lysandre's avatar
Lysandre committed
22

Lysandre's avatar
Lysandre committed
23
24
25
26
27
Tips:

- This implementation is the same as RoBERTa. Refer to the `documentation of RoBERTa <./roberta.html>`__ for usage
  examples as well as the information relative to the inputs and outputs.

28
29
The original code can be found `here <https://camembert-model.fr/>`_.

Lysandre's avatar
Lysandre committed
30
CamembertConfig
Lysandre's avatar
Fixes  
Lysandre committed
31
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Lysandre's avatar
Lysandre committed
32
33
34
35
36

.. autoclass:: transformers.CamembertConfig
    :members:


Lysandre's avatar
Lysandre committed
37
CamembertTokenizer
Lysandre's avatar
Fixes  
Lysandre committed
38
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Lysandre's avatar
Lysandre committed
39
40

.. autoclass:: transformers.CamembertTokenizer
Lysandre Debut's avatar
Lysandre Debut committed
41
42
    :members: build_inputs_with_special_tokens, get_special_tokens_mask,
        create_token_type_ids_from_sequences, save_vocabulary
Lysandre's avatar
Lysandre committed
43
44


Lysandre's avatar
Lysandre committed
45
CamembertModel
Lysandre's avatar
Fixes  
Lysandre committed
46
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Lysandre's avatar
Lysandre committed
47
48
49
50
51

.. autoclass:: transformers.CamembertModel
    :members:


Suraj Patil's avatar
Suraj Patil committed
52
53
54
55
56
57
58
CamembertForCausalLM
~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.CamembertForCausalLM
    :members:


Lysandre's avatar
Lysandre committed
59
CamembertForMaskedLM
Lysandre's avatar
Fixes  
Lysandre committed
60
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Lysandre's avatar
Lysandre committed
61
62
63
64
65

.. autoclass:: transformers.CamembertForMaskedLM
    :members:


Lysandre's avatar
Lysandre committed
66
CamembertForSequenceClassification
Lysandre's avatar
Fixes  
Lysandre committed
67
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Lysandre's avatar
Lysandre committed
68
69
70
71
72

.. autoclass:: transformers.CamembertForSequenceClassification
    :members:


Lysandre's avatar
Lysandre committed
73
CamembertForMultipleChoice
Lysandre's avatar
Fixes  
Lysandre committed
74
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Lysandre's avatar
Lysandre committed
75
76
77
78
79

.. autoclass:: transformers.CamembertForMultipleChoice
    :members:


Lysandre's avatar
Lysandre committed
80
CamembertForTokenClassification
Lysandre's avatar
Fixes  
Lysandre committed
81
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Lysandre's avatar
Lysandre committed
82
83
84

.. autoclass:: transformers.CamembertForTokenClassification
    :members:
Lysandre's avatar
Lysandre committed
85
86


Sylvain Gugger's avatar
Sylvain Gugger committed
87
88
89
90
91
92
93
CamembertForQuestionAnswering
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.CamembertForQuestionAnswering
    :members:


Lysandre's avatar
Lysandre committed
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
TFCamembertModel
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFCamembertModel
    :members:


TFCamembertForMaskedLM
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFCamembertForMaskedLM
    :members:


TFCamembertForSequenceClassification
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFCamembertForSequenceClassification
    :members:


115
116
117
118
119
120
121
TFCamembertForMultipleChoice
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFCamembertForMultipleChoice
    :members:


Lysandre's avatar
Lysandre committed
122
123
124
125
126
TFCamembertForTokenClassification
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFCamembertForTokenClassification
    :members:
Sylvain Gugger's avatar
Sylvain Gugger committed
127
128
129
130
131
132
133


TFCamembertForQuestionAnswering
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.TFCamembertForQuestionAnswering
    :members: