Encoder Decoder Models
-----------------------------------------------------------------------------------------------------------------------

The :class:`~transformers.EncoderDecoderModel` can be used to initialize a sequence-to-sequence model with any
pretrained autoencoding model as the encoder and any pretrained autoregressive model as the decoder.
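
As a minimal sketch of the warm-starting described above, the helper below wraps
``EncoderDecoderModel.from_encoder_decoder_pretrained``. The helper name ``build_seq2seq`` and its argument names are
hypothetical and only serve as an illustration; they are not part of the library.

```python
from transformers import EncoderDecoderModel


def build_seq2seq(encoder_checkpoint: str, decoder_checkpoint: str) -> EncoderDecoderModel:
    """Warm-start a sequence-to-sequence model from two pretrained checkpoints.

    ``build_seq2seq`` is a hypothetical helper for illustration. Each argument
    is either a checkpoint name on the model hub or a path to a local
    directory written by ``save_pretrained``.
    """
    # The encoder weights are loaded as-is. The decoder is loaded as a causal
    # language model with cross-attention enabled; cross-attention weights that
    # are absent from the decoder checkpoint are randomly initialized, so the
    # combined model still has to be fine-tuned on the downstream task.
    return EncoderDecoderModel.from_encoder_decoder_pretrained(
        encoder_checkpoint, decoder_checkpoint
    )
```

For instance, ``build_seq2seq("bert-base-uncased", "bert-base-uncased")`` would download that checkpoint and use it for
both sides; the checkpoint name is only an example.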

The effectiveness of initializing sequence-to-sequence models with pretrained checkpoints for sequence generation tasks
was shown in `Leveraging Pre-trained Checkpoints for Sequence Generation Tasks <https://arxiv.org/abs/1907.12461>`__ by
Sascha Rothe, Shashi Narayan, and Aliaksei Severyn.

After such an :class:`~transformers.EncoderDecoderModel` has been trained or fine-tuned, it can be saved and loaded
just like any other model (see the examples for more information).
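
The save and load round trip can be sketched as follows. To keep the example runnable without downloading anything, it
assumes a tiny, randomly initialized BERT-style encoder and decoder; the sizes in ``tiny`` are arbitrary illustrative
values, not recommended settings.

```python
import tempfile

import torch
from transformers import BertConfig, EncoderDecoderConfig, EncoderDecoderModel

# Tiny, randomly initialized encoder and decoder (sizes are arbitrary
# assumptions) so that the sketch runs without any pretrained checkpoint.
tiny = dict(vocab_size=100, hidden_size=32, num_hidden_layers=1,
            num_attention_heads=2, intermediate_size=64)
config = EncoderDecoderConfig.from_encoder_decoder_configs(BertConfig(**tiny), BertConfig(**tiny))
model = EncoderDecoderModel(config=config)

with tempfile.TemporaryDirectory() as save_dir:
    # Writes the combined configuration and the weights of both sub-models.
    model.save_pretrained(save_dir)
    # Restores the encoder and the decoder together from the same directory.
    reloaded = EncoderDecoderModel.from_pretrained(save_dir)

# Compare every tensor of the original model with its reloaded counterpart.
reloaded_state = reloaded.state_dict()
weights_match = all(
    torch.equal(tensor, reloaded_state[name]) for name, tensor in model.state_dict().items()
)
```

In practice ``save_dir`` would be a persistent directory rather than a temporary one.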

An application of this architecture could be to leverage two pretrained :class:`~transformers.BertModel` instances as
the encoder and decoder for a summarization model, as was shown in `Text Summarization with Pretrained Encoders
<https://arxiv.org/abs/1908.08345>`__ by Yang Liu and Mirella Lapata.


EncoderDecoderConfig
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.EncoderDecoderConfig
    :members:
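
A short sketch of how a combined configuration might be assembled from two separate ones. The choice of a BERT encoder
and a GPT-2 decoder, both with default settings, is an assumption made for illustration; no weights are loaded.

```python
from transformers import BertConfig, EncoderDecoderConfig, GPT2Config

# Default configurations, used purely for illustration.
encoder_config = BertConfig()
decoder_config = GPT2Config()

config = EncoderDecoderConfig.from_encoder_decoder_configs(encoder_config, decoder_config)

# The combined configuration keeps one sub-configuration per side.
encoder_type = config.encoder.model_type
decoder_type = config.decoder.model_type

# The decoder side is flagged as a decoder that attends to the encoder output.
decoder_flags = (config.decoder.is_decoder, config.decoder.add_cross_attention)
print(encoder_type, decoder_type, decoder_flags)
```

Passing the resulting ``config`` to ``EncoderDecoderModel(config=config)`` would build a randomly initialized model with
that layout.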


EncoderDecoderModel
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

.. autoclass:: transformers.EncoderDecoderModel
    :members: forward, from_encoder_decoder_pretrained
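
A hedged sketch of a single ``forward`` call. It assumes a tiny, randomly initialized BERT-style encoder and decoder and
random token ids, so the logits are meaningless; the point is only to show which inputs go to which side and what shape
comes back. The sizes in ``tiny`` and the batch dimensions are arbitrary.

```python
import torch
from transformers import BertConfig, EncoderDecoderConfig, EncoderDecoderModel

# Tiny, randomly initialized model (sizes are arbitrary assumptions).
tiny = dict(vocab_size=100, hidden_size=32, num_hidden_layers=1,
            num_attention_heads=2, intermediate_size=64)
config = EncoderDecoderConfig.from_encoder_decoder_configs(BertConfig(**tiny), BertConfig(**tiny))
model = EncoderDecoderModel(config=config).eval()

# Random token ids stand in for tokenized text.
input_ids = torch.randint(0, 100, (2, 8))          # 2 source sequences of 8 tokens
decoder_input_ids = torch.randint(0, 100, (2, 5))  # 5 target-side tokens each

with torch.no_grad():
    # ``input_ids`` feed the encoder, ``decoder_input_ids`` feed the decoder.
    outputs = model(input_ids=input_ids, decoder_input_ids=decoder_input_ids, return_dict=True)

# One score per vocabulary entry for every decoder position.
logits_shape = tuple(outputs.logits.shape)
```

Here ``logits_shape`` is ``(batch_size, target_length, vocab_size)``.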