1. 01 Apr, 2020 1 commit
  2. 25 Feb, 2020 2 commits
    • Documentation (#2989) · bb7c4685
      Lysandre Debut authored
      * All Tokenizers
      
      BertTokenizer + few fixes
      RobertaTokenizer
      OpenAIGPTTokenizer + Fixes
      GPT2Tokenizer + fixes
      TransfoXLTokenizer
      Correct rst for TransformerXL
      XLMTokenizer + fixes
      XLNet Tokenizer + Style
      DistilBERT + Fix XLNet RST
      CTRLTokenizer
      CamemBERT Tokenizer
      FlaubertTokenizer
      XLMRobertaTokenizer
      cleanup
      
      * cleanup
    • Change masking to direct labeling for TPU support. (#2982) · e8ce63ff
      srush authored
      * change masking to direct labeling
      
      * fix black
      
      * switch to ignore index
      
      * .
      
      * fix black
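
      The commit above replaces masking with direct labeling plus an ignore
      index, presumably to avoid the data-dependent tensor shapes that TPUs
      handle poorly. A minimal sketch of that scheme, assuming a BERT-style
      masked-LM setup; the function name, the 15% masking probability and the
      [MASK] token id below are illustrative, not taken from the commit.

          import torch

          def build_mlm_labels(input_ids: torch.Tensor, mlm_probability: float = 0.15):
              """Return (masked_inputs, labels) with -100 at unmasked positions.

              Illustrative sketch only; not the library's actual masking code.
              """
              labels = input_ids.clone()
              # Sample which positions to mask.
              probability_matrix = torch.full(labels.shape, mlm_probability)
              masked_indices = torch.bernoulli(probability_matrix).bool()

              # Direct labeling: keep a full-size label tensor and set every
              # position that is NOT masked to the ignore index (-100) instead of
              # slicing out the masked positions, so tensor shapes stay static.
              labels[~masked_indices] = -100

              masked_inputs = input_ids.clone()
              mask_token_id = 103  # placeholder id for [MASK]
              masked_inputs[masked_indices] = mask_token_id
              return masked_inputs, labels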
  3. 21 Feb, 2020 1 commit
  4. 13 Feb, 2020 1 commit
  5. 11 Feb, 2020 1 commit
    • BERT decoder: Fix causal mask dtype. · ee5de0ba
      Oleksiy Syvokon authored
      PyTorch < 1.3 requires multiplication operands to be of the same type.
      This was violated when using the default attention mask (i.e.,
      attention_mask=None in the arguments) with BERT in decoder mode.

      In particular, this was breaking Model2Model and made the tutorial
      from the quickstart fail.
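
      A hedged sketch of the kind of fix described above: on PyTorch < 1.3,
      multiplying a floating-point padding mask by a uint8 causal mask raises a
      type error, so the causal mask is cast to the padding mask's dtype before
      the two are combined. The helper name and shapes are illustrative, not the
      exact code in the BERT model.

          import torch

          def build_extended_attention_mask(attention_mask: torch.Tensor, dtype: torch.dtype) -> torch.Tensor:
              """Combine a padding mask and a causal mask with consistent dtypes."""
              seq_length = attention_mask.shape[1]
              # Lower-triangular causal mask: position i may attend to positions <= i.
              seq_ids = torch.arange(seq_length, device=attention_mask.device)
              causal_mask = seq_ids[None, None, :] <= seq_ids[None, :, None]
              # Cast the comparison result to the floating dtype of the padding
              # mask; PyTorch < 1.3 refuses to multiply tensors of different types.
              causal_mask = causal_mask.to(dtype)
              padding_mask = attention_mask[:, None, None, :].to(dtype)
              extended_mask = causal_mask[:, None, :, :] * padding_mask
              # Turn the {0, 1} mask into an additive mask for attention scores.
              return (1.0 - extended_mask) * -10000.0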
  6. 07 Feb, 2020 1 commit
  7. 04 Feb, 2020 1 commit
  8. 03 Feb, 2020 1 commit
    • [Follow up 213] · 239dd23f
      Lysandre authored
      Masked indices should have -100 and not -1. Updating documentation + scripts that were forgotten.
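
      For context, -100 is the default ignore_index of PyTorch's cross-entropy
      loss, so positions labelled -100 simply do not contribute to the loss. A
      quick illustration (not taken from the commit):

          import torch
          import torch.nn as nn

          loss_fct = nn.CrossEntropyLoss()           # default ignore_index is -100
          logits = torch.randn(4, 10)                # 4 positions, 10-token vocabulary
          labels = torch.tensor([3, -100, 7, -100])  # -100 entries are skipped
          loss = loss_fct(logits, labels)            # averaged over the 2 real labels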
  9. 28 Jan, 2020 1 commit
  10. 23 Jan, 2020 7 commits
  11. 15 Jan, 2020 1 commit
  12. 14 Jan, 2020 1 commit
    • Bias should be resized with the weights · 100e3b6f
      Lysandre authored
      Created a link between the linear layer bias and the model attribute bias. This does not change anything for the user or for the conversion scripts, but it allows the `resize_token_embeddings` method to resize the bias as well as the weights of the decoder.
      
      Added a test.
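
      A minimal sketch of the described link, with a hypothetical class name and
      layout: the model-level bias attribute and the linear decoder's bias end up
      being the same Parameter, which is what lets `resize_token_embeddings` keep
      the bias in step with the decoder weights.

          import torch
          import torch.nn as nn

          class ToyLMPredictionHead(nn.Module):
              """Hypothetical head illustrating the bias link; not the library's code."""

              def __init__(self, hidden_size: int, vocab_size: int):
                  super().__init__()
                  self.decoder = nn.Linear(hidden_size, vocab_size, bias=False)
                  self.bias = nn.Parameter(torch.zeros(vocab_size))
                  # Link the layer's bias to the model attribute: both names now
                  # refer to the same Parameter, so resizing logic that rebuilds one
                  # can keep the other consistent instead of leaving a stale,
                  # wrongly sized bias behind.
                  self.decoder.bias = self.bias

              def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
                  return self.decoder(hidden_states)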
  13. 07 Jan, 2020 2 commits
  14. 06 Jan, 2020 3 commits
  15. 22 Dec, 2019 6 commits
  16. 21 Dec, 2019 1 commit
    • Reformat source code with black. · fa84ae26
      Aymeric Augustin authored
      This is the result of:
      
          $ black --line-length 119 examples templates transformers utils hubconf.py setup.py
      
      There are a lot of fairly long lines in the project. As a consequence, I'm
      picking the longest widely accepted line length, 119 characters.

      This is also Thomas' preference, because it allows for explicit variable
      names, which makes the code easier to understand.
  17. 18 Dec, 2019 3 commits
  18. 11 Dec, 2019 2 commits
  19. 10 Dec, 2019 3 commits
  20. 09 Dec, 2019 1 commit
    • create encoder attention mask from shape of hidden states · 3520be78
      Rémi Louf authored
      We currently create encoder attention masks (when they're not provided)
      based on the shape of the inputs to the decoder. This is obviously
      wrong; the encoder and decoder sequences can be of different lengths. We
      now create the encoder attention mask based on the batch_size and
      sequence_length of the encoder hidden states.
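
      A hedged sketch of the behaviour described above: when no encoder attention
      mask is passed, the default mask is built from the encoder hidden states'
      batch size and sequence length rather than from the decoder's inputs, since
      the two sequences can have different lengths. The function name is
      illustrative.

          import torch

          def default_encoder_attention_mask(encoder_hidden_states: torch.Tensor) -> torch.Tensor:
              """All-ones mask sized to the encoder output, not the decoder input."""
              batch_size, seq_length, _ = encoder_hidden_states.size()
              # One entry per encoder position: the decoder may attend to every
              # position the encoder actually produced.
              return torch.ones(batch_size, seq_length, device=encoder_hidden_states.device)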