"vscode:/vscode.git/clone" did not exist on "ef7588b617df3b861b687ab6aefc95fb4e0c5e1e"
- 01 Jun, 2020 1 commit
  - Victor SANH authored
- 29 May, 2020 2 commits
  - Simon Böhm authored
    Change the example code to use encode_plus since the token_type_id wasn't being correctly set.
  - Patrick von Platen authored
    * add multiple choice for longformer
    * add models to docs
    * adapt docstring
    * add test to longformer
    * add longformer for mc in init and modeling auto
    * fix tests
- 19 May, 2020 1 commit
  - Julien Chaumond authored
    * Test case for #3936
    * multigpu tests pass on pytorch 1.4.0
    * Fixup
    * multigpu tests pass on pytorch 1.5.0
    * Update src/transformers/modeling_utils.py
    * Update src/transformers/modeling_utils.py
    * rename multigpu to require_multigpu
    * mode doc
- 30 Apr, 2020 1 commit
  - Jordan authored
- 29 Apr, 2020 1 commit
  - Julien Chaumond authored
    * [file_utils] use_cdn + documentation
    * Move to cdn. urls for weights
    * [urls] Hotfix for bert-base-japanese
- 28 Apr, 2020 1 commit
  - Patrick von Platen authored
    * change encoder decoder style to bart & t5 style
    * make encoder decoder generation dummy work for bert
    * make style
    * clean init config in encoder decoder
    * add tests for encoder decoder models
    * refactor and add last tests
    * refactor and add last tests
    * fix attn masks for bert encoder decoder
    * make style
    * refactor prepare inputs for Bert
    * refactor
    * finish encoder decoder
    * correct typo
    * add docstring to config
    * finish
    * add tests
    * better naming
    * make style
    * fix flake8
    * clean docstring
    * make style
    * rename
- 23 Apr, 2020 1 commit
  - Julien Chaumond authored
- 21 Apr, 2020 1 commit
  - Bharat Raghunathan authored
- 17 Apr, 2020 1 commit
  - Simon Böhm authored
    token_type_ids are converted into the segment embeddings. For question answering, they need to mark whether a token belongs to sequence 0 or sequence 1; encode_plus takes care of setting this parameter correctly and automatically.
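    For context, a minimal sketch of the pattern this change (and the 29 May entry above) moves the examples to; the model name and the question/context strings are only placeholders:

    ```python
    from transformers import BertTokenizer

    tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

    question = "Who was Jim Henson?"
    context = "Jim Henson was a nice puppet."

    # encode_plus builds the sentence-pair input and fills in token_type_ids itself:
    # 0 for question tokens, 1 for context tokens, which is what the segment
    # embeddings need for question answering.
    encoding = tokenizer.encode_plus(question, context, add_special_tokens=True, return_tensors="pt")

    input_ids = encoding["input_ids"]
    token_type_ids = encoding["token_type_ids"]
    ```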
- 16 Apr, 2020 2 commits
  - Patrick von Platen authored
  - Sam Shleifer authored
    * Delete some copy pasted code
- 03 Apr, 2020 1 commit
  - Lysandre Debut authored
    * Electra wip
    * helpers
    * Electra wip
    * Electra v1
    * ELECTRA may be saved/loaded
    * Generator & Discriminator
    * Embedding size instead of halving the hidden size
    * ELECTRA Tokenizer
    * Revert BERT helpers
    * ELECTRA Conversion script
    * Archive maps
    * PyTorch tests
    * Start fixing tests
    * Tests pass
    * Same configuration for both models
    * Compatible with base + large
    * Simplification + weight tying
    * Archives
    * Auto + Renaming to standard names
    * ELECTRA is uncased
    * Tests
    * Slight API changes
    * Update tests
    * wip
    * ElectraForTokenClassification
    * temp
    * Simpler arch + tests
      Removed ElectraForPreTraining which will be in a script
    * Conversion script
    * Auto model
    * Update links to S3
    * Split ElectraForPreTraining and ElectraForTokenClassification
    * Actually test PreTraining model
    * Remove num_labels from configuration
    * wip
    * wip
    * From discriminator and generator to electra
    * Slight API changes
    * Better naming
    * TensorFlow ELECTRA tests
    * Accurate conversion script
    * Added to conversion script
    * Fast ELECTRA tokenizer
    * Style
    * Add ELECTRA to README
    * Modeling Pytorch Doc + Real style
    * TF Docs
    * Docs
    * Correct links
    * Correct model intialized
    * random fixes
    * style
    * Addressing Patrick's and Sam's comments
    * Correct links in docs
- 01 Apr, 2020 1 commit
  - Anirudh Srinivasan authored
- 25 Feb, 2020 2 commits
  - Lysandre Debut authored
    * All Tokenizers
      BertTokenizer + few fixes
      RobertaTokenizer
      OpenAIGPTTokenizer + Fixes
      GPT2Tokenizer + fixes
      TransfoXLTokenizer
      Correct rst for TransformerXL
      XLMTokenizer + fixes
      XLNet Tokenizer + Style
      DistilBERT + Fix XLNet RST
      CTRLTokenizer
      CamemBERT Tokenizer
      FlaubertTokenizer
      XLMRobertaTokenizer
      cleanup
    * cleanup
  - srush authored
    * change masking to direct labelings
    * fix black
    * switch to ignore index
    * .
    * fix black
- 21 Feb, 2020 1 commit
  - Lysandre Debut authored
- 13 Feb, 2020 1 commit
  - Sam Shleifer authored
    * activations.py contains a mapping from string to activation function
    * resolves some `gelu` vs `gelu_new` ambiguity
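    As a rough illustration of the mapping that commit describes (the dictionary name and the exact set of entries here are assumptions, not the file's literal contents):

    ```python
    import math

    import torch
    import torch.nn.functional as F


    def gelu_new(x):
        # Tanh approximation of GELU (the GPT-2/BERT variant); this is the function
        # that the string "gelu_new" distinguishes from the exact erf-based "gelu".
        return 0.5 * x * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (x + 0.044715 * torch.pow(x, 3.0))))


    # String-to-callable registry: configs store a name, models look the function up.
    ACT2FN = {
        "relu": F.relu,
        "gelu": F.gelu,
        "gelu_new": gelu_new,
        "tanh": torch.tanh,
    }

    hidden_act = "gelu_new"          # e.g. the value a model config might carry
    activation = ACT2FN[hidden_act]  # resolved once, then used in the forward pass
    ```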
- 11 Feb, 2020 1 commit
  - Oleksiy Syvokon authored
    PyTorch < 1.3 requires multiplication operands to be of the same type. This was violated when the default attention mask was used (i.e., attention_mask=None in the arguments) with BERT in decoder mode. In particular, this broke Model2Model and made the quickstart tutorial fail.
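    A hedged sketch of the failure mode and the kind of cast that fixes it (illustrative shapes and variable names, not the library's exact code):

    ```python
    import torch

    batch, seq_len = 2, 5
    attention_mask = torch.ones(batch, seq_len)  # default mask when attention_mask=None (float)

    # In decoder mode a causal mask is built from an integer comparison, so it comes
    # out as a non-float tensor...
    seq_ids = torch.arange(seq_len)
    causal_mask = seq_ids[None, None, :].repeat(batch, seq_len, 1) <= seq_ids[None, :, None]

    # ...and on PyTorch < 1.3 multiplying it with the float attention mask raises.
    # Casting the causal mask to the attention mask's dtype keeps the operand types aligned.
    causal_mask = causal_mask.to(attention_mask.dtype)
    extended_attention_mask = causal_mask[:, None, :, :] * attention_mask[:, None, None, :]
    ```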
- 07 Feb, 2020 1 commit
  - monologg authored
- 04 Feb, 2020 1 commit
  - Lysandre authored
- 03 Feb, 2020 1 commit
  - Lysandre authored
    Masked indices should have -100 and not -1. Updating documentation + scripts that were forgotten.
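    For reference, a toy sketch of the -100 convention this documentation update points to (PyTorch's cross-entropy loss ignores label -100 by default):

    ```python
    import torch
    import torch.nn.functional as F

    vocab_size = 10
    logits = torch.randn(1, 4, vocab_size)       # (batch, seq_len, vocab)
    labels = torch.tensor([[3, -100, 7, -100]])  # only positions 0 and 2 contribute to the loss

    loss = F.cross_entropy(logits.view(-1, vocab_size), labels.view(-1))  # ignore_index defaults to -100
    ```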
- 28 Jan, 2020 1 commit
  - Wietse de Vries authored
- 23 Jan, 2020 7 commits
- 15 Jan, 2020 1 commit
  - Julien Chaumond authored
- 14 Jan, 2020 1 commit
  - Lysandre authored
    Created a link between the linear layer bias and the model attribute bias. This does not change anything for the user nor for the conversion scripts, but allows the `resize_token_embeddings` method to resize the bias as well as the weights of the decoder. Added a test.
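    A minimal sketch of the linking pattern described there, using hypothetical class and attribute names:

    ```python
    import torch
    import torch.nn as nn


    class LMHead(nn.Module):
        """Toy output head: the decoder's bias and the head's `bias` attribute are one Parameter."""

        def __init__(self, hidden_size: int, vocab_size: int):
            super().__init__()
            self.decoder = nn.Linear(hidden_size, vocab_size, bias=False)
            self.bias = nn.Parameter(torch.zeros(vocab_size))
            # Link: point the Linear's bias at the same Parameter object, not a copy,
            # so whatever resizes one (e.g. a resize_token_embeddings-style helper)
            # keeps the other consistent.
            self.decoder.bias = self.bias

        def forward(self, hidden_states):
            return self.decoder(hidden_states)
    ```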
- 07 Jan, 2020 2 commits
  - Romain Keramitas authored
    Signed-off-by: Romain Keramitas <r.keramitas@gmail.com>
  - Genta Indra Winata authored
- 06 Jan, 2020 3 commits
  - alberduris authored
  - alberduris authored
  - Lysandre authored
- 22 Dec, 2019 3 commits
  - Aymeric Augustin authored
  - Aymeric Augustin authored
  - Aymeric Augustin authored
    This prevents transformers from being importable simply because the CWD is the root of the git repository, while not being importable from other directories. That led to inconsistent behavior, especially in examples. Once you fetch this commit, in your dev environment, you must run:
    $ pip uninstall transformers
    $ pip install -e .