Commits · a75c64d80c76c3dc71f735d9197a4a601847e0cd · chenpangpang / transformers

26 Aug, 2020 1 commit
- Black 20 release · a75c64d8
  Lysandre authored Aug 26, 2020
  
  a75c64d8
26 Jun, 2020 1 commit

[tokenizers] Updates data processors, docstring, examples and model cards to the new API (#5308) · 601d4d69

Thomas Wolf authored Jun 26, 2020

* remove references to old API in docstring - update data processors

* style

* fix tests - better type checking error messages

* better type checking

* include awesome fix by @LysandreJik for #5310

* updated doc and examples

601d4d69

02 Jun, 2020 1 commit

Kill model archive maps (#4636) · d4c2cb40

Julien Chaumond authored Jun 02, 2020

* Kill model archive maps

* Fixup

* Also kill model_archive_map for MaskedBertPreTrainedModel

* Unhook config_archive_map

* Tokenizers: align with model id changes

* make style && make quality

* Fix CI

d4c2cb40

01 Jun, 2020 3 commits
- weird import · 9d7d9b3a
  Victor SANH authored May 29, 2020
  
  9d7d9b3a
- commplying with isort · 5c8e5b37
  Victor SANH authored May 28, 2020
  
  5c8e5b37
- add sparsity modules · e4c07faf
  Victor SANH authored May 27, 2020
  
  e4c07faf
29 May, 2020 2 commits

Fix BERT example code for NSP and Multiple Choice (#3953) · e2230ba7
Simon Böhm authored May 29, 2020
```
Change the example code to use encode_plus since the token_type_id
wasn't being correctly set.
```
e2230ba7

[Longformer] Multiple choice for longformer (#4645) · 9c172564

Patrick von Platen authored May 29, 2020

* add multiple choice for longformer

* add models to docs

* adapt docstring

* add test to longformer

* add longformer for mc in init and modeling auto

* fix tests

9c172564

19 May, 2020 1 commit

Fix nn.DataParallel compatibility in PyTorch 1.5 (#4300) · 4c068936

Julien Chaumond authored May 18, 2020

* Test case for #3936

* multigpu tests pass on pytorch 1.4.0

* Fixup

* multigpu tests pass on pytorch 1.5.0

* Update src/transformers/modeling_utils.py

* Update src/transformers/modeling_utils.py

* rename multigpu to require_multigpu

* mode doc

4c068936

30 Apr, 2020 1 commit
- Fixed Style Inconsistency (#3976) · 7f9193ef
  Jordan authored Apr 30, 2020
  
  7f9193ef
29 Apr, 2020 1 commit

CDN urls (#4030) · 455c6390

Julien Chaumond authored Apr 28, 2020

* [file_utils] use_cdn + documentation

* Move to cdn. urls for weights

* [urls] Hotfix for bert-base-japanese

455c6390

28 Apr, 2020 1 commit

Clean Encoder-Decoder models with Bart/T5-like API and add generate possibility (#3383) · fa49b9af

Patrick von Platen authored Apr 28, 2020

* change encoder decoder style to bart & t5 style

* make encoder decoder generation dummy work for bert

* make style

* clean init config in encoder decoder

* add tests for encoder decoder models

* refactor and add last tests

* refactor and add last tests

* fix attn masks for bert encoder decoder

* make style

* refactor prepare inputs for Bert

* refactor

* finish encoder decoder

* correct typo

* add docstring to config

* finish

* add tests

* better naming

* make style

* fix flake8

* clean docstring

* make style

* rename

fa49b9af

23 Apr, 2020 1 commit
- [housekeeping] super() · 7c2a32ff
  Julien Chaumond authored Apr 23, 2020
  
  7c2a32ff
21 Apr, 2020 1 commit
- Fix Documentation issue in BertForMaskedLM forward (#3855) · 7d40901c
  Bharat Raghunathan authored Apr 21, 2020
  
  7d40901c
17 Apr, 2020 1 commit

Fix token_type_id in BERT question-answering example (#3790) · edf0582c

Simon Böhm authored Apr 17, 2020

token_type_id is converted into the segment embedding. For question answering,
this needs to highlight whether a token belongs to sequence 0 or 1.
encode_plus takes care of correctly setting this parameter automatically.

edf0582c

16 Apr, 2020 2 commits
- change pad token id to config pad token id (#3793) · a5b24947
  Patrick von Platen authored Apr 16, 2020
  
  a5b24947
- [cleanup] factor out get_head_mask, invert_attn_mask, get_exten… (#3806) · dbd04124
  Sam Shleifer authored Apr 16, 2020
```
* Delete some copy pasted code
```
  dbd04124
03 Apr, 2020 1 commit

ELECTRA (#3257) · d5d7d886

Lysandre Debut authored Apr 03, 2020

* Electra wip

* helpers

* Electra wip

* Electra v1

* ELECTRA may be saved/loaded

* Generator & Discriminator

* Embedding size instead of halving the hidden size

* ELECTRA Tokenizer

* Revert BERT helpers

* ELECTRA Conversion script

* Archive maps

* PyTorch tests

* Start fixing tests

* Tests pass

* Same configuration for both models

* Compatible with base + large

* Simplification + weight tying

* Archives

* Auto + Renaming to standard names

* ELECTRA is uncased

* Tests

* Slight API changes

* Update tests

* wip

* ElectraForTokenClassification

* temp

* Simpler arch + tests

Removed ElectraForPreTraining which will be in a script

* Conversion script

* Auto model

* Update links to S3

* Split ElectraForPreTraining and ElectraForTokenClassification

* Actually test PreTraining model

* Remove num_labels from configuration

* wip

* wip

* From discriminator and generator to electra

* Slight API changes

* Better naming

* TensorFlow ELECTRA tests

* Accurate conversion script

* Added to conversion script

* Fast ELECTRA tokenizer

* Style

* Add ELECTRA to README

* Modeling Pytorch Doc + Real style

* TF Docs

* Docs

* Correct links

* Correct model intialized

* random fixes

* style

* Addressing Patrick's and Sam's comments

* Correct links in docs

d5d7d886

01 Apr, 2020 1 commit
- Correct output shape for Bert NSP models in docs (#3482) · 9de9ceb6
  Anirudh Srinivasan authored Apr 02, 2020
  
  9de9ceb6
25 Feb, 2020 2 commits

Documentation (#2989) · bb7c4685

Lysandre Debut authored Feb 25, 2020

* All Tokenizers

BertTokenizer + few fixes
RobertaTokenizer
OpenAIGPTTokenizer + Fixes
GPT2Tokenizer + fixes
TransfoXLTokenizer
Correct rst for TransformerXL
XLMTokenizer + fixes
XLNet Tokenizer + Style
DistilBERT + Fix XLNet RST
CTRLTokenizer
CamemBERT Tokenizer
FlaubertTokenizer
XLMRobertaTokenizer
cleanup

* cleanup

bb7c4685

Change masking to direct labeling for TPU support. (#2982) · e8ce63ff
srush authored Feb 25, 2020
```
* change masking to direct labelings

* fix black

* switch to ignore index

* .

* fix black
```
e8ce63ff

21 Feb, 2020 1 commit
- Remove double bias (#2958) · 94ff2d6e
  Lysandre Debut authored Feb 21, 2020
  
  94ff2d6e
13 Feb, 2020 1 commit
- get_activation('relu') provides a simple mapping from strings i… (#2807) · ef74b0f0
  Sam Shleifer authored Feb 13, 2020
```
* activations.py contains a mapping from string to activation function
* resolves some `gelu` vs `gelu_new` ambiguity
```
  ef74b0f0
11 Feb, 2020 1 commit

BERT decoder: Fix causal mask dtype. · ee5de0ba

Oleksiy Syvokon authored Feb 06, 2020

PyTorch < 1.3 requires multiplication operands to be of the same type.
This was violated when using default attention mask (i.e.,
attention_mask=None in arguments) given BERT in the decoder mode.

In particular, this was breaking Model2Model and made tutorial
from the quickstart failing.

ee5de0ba

07 Feb, 2020 1 commit
- Fix importing unofficial TF models with extra optimizer weights · 73368963
  monologg authored Jan 27, 2020
  
  73368963
04 Feb, 2020 1 commit
- Revert erroneous fix · 3bf54172
  Lysandre authored Feb 04, 2020
  
  3bf54172
03 Feb, 2020 1 commit

[Follow up 213] · 239dd23f

Lysandre authored Feb 03, 2020

Masked indices should have -1 and not -100. Updating documentation + scripts that were forgotten

239dd23f

28 Jan, 2020 1 commit
- Add Dutch pre-trained BERT model · f5a236c3
  Wietse de Vries authored Dec 19, 2019
  
  f5a236c3
23 Jan, 2020 7 commits
- Run the examples in slow · 24d5ad1d
  Lysandre authored Jan 22, 2020
  
  24d5ad1d
- Tips + whitespaces · 9ddf60b6
  Lysandre authored Jan 21, 2020
  
  9ddf60b6
- Fixes · 0e9899f4
  Lysandre authored Jan 20, 2020
  
  0e9899f4
- PyTorch CTRL + Style · 7511f3dd
  Lysandre authored Jan 20, 2020
  
  7511f3dd
- XLM-RoBERTa · 980211a6
  Lysandre authored Jan 20, 2020
  
  980211a6
- Pytorch RoBERTa · 3e1bc27e
  Lysandre authored Jan 20, 2020
  
  3e1bc27e
- BERT PyTorch models · cd77c750
  Lysandre authored Jan 16, 2020
  
  cd77c750
15 Jan, 2020 1 commit
- 💄 super · 83a41d39
  Julien Chaumond authored Jan 15, 2020
  
  83a41d39
14 Jan, 2020 1 commit

Bias should be resized with the weights · 100e3b6f

Lysandre authored Jan 14, 2020

Created a link between the linear layer bias and the model attribute bias. This does not change anything for the user nor for the conversion scripts, but allows the `resize_token_embeddings` method to resize the bias as well as the weights of the decoder.

Added a test.

100e3b6f

07 Jan, 2020 2 commits
- Make doc regarding masked indices more clear. · 569da80c
  Romain Keramitas authored Jan 07, 2020
```
Signed-off-by: Romain Keramitas <r.keramitas@gmail.com>
```
  569da80c
- Fix typograpical errors (#2438) · d6a677b1
  Genta Indra Winata authored Jan 08, 2020
  
  d6a677b1
06 Jan, 2020 1 commit
- GPU text generation: mMoved the encoded_prompt to correct device · 81d6841b
  alberduris authored Dec 31, 2019
  
  81d6841b