Commits · 64c393ee74a2294d1608dc327a997683b4ea689e · chenpangpang / transformers

24 Jun, 2020 1 commit
- [Benchmark] Extend Benchmark to all model type extensions (#5241) · 9fe09cec
  Patrick von Platen authored Jun 24, 2020
```
* add benchmark for all kinds of models

* improved import

* delete bogus files

* make style
```
  9fe09cec
23 Jun, 2020 4 commits

[bart] add config.extra_pos_embeddings to facilitate reuse (#5190) · 58918c76
Sam Shleifer authored Jun 23, 2020

58918c76

Tokenizers API developments (#5103) · 11fdde02

Thomas Wolf authored Jun 23, 2020



* Add return lengths

* make pad a bit more flexible so it can be used as collate_fn

* check all kwargs sent to encoding method are known

* fixing kwargs in encodings

* New AddedToken class in python

This class let you specify specifique tokenization behaviors for some special tokens. Used in particular for GPT2 and Roberta, to control how white spaces are stripped around special tokens.

* style and quality

* switched to hugginface tokenizers library for AddedTokens

* up to tokenizer 0.8.0-rc3 - update API to use AddedToken state

* style and quality

* do not raise an error on additional or unused kwargs for tokenize() but only a warning

* transfo-xl pretrained model requires torch

* Update src/transformers/tokenization_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

11fdde02

[fix] remove unused import (#5206) · 51441040
Sam Shleifer authored Jun 22, 2020

51441040
[fix] mobilebert had wrong path, causing slow test failure (#5205) · 0d158e38
Sam Shleifer authored Jun 22, 2020

0d158e38

22 Jun, 2020 4 commits

[tokenizers] Fix #5081 and improve backward compatibility (#5125) · ebc36108

Thomas Wolf authored Jun 22, 2020

* fix #5081 and improve backward compatibility (slightly)

* add nlp to setup.cfg - style and quality

* align default to previous default

* remove test that doesn't generalize

ebc36108

Output hidden states (#4978) · f4e1f022

Joseph Liu authored Jun 22, 2020



* Configure all models to use output_hidden_states as argument passed to foward()

* Pass all tests

* Remove cast_bool_to_primitive in TF Flaubert model

* correct tf xlnet

* add pytorch test

* add tf test

* Fix broken tests

* Configure all models to use output_hidden_states as argument passed to foward()

* Pass all tests

* Remove cast_bool_to_primitive in TF Flaubert model

* correct tf xlnet

* add pytorch test

* add tf test

* Fix broken tests

* Refactor output_hidden_states for mobilebert

* Reset and remerge to master
Co-authored-by: Joseph Liu <joseph.liu@coinflex.com>
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

f4e1f022

Added feature to move added tokens in vocabulary for Transformer-XL (#4953) · b99ad457

RafaelWO authored Jun 22, 2020



* Fixed resize_token_embeddings for transfo_xl model

* Fixed resize_token_embeddings for transfo_xl.

Added custom methods to TransfoXLPreTrainedModel for resizing layers of
the AdaptiveEmbedding.

* Updated docstring

* Fixed resizinhg cutoffs; added check for new size of embedding layer.

* Added test for resize_token_embeddings

* Fixed code quality

* Fixed unchanged cutoffs in model.config

* Added feature to move added tokens in tokenizer.

* Fixed code quality

* Added feature to move added tokens in tokenizer.

* Fixed code quality

* Fixed docstring, renamed sym to 	oken.
Co-authored-by: Rafael Weingartner <rweingartner.its-b2015@fh-salzburg.ac.at>

b99ad457

Benchmarks (#4912) · fa0be6d7

Patrick von Platen authored Jun 22, 2020

* finish benchmark

* fix isort

* fix setup cfg

* retab

* fix time measuring of tf graph mode

* fix tf cuda

* clean code

* better error message

fa0be6d7

19 Jun, 2020 2 commits

Add MobileBert (#4901) · 9a3f9108

Vasily Shamporov authored Jun 19, 2020



* Add MobileBert

* Quality + Conversion script

* style

* Update src/transformers/modeling_mobilebert.py

* Links to S3

* Style

* TFMobileBert

Slight fixes to the pytorch MobileBert
Style

* MobileBertForMaskedLM (PT + TF)

* MobileBertForNextSentencePrediction (PT + TF)

* MobileFor{MultipleChoice, TokenClassification} (PT + TF)


ss

* Tests + Auto

* Doc

* Tests

* Addressing @sgugger's comments

* Adressing @patrickvonplaten's comments

* Style

* Style

* Integration test

* style

* Model card
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

9a3f9108

AutoTokenizer supports mbart-large-en-ro (#5121) · 84be482f
Sam Shleifer authored Jun 18, 2020

84be482f

18 Jun, 2020 3 commits

Fix #5114 (#5122) · 5f721ad6
Sylvain Gugger authored Jun 18, 2020

5f721ad6

tf add resize_token_embeddings method (#4351) · 32e94cff

Deniz authored Jun 18, 2020



* resize token embeddings

* add tokens

* add tokens

* add tokens

* add t5 token method

* add t5 token method

* add t5 token method

* typo

* debugging input

* debugging input

* debug

* debug

* debug

* trying to set embedding tokens properly

* set embeddings for generation head too

* set embeddings for generation head too

* debugging

* debugging

* enable generation

* add base method

* add base method

* add base method

* return logits in the main call

* reverting to generation

* revert back

* set embeddings for the bert main layer

* description

* fix conflicts

* logging

* set base model as self

* refactor

* tf_bert add method

* tf_bert add method

* tf_bert add method

* tf_bert add method

* tf_bert add method

* tf_bert add method

* tf_bert add method

* tf_bert add method

* v0

* v0

* finalize

* final

* black

* add tests

* revert back the emb call

* comments

* comments

* add the second test

* add vocab size condig

* add tf models

* add tf models. add common tests

* remove model specific embedding tests

* stylish

* remove files

* stylez

* Update src/transformers/modeling_tf_transfo_xl.py

change the error.
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* adding unchanged weight test
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

32e94cff

ElectraForMultipleChoice (#4954) · ca2d0f98

Suraj Patil authored Jun 19, 2020



* add ElectraForMultipleChoice

* add  test_for_multiple_choice

* add ElectraForMultipleChoice in auto model

* add ElectraForMultipleChoice in all_model_classes

* add SequenceSummary related parameters

* get rid pooler, use SequenceSummary instead

* add electra multiple choice test
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

ca2d0f98

17 Jun, 2020 1 commit
- Make default_data_collator more flexible and deprecate old behavior (#5060) · 20fa8289
  Sylvain Gugger authored Jun 17, 2020
```
* Make default_data_collator more flexible

* Accept tensors for all features

* Document code

* Refactor

* Formatting
```
  20fa8289
16 Jun, 2020 3 commits

Fix marian tokenizer save pretrained (#5043) · 3d495c61
Sam Shleifer authored Jun 16, 2020

3d495c61
[cleanup] Hoist ModelTester objects to top level (#4939) · c852036b
Amil Khare authored Jun 16, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
c852036b

Ability to pickle/unpickle BatchEncoding pickle (reimport) (#5039) · 9e033649

Funtowicz Morgan authored Jun 16, 2020

* Added is_fast property on BatchEncoding to indicate if the object comes from a Fast Tokenizer.

* Added __get_state__() & __set_state__() to be pickable.

* Correct tokens() return type from List[int] to List[str]

* Added unittest for BatchEncoding pickle/unpickle

* Added unittest for BatchEncoding is_fast

* More careful checking on BatchEncoding unpickle tests.

* Formatting.

* is_fast should assertTrue on Rust tokenizers.

* Ensure tensorflow has correct way of checking array_equal

* More formatting.

9e033649

15 Jun, 2020 5 commits

Add DistilBertForMultipleChoice (#5032) · f9f8a531
Sylvain Gugger authored Jun 15, 2020
```
* Add `DistilBertForMultipleChoice`
```
f9f8a531

[HUGE] Refactoring tokenizers backend - padding - truncation - pre-tokenized... · 36434220

Anthony MOI authored Jun 15, 2020


[HUGE] Refactoring tokenizers backend - padding - truncation - pre-tokenized pipeline - fast tokenizers - tests (#4510)

* Use tokenizers pre-tokenized pipeline

* failing pretrokenized test

* Fix is_pretokenized in python

* add pretokenized tests

* style and quality

* better tests for batched pretokenized inputs

* tokenizers clean up - new padding_strategy - split the files

* [HUGE] refactoring tokenizers - padding - truncation - tests

* style and quality

* bump up requied tokenizers version to 0.8.0-rc1

* switched padding/truncation API - simpler better backward compat

* updating tests for custom tokenizers

* style and quality - tests on pad

* fix QA pipeline

* fix backward compatibility for max_length only

* style and quality

* Various cleans up - add verbose

* fix tests

* update docstrings

* Fix tests

* Docs reformatted

* __call__ method documented
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

36434220

[Bart] Question Answering Model is added to tests (#5024) · ebba39e4
Patrick von Platen authored Jun 15, 2020
```
* fix test

* Update tests/test_modeling_common.py

* Update tests/test_modeling_common.py
```
ebba39e4
Add bart-base (#5014) · a9f1fc6c
Sam Shleifer authored Jun 15, 2020

a9f1fc6c

Make DataCollator a callable (#5015) · 1affde2f

Sylvain Gugger authored Jun 15, 2020



* Make DataCollator a callable

* Update src/transformers/data/data_collator.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

1affde2f

12 Jun, 2020 4 commits
- BartForQuestionAnswering (#4908) · e93ccb32
  Suraj Patil authored Jun 13, 2020
  
  e93ccb32
- Add AlbertForMultipleChoice (#4959) · 538531cd
  Sylvain Gugger authored Jun 12, 2020
```
* Add AlbertForMultipleChoice

* Make up to date and add all models to common tests
```
  538531cd
- [AutoModel] Split AutoModelWithLMHead into clm, mlm, encoder-decoder (#4933) · 86578bb0
  Patrick von Platen authored Jun 12, 2020
```
* first commit

* add new auto models

* better naming

* fix bert automodel

* fix automodel for pretraining

* add models to init

* fix name typo

* fix typo

* better naming

* future warning instead of depreciation warning
```
  86578bb0
- [mbart] Fix fp16 testing logic (#4949) · 56200331
  Sam Shleifer authored Jun 11, 2020
  
  56200331
11 Jun, 2020 2 commits
- MBartTokenizer:add language codes (#3776) · 08b59d10
  Sam Shleifer authored Jun 11, 2020
  
  08b59d10
- Support multiple choice in tf common model tests (#4920) · 20451195
  Sylvain Gugger authored Jun 11, 2020
```
* Support multiple choice in tf common model tests

* Add the input_embeds test
```
  20451195
10 Jun, 2020 8 commits

Fix resize_token_embeddings for Transformer-XL (#4759) · e80d6c68

RafaelWO authored Jun 11, 2020



* Fixed resize_token_embeddings for transfo_xl model

* Fixed resize_token_embeddings for transfo_xl.

Added custom methods to TransfoXLPreTrainedModel for resizing layers of
the AdaptiveEmbedding.

* Updated docstring

* Fixed resizinhg cutoffs; added check for new size of embedding layer.

* Added test for resize_token_embeddings

* Fixed code quality

* Fixed unchanged cutoffs in model.config
Co-authored-by: Rafael Weingartner <rweingartner.its-b2015@fh-salzburg.ac.at>

e80d6c68

Make multiple choice models work with input_embeds (#4921) · d541938c
Sylvain Gugger authored Jun 10, 2020

d541938c

Split LMBert model in two (#4874) · 1e2631d6

Sylvain Gugger authored Jun 10, 2020

* Split LMBert model in two

* Fix example

* Remove lm_labels

* Adapt tests, refactor prepare_for_generation

* Fix merge

* Hide BeartLMHeadModel

1e2631d6

ElectraForQuestionAnswering (#4913) · ef2dcdcc

Suraj Patil authored Jun 11, 2020

* ElectraForQuestionAnswering

* udate __init__

* add test for electra qa model

* add ElectraForQuestionAnswering in auto models

* add ElectraForQuestionAnswering in all_model_classes

* fix outputs, input_ids defaults to None

* add ElectraForQuestionAnswering in docs

* remove commented line

ef2dcdcc

[ctrl] fix pruning of MultiHeadAttention (#4904) · 5d63ca6c
Amil Khare authored Jun 10, 2020

5d63ca6c
Add more models to common tests (#4910) · 4e10acb3
Sylvain Gugger authored Jun 10, 2020

4e10acb3
Fix the CI (#4903) · ac99217e
Sylvain Gugger authored Jun 10, 2020
```
* Fix CI
```
ac99217e
Deal with multiple choice in common tests (#4886) · 0a375f5a
Sylvain Gugger authored Jun 10, 2020
```
* Deal with multiple choice in common tests
```
0a375f5a

09 Jun, 2020 2 commits

[All models] Extend config.output_attentions with output_attentions function arguments (#4538) · 6e603cb7

Bharat Raghunathan authored Jun 10, 2020



* DOC: Replace instances of ``config.output_attentions`` with function argument ``output_attentions``

* DOC: Apply Black Formatting

* Fix errors where output_attentions was undefined

* Remove output_attentions in classes per review

* Fix regressions on tests having `output_attention`

* Fix further regressions in tests relating to `output_attentions`

Ensure proper propagation of `output_attentions` as a function parameter
to all model subclasses

* Fix more regressions in `test_output_attentions`

* Fix issues with BertEncoder

* Rename related variables to `output_attentions`

* fix pytorch tests

* fix bert and gpt2 tf

* Fix most TF tests for `test_output_attentions`

* Fix linter errors and more TF tests

* fix conflicts

* DOC: Apply Black Formatting

* Fix errors where output_attentions was undefined

* Remove output_attentions in classes per review

* Fix regressions on tests having `output_attention`

* fix conflicts

* fix conflicts

* fix conflicts

* fix conflicts

* fix pytorch tests

* fix conflicts

* fix conflicts

* Fix linter errors and more TF tests

* fix tf tests

* make style

* fix isort

* improve output_attentions

* improve tensorflow
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

6e603cb7

[Benchmark] add tpu and torchscipt for benchmark (#4850) · 2cfb947f

Patrick von Platen authored Jun 09, 2020



* add tpu and torchscipt for benchmark

* fix name in tests

* "fix email"

* make style

* better log message for tpu

* add more print and info for tpu

* allow possibility to print tpu metrics

* correct cpu usage

* fix test for non-install

* remove bugus file

* include psutil in testing

* run a couple of times before tracing in torchscript

* do not allow tpu memory tracing for now

* make style

* add torchscript to env

* better name for torch tpu
Co-authored-by: Patrick von Platen <patrick@huggingface.co>

2cfb947f

08 Jun, 2020 1 commit
- fix PR (#4810) · c0554776
  Patrick von Platen authored Jun 08, 2020
  
  c0554776