- 16 Oct, 2020 4 commits
-
-
Stas Bekman authored
-
Sam Shleifer authored
* 2 beam output * unassign/remove TODOs * remove one more
-
rmroczkowski authored
* HerBERT transformer model for Polish language understanding. * HerbertTokenizerFast generated with HerbertConverter * Herbert base and large model cards * Herbert model cards with tags * Herbert tensorflow models * Herbert model tests based on Bert test suite * src/transformers/tokenization_herbert.py edited online with Bitbucket * src/transformers/tokenization_herbert.py edited online with Bitbucket * docs/source/model_doc/herbert.rst edited online with Bitbucket * Herbert tokenizer tests and bug fixes * src/transformers/configuration_herbert.py edited online with Bitbucket * Copyrights and tests for TFHerbertModel * model_cards/allegro/herbert-base-cased/README.md edited online with Bitbucket * model_cards/allegro/herbert-large-cased/README.md edited online with Bitbucket * Bug fixes after testing * Reformat modified_only_fixup * Proper order of configuration * Herbert proper documentation formatting * Formatting with make modified_only_fixup * Dummies fixed * Adding missing models to documentation * Removing HerBERT model as it is a simple extension of BERT * Update model_cards/allegro/herbert-base-cased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> * Update model_cards/allegro/herbert-large-cased/README.md Co-authored-by: Julien Chaumond <chaumond@gmail.com> * HerbertTokenizer deprecated configuration removed Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Lysandre Debut authored
-
- 15 Oct, 2020 1 commit
-
-
Nicolas Patry authored
* Improving Pipelines by defaulting to framework='tf' when PyTorch seems unavailable. * Actually changing the default resolution order to account for model defaults. Adds a new test for each pipeline to check that pipeline(task) also works without manually specifying the framework.
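A minimal sketch of the resulting behavior; the task and input string are illustrative:

```python
from transformers import pipeline

# With no framework argument, the pipeline now resolves "pt" vs "tf" from
# what is installed and from the task's default model, instead of always
# assuming PyTorch is present.
nlp = pipeline("sentiment-analysis")
print(nlp("Framework resolution is now automatic."))
```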
-
- 14 Oct, 2020 2 commits
-
-
Sylvain Gugger authored
* Add eval_accumulation_step and clean distributed eval * Add TPU test * Add TPU stuff * Fix arg name * Fix Seq2SeqTrainer * Fix total_size * Update src/transformers/trainer_pt_utils.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Doc and add test to TPU * Add unit test * Adapt name Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
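A minimal sketch of how this is used; the argument ships under the plural spelling `eval_accumulation_steps`, and the output directory and values here are illustrative:

```python
from transformers import TrainingArguments

# Every N evaluation steps, accumulated predictions are moved from the
# device to the CPU, so distributed/TPU evaluation on large datasets does
# not exhaust accelerator memory.
args = TrainingArguments(
    output_dir="out",
    per_device_eval_batch_size=8,
    eval_accumulation_steps=10,  # offload predictions every 10 eval steps
)
```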
-
Jonathan Chang authored
* Add support for gpt2 batch inferencing * add test * remove typo Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
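A sketch of batch inference with GPT-2 after this change; left padding plus an explicit attention mask is what makes batched generation line up (the prompts are illustrative):

```python
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
tokenizer.padding_side = "left"            # pad on the left so generation continues each prompt
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 ships without a pad token

model = GPT2LMHeadModel.from_pretrained("gpt2", pad_token_id=tokenizer.eos_token_id)

batch = tokenizer(["Hello, my name is", "The weather today is"],
                  return_tensors="pt", padding=True)
out = model.generate(batch["input_ids"], attention_mask=batch["attention_mask"], max_length=20)
print(tokenizer.batch_decode(out, skip_special_tokens=True))
```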
-
- 13 Oct, 2020 4 commits
-
-
Sylvain Gugger authored
-
Sam Shleifer authored
-
Patrick von Platen authored
* fix rag * Update tokenizer save_pretrained Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
-
Felipe Curti authored
* Add Documentation for GPT-1 Classification * Add GPT-1 with Classification head * Add tests for GPT-1 Classification * Add GPT-1 For Classification to auto models * Remove authorized missing keys, change checkpoint to openai-gpt
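A sketch of the new head; note the classification layer on top of the openai-gpt checkpoint is freshly initialized, so real use requires fine-tuning:

```python
from transformers import OpenAIGPTTokenizer, OpenAIGPTForSequenceClassification

tokenizer = OpenAIGPTTokenizer.from_pretrained("openai-gpt")
model = OpenAIGPTForSequenceClassification.from_pretrained("openai-gpt", num_labels=2)

inputs = tokenizer("a sentence to classify", return_tensors="pt")
logits = model(**inputs, return_dict=True).logits  # shape: (batch_size, num_labels)
```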
-
- 10 Oct, 2020 1 commit
-
-
Sylvain Gugger authored
-
- 09 Oct, 2020 2 commits
-
-
Stas Bekman authored
-
Funtowicz Morgan authored
* Reintroduce clean_text call which was removed by mistake in #4723 Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Added unittest for clean_text parameter on Bert tokenizer. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Better unittest name. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Adapt unittest to use untrained tokenizer. Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com> * Code quality + update test Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
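A sketch of the flag this restores, assuming the fast Bert tokenizer's `clean_text` argument (it controls stripping of control characters and whitespace normalization before tokenization):

```python
from transformers import BertTokenizerFast

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased", clean_text=True)
# The NUL control character is stripped by the normalizer before wordpiece runs.
print(tokenizer.tokenize("Hello\x00 world"))  # ['hello', 'world']
```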
-
- 08 Oct, 2020 2 commits
-
-
Thomas Wolf authored
Adding Fast tokenizers for SentencePiece based tokenizers - Breaking: remove Transfo-XL fast tokenizer (#7141) * [WIP] SP tokenizers * fixing tests for T5 * WIP tokenizers * serialization * update T5 * WIP T5 tokenization * slow to fast conversion script * Refactoring to move tokenizer implementations inside transformers * Adding gpt - refactoring - quality * WIP adding several tokenizers to the fast world * WIP Roberta - moving implementations * update to dev4 switch file loading to in-memory loading * Updating and fixing * advancing on the tokenizers - updating do_lower_case * style and quality * moving forward with tokenizers conversion and tests * MBart, T5 * dumping the fast version of transformer XL * Adding to autotokenizers + style/quality * update init and space_between_special_tokens * style and quality * bump up tokenizers version * add protobuf * fix pickle Bert JP with Mecab * fix newly added tokenizers * style and quality * fix bert japanese * fix funnel * limit tokenizer warning to one occurrence * clean up file * fix new tokenizers * fast tokenizers deep tests * WIP adding all the special fast tests on the new fast tokenizers * quick fix * adding more fast tokenizers in the fast tests * all tokenizers in fast version tested * Adding BertGenerationFast * bump up setup.py for CI * remove BertGenerationFast (too early) * bump up tokenizers version * Clean old docstrings * Typo * Update following Lysandre comments Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
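A sketch of what the conversion buys: loading a SentencePiece model now yields a Rust-backed fast tokenizer with features like offset mappings (the model name is illustrative):

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("t5-small", use_fast=True)
print(type(tok).__name__)  # T5TokenizerFast, converted from the slow SP tokenizer

# Offset mappings are only available on fast tokenizers.
enc = tok("Fast tokenization for T5", return_offsets_mapping=True)
print(enc["offset_mapping"])
```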
-
Sam Shleifer authored
-
- 07 Oct, 2020 2 commits
-
-
Sam Shleifer authored
Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
* Initial callback proposal * Finish various callbacks * Post-rebase conflicts * Fix tests * Don't use something that's not set * Documentation * Remove unwanted print. * Document all models can work * Add tests + small fixes * Update docs/source/internal/trainer_utils.rst Co-authored-by: Lysandre Debut <lysandre@huggingface.co> * Address review comments * Fix TF tests * Real fix this time * This one should work * Fix typo * Really fix typo Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
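A minimal custom callback under the new API; the hook signature follows TrainerCallback, and the print format is illustrative:

```python
from transformers import TrainerCallback

class PrintLossCallback(TrainerCallback):
    """Report the training loss every time the Trainer logs metrics."""

    def on_log(self, args, state, control, logs=None, **kwargs):
        if logs and "loss" in logs:
            print(f"step {state.global_step}: loss = {logs['loss']:.4f}")

# Attach it when building the trainer:
# trainer = Trainer(model=model, args=args, callbacks=[PrintLossCallback()])
```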
-
- 06 Oct, 2020 1 commit
-
-
Lysandre Debut authored
* Add GPT2ForSequenceClassification based on DialogRPT * Better documentation * Code quality
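A sketch using the DialogRPT checkpoint the commit mentions; its single logit is a human-feedback score:

```python
from transformers import GPT2Tokenizer, GPT2ForSequenceClassification

tokenizer = GPT2Tokenizer.from_pretrained("microsoft/DialogRPT-updown")
model = GPT2ForSequenceClassification.from_pretrained("microsoft/DialogRPT-updown")

inputs = tokenizer("I love this movie!", return_tensors="pt")
score = model(**inputs, return_dict=True).logits  # one regression logit per input
```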
-
- 05 Oct, 2020 3 commits
-
-
Julien Plu authored
* First try * Fix TF utils * Handle authorized unexpected keys when loading weights * Add several more authorized unexpected keys * Apply style * Fix test * Address Patrick's comments. * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_tf_utils.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply style * Make return_dict the default behavior and display a warning message * Revert * Replace wrong keyword * Revert code * Add forgot key * Fix bug in loading PT models from a TF one. * Fix sort * Add a test for custom load weights in BERT * Apply style * Remove unused import Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
-
Forrest Iandola authored
* configuration_squeezebert.py * thin wrapper around bert tokenizer * fix typos * wip sb model code * wip modeling_squeezebert.py. Next step is to get the multi-layer-output interface working * set up squeezebert to use BertModelOutput when returning results. * squeezebert documentation * formatting * allow head mask that is an array of [None, ..., None] * docs * docs cont'd * path to vocab * docs and pointers to cloud files (WIP) * line length and indentation * squeezebert model cards * formatting of model cards * untrack modeling_squeezebert_scratchpad.py * update aws paths to vocab and config files * get rid of stub of NSP code, and advise users to pretrain with mlm only * fix rebase issues * redo rebase of modeling_auto.py * fix issues with code formatting * more code format auto-fixes * move squeezebert before bert in tokenization_auto.py and modeling_auto.py because squeezebert inherits from bert * tests for squeezebert modeling and tokenization * fix typo * move squeezebert before bert in modeling_auto.py to fix inheritance problem * disable test_head_masking, since squeezebert doesn't yet implement head masking * fix issues exposed by the test_modeling_squeezebert.py * fix an issue exposed by test_tokenization_squeezebert.py * fix issue exposed by test_modeling_squeezebert.py * auto generated code style improvement * issue that we inherited from modeling_xxx.py: SqueezeBertForMaskedLM.forward() calls self.cls(), but there is no self.cls, and I think the goal was actually to call self.lm_head() * update copyright * resolve failing 'test_hidden_states_output' and remove unused encoder_hidden_states and encoder_attention_mask * docs * add integration test. rename squeezebert-mnli --> squeezebert/squeezebert-mnli * autogenerated formatting tweaks * integrate feedback from patrickvonplaten and sgugger to programming style and documentation strings * tiny change to order of imports
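A sketch of loading the renamed checkpoint through the auto classes (the premise/hypothesis strings are illustrative):

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("squeezebert/squeezebert-mnli")
model = AutoModelForSequenceClassification.from_pretrained("squeezebert/squeezebert-mnli")

inputs = tokenizer("A soccer game is underway.", "People are playing a sport.",
                   return_tensors="pt")
logits = model(**inputs, return_dict=True).logits  # MNLI entailment logits
```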
-
- 01 Oct, 2020 3 commits
-
-
Sylvain Gugger authored
* Trainer should not modify its TrainingArguments * Trainer should not modify its TrainingArguments * Trainer should not modify its TrainingArguments * Add test of resumed training * Fixes * Non multiGPU test * Clean Trainer state * Add more to the state * Documentation * One last test * Make resume training test more complete * Unwanted changes
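A sketch of the state bookkeeping this introduces; the checkpoint path is illustrative, and `TrainerState`/`trainer_state.json` are the names the feature ships under in later releases:

```python
from transformers import TrainerState

# Instead of mutating TrainingArguments, the Trainer serializes its progress
# (global step, epoch, log history) next to each checkpoint, which is what
# makes exact resumption of training possible.
state = TrainerState.load_from_json("out/checkpoint-500/trainer_state.json")
print(state.global_step, state.epoch)
```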
-
Patrick von Platen authored
* clean T5 * fix t5 tests * fix index typo * fix tf common test * fix examples * change positional ordering for Bart and FSMT * add signature test * clean docs and add tests * add docs to encoder decoder * clean docs * correct two doc strings * remove sig test for TF Electra & Funnel * fix tf t5 slow tests * fix input_ids to inputs in tf * Update src/transformers/modeling_bart.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/modeling_bart.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * implement lysandre results * make style * fix encoder decoder typo * fix tf slow tests * fix slow tests * renaming * remove unused input Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sam Shleifer authored
* Clean clamp * boom boom * Take some other changes * boom boom * boom boom * boom boom * one chg * fix test * Use finfo * style
-
- 30 Sep, 2020 2 commits
-
-
Pengcheng He authored
* Add DeBERTa model * Remove dependency of deberta * Address comments * Patch DeBERTa Documentation Style * Add final tests * Style * Enable tests + nitpicks * position IDs * BERT -> DeBERTa * Quality * Style * Tokenization * Last updates. * @patrickvonplaten's comments * Not everything can be a copy * Apply most of @sgugger's review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Last reviews * DeBERTa -> Deberta Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
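A minimal usage sketch for the new model (the input sentence is illustrative):

```python
from transformers import DebertaModel, DebertaTokenizer

tokenizer = DebertaTokenizer.from_pretrained("microsoft/deberta-base")
model = DebertaModel.from_pretrained("microsoft/deberta-base")

inputs = tokenizer("DeBERTa uses disentangled attention.", return_tensors="pt")
hidden = model(**inputs, return_dict=True).last_hidden_state
```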
-
Sylvain Gugger authored
* Get a better error when check_copies fails * Fix tests
-
- 29 Sep, 2020 3 commits
-
-
Sylvain Gugger authored
-
Teven authored
* GPT2 gradient checkpointing * find_unused_parameters removed if checkpointing * find_unused_parameters removed if checkpointing * Update src/transformers/configuration_gpt2.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Added a test for generation with checkpointing * Update src/transformers/configuration_gpt2.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
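A sketch of the config flag this adds; later releases expose the same switch as `model.gradient_checkpointing_enable()`:

```python
from transformers import GPT2Config, GPT2LMHeadModel

# Gradient checkpointing trades compute for memory: activations are
# recomputed in the backward pass instead of being kept around, and the
# cache must be off while training with it.
config = GPT2Config.from_pretrained("gpt2", gradient_checkpointing=True, use_cache=False)
model = GPT2LMHeadModel.from_pretrained("gpt2", config=config)
```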
-
Sylvain Gugger authored
* Add automatic best model loading to Trainer * Some small fixes * Formatting
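A sketch of the new arguments; the metric name assumes the trainer reports `eval_loss`, and the output directory is illustrative:

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="out",
    evaluation_strategy="steps",        # there must be eval metrics to compare
    load_best_model_at_end=True,        # reload the best checkpoint when training ends
    metric_for_best_model="eval_loss",
    greater_is_better=False,            # lower loss is better
)
```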
-
- 28 Sep, 2020 2 commits
-
-
Marcin Zabłocki authored
-
Sam Shleifer authored
* Working asymmetrical T5 * rename decoder_layers -> num_decoder_layers * Fix docstring * Allow creation of asymmetric t5 students
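A sketch of the asymmetry this enables, using the renamed num_decoder_layers:

```python
from transformers import T5Config, T5ForConditionalGeneration

# A "student" T5 with a full-depth encoder but a shallow decoder.
config = T5Config(num_layers=12, num_decoder_layers=3)
model = T5ForConditionalGeneration(config)
```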
-
- 25 Sep, 2020 5 commits
-
-
Patrick von Platen authored
* fix multi-gpu * fix longformer * force to delete unnecessary layers * fix notifications * fix warning * fix roberta * fix tests * remove hasattr * fix tests * fix roberta * merge and clean authorized keys
-
Patrick von Platen authored
-
Sylvain Gugger authored
* Fix #7371 * Fix training * Fix test values * Apply the fix to TF as well
-
Quentin Lhoest authored
* Fix retrieval offset in RAG's HfIndex * update slow tests * style * fix new test * style * add better tests Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Sam Shleifer authored
* Mbart passing * boom boom * cleaner assert * add assert * Fix tests
-
- 24 Sep, 2020 2 commits
-
-
Patrick von Platen authored
-
Sylvain Gugger authored
* Check decorator order * Adapt for parametrized decorators * Fix typos
-
- 23 Sep, 2020 1 commit
-
-
Felipe Curti authored
* Changed the names of all no_... arguments and all references to them, inverting the boolean condition * Change benchmark tests to use new Benchmark Args * Update src/transformers/benchmark/benchmark_args_utils.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/transformers/benchmark/benchmark.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Fix Style. Add --no options in help * fix some part of tests * Update src/transformers/benchmark/benchmark_args_utils.py * Update src/transformers/benchmark/benchmark_args_utils.py * Update src/transformers/benchmark/benchmark_args_utils.py * fix all tests * make style * add backwards compatibility * make backwards compatible Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: fmcurti <fcurti@DESKTOP-RRQURBM.localdomain>
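A sketch of the inverted flags: `memory=True`/`speed=True` replace the old `no_memory`/`no_speed` spellings, with the old names kept as deprecated aliases for compatibility (model and sizes are illustrative):

```python
from transformers import PyTorchBenchmark, PyTorchBenchmarkArguments

args = PyTorchBenchmarkArguments(
    models=["gpt2"],
    batch_sizes=[8],
    sequence_lengths=[128],
    memory=True,   # was: no_memory=False
    speed=True,    # was: no_speed=False
)
results = PyTorchBenchmark(args).run()
```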
-