Commits · 901507335f6ed59cad6bbbc2b5d8d9eba8a1b4e1 · chenpangpang / transformers

16 Nov, 2020 3 commits

Switch `return_dict` to `True` by default. (#8530) · 1073a2bd

Sylvain Gugger authored Nov 16, 2020

* Use the CI to identify failing tests

* Remove from all examples and tests

* More default switch

* Fixes

* More test fixes

* More fixes

* Last fixes hopefully

* Use the CI to identify failing tests

* Remove from all examples and tests

* More default switch

* Fixes

* More test fixes

* More fixes

* Last fixes hopefully

* Run on the real suite

* Fix slow tests

1073a2bd

Fix GPT2DoubleHeadsModel to work with model.generate() (#6601) · afb50c66

LSinev authored Nov 16, 2020

* Fix passing token_type_ids during GPT2DoubleHeadsModel.generate() if used

and for GPT2LMHeadModel too

* Update tests to check token_type_ids usage in GPT2 models

afb50c66

Adding the prepare_seq2seq_batch function to ProphetNet (#8515) · 04d8136b

Yusuke Mori authored Nov 16, 2020

* Simply insert T5Tokenizer's prepare_seq2seq_batch

* Update/Add some 'import'

* fix RunTimeError caused by '.view'

* Moves .view related error avoidance from seq2seq_trainer to inside prophetnet

* Update test_tokenization_prophetnet.py

* Format the test code with black

* Re-format the test code

* Update test_tokenization_prophetnet.py

* Add importing require_torch in the test code

* Add importing BatchEncoding in the test code

* Re-format the test code on Colab

04d8136b

15 Nov, 2020 1 commit

[breaking|pipelines|tokenizers] Adding slow-fast tokenizers equivalence tests... · f4e04cd2

Thomas Wolf authored Nov 15, 2020


[breaking|pipelines|tokenizers] Adding slow-fast tokenizers equivalence tests pipelines - Removing sentencepiece as a required dependency (#8073)

* Fixing roberta for slow-fast tests

* WIP getting equivalence on pipelines

* slow-to-fast equivalence - working on question-answering pipeline

* optional FAISS tests

* Pipeline Q&A

* Move pipeline tests to their own test job again

* update tokenizer to add sequence id methods

* update to tokenizers 0.9.4

* set sentencepiecce as optional

* clean up squad

* clean up pipelines to use sequence_ids

* style/quality

* wording

* Switch to use_fast = True by default

* update tests for use_fast at True by default

* fix rag tokenizer test

* removing protobuf from required dependencies

* fix NER test for use_fast = True by default

* fixing example tests (Q&A examples use slow tokenizers for now)

* protobuf in main deps extras["sentencepiece"] and example deps

* fix protobug install test

* try to fix seq2seq by switching to slow tokenizers for now

* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

f4e04cd2

13 Nov, 2020 2 commits

Rework some TF tests (#8492) · 24184e73

Julien Plu authored Nov 13, 2020

* Update some tests

* Small update

* Apply style

* Use max_position_embeddings

* Create a fake attribute

* Create a fake attribute

* Update wrong name

* Wrong TransfoXL model file

* Keep the common tests agnostic

24184e73

[T5] Bug correction & Refactor (#8518) · 42e2d02e
Patrick von Platen authored Nov 13, 2020
```
* fix bug

* T5 refactor

* refactor tf

* apply sylvains suggestions
```
42e2d02e

12 Nov, 2020 1 commit

Add pretraining loss computation for TF Bert pretraining (#8470) · 5d805394

Julien Plu authored Nov 12, 2020

* Add pretraining loss computation for TF Bert pretraining

* Fix labels creation

* Fix T5 model

* restore T5 kwargs

* try a generic fix for pretraining models

* Apply style

* Overide the prepare method for the BERT tests

5d805394

11 Nov, 2020 3 commits

Skip test until investigation · c7b6bbec
Lysandre authored Nov 11, 2020

c7b6bbec

Add TFDPR (#8203) · 026a2ff2

Ratthachat (Jung) authored Nov 12, 2020

* Create modeling_tf_dpr.py

* Add TFDPR

* Add back TFPegasus, TFMarian, TFMBart, TFBlenderBot

last commit accidentally deleted these 4 lines, so I recover them back

* Add TFDPR

* Add TFDPR

* clean up some comments, add TF input-style doc string

* Add TFDPR

* Make return_dict=False as default

* Fix return_dict bug (in .from_pretrained)

* Add get_input_embeddings()

* Create test_modeling_tf_dpr.py

The current version is already passed all 27 tests!
Please see the test run at : 
https://colab.research.google.com/drive/1czS_m9zy5k-iSJbzA_DP1k1xAAC_sdkf?usp=sharing



* fix quality

* delete init weights

* run fix copies

* fix repo consis

* del config_class, load_tf_weights

They shoud be 'pytorch only'

* add config_class back

after removing it, test failed ... so totally only removing "use_tf_weights = None" on Lysandre suggestion

* newline after .. note::

* import tf, np (Necessary for ModelIntegrationTest)

* slow_test from_pretrained with from_pt=True

At the moment we don't have TF weights (since we don't have official official TF model)
Previously, I did not run slow test, so I missed this bug

* Add simple TFDPRModelIntegrationTest

Note that this is just a test that TF and Pytorch gives approx. the same output.
However, I could not test with the official DPR repo's output yet

* upload correct tf model

* remove position_ids as missing keys
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: patrickvonplaten <patrick@huggingface.co>

026a2ff2

Add next sentence prediction loss computation (#8462) · da842e4e

Julien Plu authored Nov 11, 2020

* Add next sentence prediction loss computation

* Apply style

* Fix tests

* Add forgotten import

* Add forgotten import

* Use a new parameter

* Remove kwargs and use positional arguments

da842e4e

10 Nov, 2020 7 commits

fix t5 token type ids (#8437) · 70708cca
Patrick von Platen authored Nov 10, 2020

70708cca

[No merge] TF integration testing (#7621) · 9fd1f562

Lysandre Debut authored Nov 10, 2020

* stash

* TF Integration testing for ELECTRA, BERT, Longformer

* Trigger slow tests

* Apply suggestions from code review

9fd1f562

using multi_gpu consistently (#8446) · 02bdfc02
Stas Bekman authored Nov 10, 2020
```
* s|multiple_gpu|multi_gpu|g; s|multigpu|multi_gpu|g'

* doc
```
02bdfc02
fix t5 special tokens (#8435) · b9356945
Patrick von Platen authored Nov 10, 2020

b9356945
[docs] improve bart/marian/mBART/pegasus docs (#8421) · c314b1fd
Sam Shleifer authored Nov 10, 2020

c314b1fd

Patch token classification pipeline (#8364) · 850afb42

Lysandre Debut authored Nov 10, 2020



* Patch token classification pipeline

* Some added tests for TokenClassificationArgumentHandler (#8366)
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

850afb42

Model versioning (#8324) · 70f622fa

Julien Chaumond authored Nov 10, 2020

* fix typo

* rm use_cdn & references, and implement new hf_bucket_url

* I'm pretty sure we don't need to `read` this file

* same here

* [BIG] file_utils.networking: do not gobble up errors anymore

* Fix CI 😇



* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Tiny doc tweak

* Add doc + pass kwarg everywhere

* Add more tests and explain

cc @sshleifer let me know if better
Co-Authored-By: Sam Shleifer <sshleifer@gmail.com>

* Also implement revision in pipelines

In the case where we're passing a task name or a string model identifier

* Fix CI 😇



* Fix CI

* [hf_api] new methods + command line implem

* make style

* Final endpoints post-migration

* Fix post-migration

* Py3.6 compat

cc @stefan-it

Thank you @stas00
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

70f622fa

09 Nov, 2020 3 commits

[Tests] Add Common Test for Training + Fix a couple of bugs (#8415) · 9c83b96e

Patrick von Platen authored Nov 09, 2020

* add training tests

* correct longformer

* fix docs

* fix some tests

* fix some more train tests

* remove ipdb

* fix multiple edge case model training

* fix funnel and prophetnet

* clean gpt models

* undo renaming of albert

9c83b96e

Add new token classification example (#8340) · 908a2889

Sylvain Gugger authored Nov 09, 2020



* Add new token classification example

* Remove txt file

* Add test

* With actual testing done

* Less warmup is better

* Update examples/token-classification/run_ner_new.py
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

* Address review comments

* Fix test

* Make Lysandre happy

* Last touches and rename

* Rename in tests

* Address review comments

* More run_ner -> run_ner_old
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

908a2889

[fsmt tokenizer] support lowercase tokenizer (#8389) · 78d706f3
Stas Bekman authored Nov 09, 2020
```
* support lowercase tokenizer

* fix arg pos
```
78d706f3

06 Nov, 2020 1 commit

[All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add... · bc0d26d1

Yossi Synett authored Nov 06, 2020


[All Seq2Seq model + CLM models that can be used with EncoderDecoder] Add cross-attention weights to outputs (#8071)

* Output cross-attention with decoder attention output

* Update src/transformers/modeling_bert.py

* add cross-attention for t5 and bart as well

* fix tests

* correct typo in docs

* add sylvains and sams comments

* correct typo
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

bc0d26d1

05 Nov, 2020 2 commits

Make Trainer evaluation handle dynamic seq_length (#8336) · 04e442d5

Sylvain Gugger authored Nov 05, 2020

* Make Trainer evaluation handle dynamic seq_length

* Document behavior.

* Fix test

* Better fix

* Fixes for realsies this time

* Address review comments

* Without forgetting to save...

04e442d5

Output global_attentions in Longformer models (#7562) · 27b402ca

Guillaume Filion authored Nov 05, 2020



* Output global_attentions in Longformer models

* make style

* small refactoring

* fix tests

* make fix-copies

* add for tf as well

* remove comments in test

* make fix-copies

* make style

* add docs

* make docstring pretty
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

27b402ca

04 Nov, 2020 3 commits

Clean up data collators and datasets (#8308) · 9c4aa4ac

Sylvain Gugger authored Nov 04, 2020



* Clean up data collators and datasets

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Remove needless clone
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

9c4aa4ac

Improve QA pipeline error handling (#8286) · 7342d9a5

Nicolas Patry authored Nov 04, 2020

- The issue is that with previous code we would have the following:

```python
qa_pipeline = (...)
qa_pipeline(question="Where was he born ?", context="")
-> IndexError: Dimension out of range (expected to be in range of [-1, 0], but got 1)
```

The goal here is to improve this to actually return a ValueError
wherever possible.

While at it, I tried to simplify QuestionArgumentHandler's code to
make it smaller and more compat while keeping backward compat.

7342d9a5

[Generate Test] fix greedy generate test (#8293) · cb966e64
Patrick von Platen authored Nov 04, 2020
```
* fix greedy generate test

* delet ipdb
```
cb966e64

03 Nov, 2020 7 commits

[WIP] Ner pipeline grouped_entities fixes (#5970) · 29b536a7

Ceyda Cinarel authored Nov 04, 2020



* Bug fix: NER pipeline shouldn't group separate entities of same type

* style fix

* [Bug Fix] Shouldn't group entities that are both 'B' even if they are same type
	(B-type1 B-type1) != (B-type1 I-type1)
[Bug Fix] add an option `ignore_subwords` to ignore subsequent ##wordpieces in predictions. Because some models train on only the first token of a word and not on the subsequent wordpieces (BERT NER default). So it makes sense doing the same thing at inference time.
	The simplest fix is to just group the subwords with the first wordpiece.
	[TODO] how to handle ignored scores? just set them to 0 and calculate zero invariant mean ?
	[TODO] handle different wordpiece_prefix ## ? possible approaches:
		get it from tokenizer? but currently most tokenizers dont have a wordpiece_prefix property?
		have an _is_subword(token)
[Feature add] added option to `skip_special_tokens`. Cause It was harder to remove them after grouping.
[Additional Changes] remove B/I prefix on returned grouped_entities
[Feature Request/TODO] Return indexes?
[Bug TODO]  can't use fast tokenizer with grouped_entities ('BertTokenizerFast' object has no attribute 'convert_tokens_to_string')

* use offset_mapping to fix [UNK] token problem

* ignore score for subwords

* modify ner_pipeline test

* modify ner_pipeline test

* modify ner_pipeline test

* ner_pipeline change ignore_subwords default to true

* add ner_pipeline ignore_subword=False test case

* fix offset_mapping index

* fix style again duh

* change is_subword and convert_tokens_to_string logic

* merge tests with new test structure

* change test names

* remove old tests

* ner tests for fast tokenizer

* fast tokenizers have convert_tokens_to_string

* Fix the incorrect merge
Co-authored-by: Ceyda Cinarel <snu-ceyda@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

29b536a7

[CIs] Better reports everywhere (#8275) · 1bb4bba5

Stas Bekman authored Nov 03, 2020

* make it possible to invoke testconf.py in both test suites without crashing on having the same option added

* perl -pi -e 's|--make_reports|--make-reports|' to be consistent with other opts

* add `pytest --make-reports` to all CIs (and artifacts)

* fix

1bb4bba5

Data collator for token classification (#8274) · 7f556d2e
Sylvain Gugger authored Nov 03, 2020
```
* Add DataCollatorForTokenClassification and clean tests

* Make quality
```
7f556d2e
Clean Trainer tests and datasets dep (#8268) · 4c19f3ba
Sylvain Gugger authored Nov 03, 2020

4c19f3ba

Updated ConversationalPipeline to work with encoder-decoder models (#8207) · 74f6f91a

guillaume-be authored Nov 03, 2020



* Updated ConversationalPipeline to work with encoder-decoder models (e.g. BlenderBot)

* Addition of integration test for EncoderDecoder conversation model
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

74f6f91a

[FIX] TextGenerationPipeline is currently broken. (#8256) · c66ffa3a

Nicolas Patry authored Nov 03, 2020

* [FIX] TextGenerationPipeline is currently broken.

It's most likely due to #8180.
What's missing is a multi vs single string handler at the beginning of
the pipe.
And also there was no testing of this pipeline.

* Fixing Conversational tests too.

c66ffa3a

Refactoring the generate() function (#6949) · a1bbcf3f

Patrick von Platen authored Nov 03, 2020

* first draft

* show design proposition for new generate method

* up

* make better readable

* make first version

* gpt2 tests pass

* make beam search for gpt2 work

* add first encoder-decoder code

* delete typo

* make t5 work

* save indermediate

* make bart work with beam search

* finish beam search bart / t5

* add default kwargs

* make more tests pass

* fix no bad words sampler

* some fixes and tests for all distribution processors

* fix test

* fix rag slow tests

* merge to master

* add nograd to generate

* make all slow tests pass

* speed up generate

* fix edge case bug

* small fix

* correct typo

* add type hints and docstrings

* fix typos in tests

* add beam search tests

* add tests for beam scorer

* fix test rag

* finish beam search tests

* move generation tests in seperate file

* fix generation tests

* more tests

* add aggressive generation tests

* fix tests

* add gpt2 sample test

* add more docstring

* add more docs

* finish doc strings

* apply some more of sylvains and sams comments

* fix some typos

* make fix copies

* apply lysandres and sylvains comments

* final corrections on examples

* small fix for reformer

a1bbcf3f

02 Nov, 2020 3 commits
- 2 SinusoidalPositionalEmbedding fixes (#8226) · 504ff7bb
  Stas Bekman authored Nov 02, 2020
  
  504ff7bb
- Fix ignore list behavior in doctests (#8213) · 0c92e7d9
  Santiago Castro authored Nov 02, 2020
  
  0c92e7d9
- Fix the behaviour of DefaultArgumentHandler (removing it). (#8180) · 84caa233
  Nicolas Patry authored Nov 02, 2020
```
* Some work to fix the behaviour of DefaultArgumentHandler by removing it.

* Fixing specific pipelines argument checking.
```
  84caa233
30 Oct, 2020 3 commits

Replace swish with silu (#8166) · 00112c35

TFUsers authored Oct 30, 2020



* Replace swish with silu

* revert nn.silu to nn.swish due to older version

* simplify optimized silu conditional and fix format

* Update activations.py

* Update activations_tf.py

* Update modeling_flax_utils.py

* Update modeling_openai.py

* add swish testcase

* add pytorch swish testcase

* Add more robust python version check

* more formatting fixes
Co-authored-by: TFUsers <TFUsers@gmail.com>

00112c35

TFMarian, TFMbart, TFPegasus, TFBlenderbot (#7987) · 566b083e

Sam Shleifer authored Oct 30, 2020



* Start plumbing

* Marian close

* Small stubs for all children

* Fixed bart

* marian working

* pegasus test is good, but failing

* Checkin tests

* More model files

* Subtle marian, pegasus integration test failures

* Works well

* rm print

* boom boom

* Still failing model2doc

* merge master

* Equivalence test failing, all others fixed

* cleanup

* Fix embed_scale

* Cleanup marian pipeline test

* Undo extra changes

* Smaller delta

* Cleanup model testers

* undo delta

* fix tests import structure

* cross test decorator

* Cleaner set_weights

* Respect authorized_unexpected_keys

* No warnings

* No warnings

* style

* Nest tf import

* black

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* functional dropout

* fixup

* Fixup

* style_doc

* embs

* shape list

* delete slow force_token_id_to_be_generated func

* fixup
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

566b083e

Ci test tf super slow (#8007) · 10f8c636

Lysandre Debut authored Oct 30, 2020

* Test TF GPU CI

* Change cache

* Fix missing torch requirement

* Fix some model tests


Style

* LXMERT

* MobileBERT

* Longformer skip test

* XLNet

* The rest of the tests

* RAG goes OOM in multi gpu setup

* YAML test files

* Last fixes

* Skip doctests

* Fill mask tests

* Yaml files

* Last test fix

* Style

* Update cache

* Change ONNX tests to slow + use tiny model

10f8c636

29 Oct, 2020 1 commit
- Smarter prediction loop and no- -> no_ in console args (#8151) · acf56408
  Sylvain Gugger authored Oct 29, 2020
```
* Smarter prediction loop and no- -> no_ in console args

* Fix test
```
  acf56408