- 16 Nov, 2020 (9 commits)
-
Sylvain Gugger authored
* Use the CI to identify failing tests
* Remove from all examples and tests
* More default switch
* Fixes
* More test fixes
* More fixes
* Last fixes hopefully
* Run on the real suite
* Fix slow tests
-
Sylvain Gugger authored
-
LSinev authored
* Fix passing token_type_ids during GPT2DoubleHeadsModel.generate() if used, and for GPT2LMHeadModel too
* Update tests to check token_type_ids usage in GPT2 models
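For context, a minimal sketch of the fixed behavior: `token_type_ids` handed to `generate()` are now forwarded to the model (and extended) at each decoding step instead of being silently dropped. Checkpoint and inputs are illustrative.

```python
from transformers import GPT2Tokenizer, GPT2LMHeadModel

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

inputs = tokenizer("Hello, my dog", return_tensors="pt")
# One segment id per prompt token; after the fix these are carried through
# (and grown) step by step during generation.
token_type_ids = inputs["input_ids"].new_zeros(inputs["input_ids"].shape)

output_ids = model.generate(inputs["input_ids"],
                            token_type_ids=token_type_ids,
                            max_length=20)
print(tokenizer.decode(output_ids[0]))
```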
-
Yusuke Mori authored
* Simply insert T5Tokenizer's prepare_seq2seq_batch
* Update/add some imports
* Fix RuntimeError caused by '.view'
* Move the '.view'-related error avoidance from seq2seq_trainer to inside ProphetNet
* Update test_tokenization_prophetnet.py
* Format the test code with black
* Re-format the test code
* Update test_tokenization_prophetnet.py
* Add import of require_torch in the test code
* Add import of BatchEncoding in the test code
* Re-format the test code on Colab
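As a quick illustration, a sketch of the `prepare_seq2seq_batch` API this commit mirrors from `T5Tokenizer` into `ProphetNetTokenizer`; the checkpoint, texts, and exact returned keys are assumptions based on the common seq2seq tokenizer contract of this release.

```python
from transformers import ProphetNetTokenizer

tokenizer = ProphetNetTokenizer.from_pretrained("microsoft/prophetnet-large-uncased")
batch = tokenizer.prepare_seq2seq_batch(
    src_texts=["HuggingFace is a company based in New York."],
    tgt_texts=["HuggingFace is based in New York."],
    max_length=64,
    return_tensors="pt",
)
# A BatchEncoding with input_ids, attention_mask and labels, ready for a
# seq2seq model such as ProphetNetForConditionalGeneration.
print(batch.keys())
```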
-
Stas Bekman authored
* [doc] typo fix @sgugger
* Update src/transformers/modeling_utils.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Branden Chan authored
-
Mehrdad Farahani authored
-
Mehrdad Farahani authored
-
zhezhaoa authored
* Create README.md
* Update model_cards/uer/chinese_roberta_L-2_H-128/README.md

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
- 15 Nov, 2020 (1 commit)
-
Thomas Wolf authored
[breaking|pipelines|tokenizers] Adding slow-fast tokenizers equivalence tests for pipelines - removing sentencepiece as a required dependency (#8073)
* Fixing roberta for slow-fast tests
* WIP getting equivalence on pipelines
* slow-to-fast equivalence - working on question-answering pipeline
* optional FAISS tests
* Pipeline Q&A
* Move pipeline tests to their own test job again
* update tokenizer to add sequence id methods
* update to tokenizers 0.9.4
* set sentencepiece as optional
* clean up squad
* clean up pipelines to use sequence_ids
* style/quality
* wording
* Switch to use_fast = True by default
* update tests for use_fast at True by default
* fix rag tokenizer test
* removing protobuf from required dependencies
* fix NER test for use_fast = True by default
* fixing example tests (Q&A examples use slow tokenizers for now)
* protobuf in main deps extras["sentencepiece"] and example deps
* fix protobuf install test
* try to fix seq2seq by switching to slow tokenizers for now
* Update src/transformers/tokenization_utils_base.py

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
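The user-visible change, sketched below: `AutoTokenizer` now returns the fast (Rust-backed) tokenizer unless `use_fast=False` is passed, and fast tokenizers expose the sequence-id methods the pipelines now rely on. A minimal sketch; checkpoint and inputs are illustrative.

```python
from transformers import AutoTokenizer

# Fast tokenizer is now the default; pass use_fast=False for the slow one.
fast_tok = AutoTokenizer.from_pretrained("bert-base-uncased")
slow_tok = AutoTokenizer.from_pretrained("bert-base-uncased", use_fast=False)
print(fast_tok.is_fast, slow_tok.is_fast)  # True False

# Fast tokenizers map each token back to its source sequence, which the
# question-answering pipeline uses instead of heuristics on token ids.
enc = fast_tok("What is HF?", "HuggingFace is a company")
print(enc.sequence_ids())  # e.g. [None, 0, 0, 0, 0, None, 1, 1, ..., None]
```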
-
- 13 Nov, 2020 (11 commits)
-
Julien Plu authored
* Update some tests
* Small update
* Apply style
* Use max_position_embeddings
* Create a fake attribute
* Create a fake attribute
* Update wrong name
* Wrong TransfoXL model file
* Keep the common tests agnostic
-
Patrick von Platen authored
* fix load weights
* delete line
-
Joe Davison authored
-
Julien Chaumond authored
* More doc tweaks
* Update model_sharing.rst
* make style
* missing newline
* Add email tip

Co-authored-by: Pierric Cistac <pierric@huggingface.co>
-
LysandreJik authored
-
Lysandre Debut authored
* Model templates
* TensorFlow
* Remove pooler
* CI
* Tokenizer + Refactoring
* Encoder-Decoder
* Let's go testing
* Encoder-Decoder in TF
* Let's go testing in TF
* Documentation
* README
* Fixes
* Better names
* Style
* Update docs
* Choose to skip either TF or PT
* Code quality fixes
* Add to testing suite
* Update file path
* Cookiecutter path
* Update `transformers` path
* Handle rebasing
* Remove seq2seq from model templates
* Remove s2s config
* Apply Sylvain and Patrick comments
* Apply suggestions from code review
* Last fixes from code review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Patrick von Platen authored
* fix bug
* T5 refactor
* refactor tf
* apply Sylvain's suggestions
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Branden Chan authored
* Update README.md
* Update README.md
-
Sylvain Gugger authored
-
- 12 Nov, 2020 (9 commits)
-
Julien Plu authored
* Add pretraining loss computation for TF Bert pretraining
* Fix labels creation
* Fix T5 model
* restore T5 kwargs
* try a generic fix for pretraining models
* Apply style
* Override the prepare method for the BERT tests
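A minimal sketch of the new loss path, assuming the parameter names this PR settles on (`labels` for the masked-LM targets, `next_sentence_label` for the NSP target); checkpoint and inputs are illustrative.

```python
import tensorflow as tf
from transformers import BertTokenizer, TFBertForPreTraining

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = TFBertForPreTraining.from_pretrained("bert-base-uncased")

inputs = tokenizer("The cat sat on the mat.", "It was very comfortable.",
                   return_tensors="tf")
outputs = model(
    inputs,
    labels=inputs["input_ids"],            # MLM targets (normally masked positions)
    next_sentence_label=tf.constant([0]),  # 0 = second sentence follows the first
)
print(outputs.loss)  # combined masked-LM + next-sentence loss
```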
-
Julien Plu authored
-
Julien Plu authored
-
Forrest Iandola authored
-
Sylvain Gugger authored
* Model sharing doc
* Style
-
Chengxi Guo authored
* fix doc bug
* fix example bug

Signed-off-by: mymusise <mymusise1@gmail.com>
-
zeyuyun1 authored
-
Antonio Lanza authored
-
Julien Chaumond authored
cc @Pierrci
-
- 11 Nov, 2020 (10 commits)
-
Funtowicz Morgan authored
* Update deploy-docs dependencies on CI to enable Flax
* Added pair of ""

Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
-
Sumithra Bhakthavatsalam authored
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
Funtowicz Morgan authored
* First addition of Flax/Jax documentation
* make style
* Ensure input order matches between Bert & Roberta
* Install dependencies "all" when building doc
* wraps build_doc deps with ""
* Addressing @sgugger comments
* Use list to highlight JAX features
* Make style
* Let's not look too much into the future for now
* Style

Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
-
Lysandre authored
-
Beomsoo Kim authored
-
Ratthachat (Jung) authored
* Create modeling_tf_dpr.py
* Add TFDPR
* Add back TFPegasus, TFMarian, TFMBart, TFBlenderBot (the last commit accidentally deleted these four lines, so this recovers them)
* Add TFDPR
* Add TFDPR
* clean up some comments, add TF input-style docstring
* Add TFDPR
* Make return_dict=False the default
* Fix return_dict bug (in .from_pretrained)
* Add get_input_embeddings()
* Create test_modeling_tf_dpr.py (the current version already passes all 27 tests; see the test run at https://colab.research.google.com/drive/1czS_m9zy5k-iSJbzA_DP1k1xAAC_sdkf?usp=sharing)
* fix quality
* delete init weights
* run fix copies
* fix repo consistency
* del config_class, load_tf_weights (they should be PyTorch-only)
* add config_class back after removing it (a test failed), so only "use_tf_weights = None" is removed, on Lysandre's suggestion
* newline after .. note::
* import tf, np (necessary for ModelIntegrationTest)
* slow test of from_pretrained with from_pt=True (at the moment we don't have TF weights, since there is no official TF model; previously this slow test wasn't run, so this bug was missed)
* Add simple TFDPRModelIntegrationTest (note that this only tests that TF and PyTorch give approximately the same output; it has not been checked against the official DPR repo's output yet)
* upload correct tf model
* remove position_ids as missing keys

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: patrickvonplaten <patrick@huggingface.co>
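An illustrative sketch of the new TF DPR classes; the checkpoint is the published PyTorch DPR question encoder, loaded with `from_pt=True` as in the slow test described above.

```python
from transformers import DPRQuestionEncoderTokenizer, TFDPRQuestionEncoder

tokenizer = DPRQuestionEncoderTokenizer.from_pretrained(
    "facebook/dpr-question_encoder-single-nq-base"
)
# Convert the PyTorch checkpoint on the fly while official TF weights land.
model = TFDPRQuestionEncoder.from_pretrained(
    "facebook/dpr-question_encoder-single-nq-base", from_pt=True
)

inputs = tokenizer("What is the capital of France?", return_tensors="tf")
embedding = model(inputs).pooler_output  # dense question embedding for retrieval
print(embedding.shape)  # (1, 768)
```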
-
sarnoult authored
The new run_ner.py script tries to run prediction on the raw test set `datasets["test"]`, but it should use the tokenized set `tokenized_datasets["test"]`.
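The fix in context, as a one-line sketch (the variable names around `trainer.predict` follow the script):

```python
# Predict on the tokenized test set, not the raw one:
predictions, labels, metrics = trainer.predict(tokenized_datasets["test"])
```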
-
Julien Plu authored
-
Julien Plu authored
* Add next sentence prediction loss computation
* Apply style
* Fix tests
* Add forgotten import
* Add forgotten import
* Use a new parameter
* Remove kwargs and use positional arguments
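A minimal sketch of the resulting API, assuming `next_sentence_label` is the new parameter referred to above; checkpoint and inputs are illustrative.

```python
import tensorflow as tf
from transformers import BertTokenizer, TFBertForNextSentencePrediction

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = TFBertForNextSentencePrediction.from_pretrained("bert-base-uncased")

encoding = tokenizer("The sky is blue.", "Grass is green.", return_tensors="tf")
# 1 = the second sentence is random, 0 = it follows the first.
outputs = model(encoding, next_sentence_label=tf.constant([1]))
print(outputs.loss, outputs.logits)
```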
-
Julien Plu authored
-