- 22 Jun, 2020 26 commits
-
bogdankostic authored
-
Adriano Diniz authored
-
Adriano Diniz authored
-
Adriano Diniz authored
* Create README.md * Apply suggestions from code review Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Michaël Benesty authored
* Add link to new community notebook (optimization) related to https://github.com/huggingface/transformers/issues/4842#event-3469184635 This notebook is about benchmarking model training with/without dynamic padding optimization. https://github.com/ELS-RD/transformers-notebook Using dynamic padding on MNLI provides a **4.7 times training time reduction**, with max pad length set to 512. The effect is strong because few examples are >> 400 tokens in this dataset. IRL, it will depend on the dataset, but it always brings improvement and, after more than 20 experiments listed in this [article](https://towardsdatascience.com/divide-hugging-face-transformers-training-time-by-2-or-more-21bf7129db9q-21bf7129db9e?source=friends_link&sk=10a45a0ace94b3255643d81b6475f409 ), it seems not to hurt performance. Following advice from @patrickvonplaten, I opened the PR myself :-) * Update notebooks/README.md Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
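The idea, in a minimal sketch (the collate function and field names below are hypothetical, not taken from the linked notebook): pad each batch only to its own longest example rather than to a fixed maximum length.

```python
import torch
from torch.nn.utils.rnn import pad_sequence

def dynamic_padding_collate(batch):
    # Pad each batch only up to its own longest sequence instead of a fixed
    # max length, so batches of short examples waste far less compute.
    input_ids = [torch.tensor(example["input_ids"]) for example in batch]
    labels = torch.tensor([example["label"] for example in batch])
    padded = pad_sequence(input_ids, batch_first=True, padding_value=0)
    attention_mask = (padded != 0).long()
    return {"input_ids": padded, "attention_mask": attention_mask, "labels": labels}

# loader = DataLoader(train_dataset, batch_size=32, collate_fn=dynamic_padding_collate)
```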
-
Lee Haau-Sing authored
* nyu-mll: roberta on smaller datasets * Update README.md * Update README.md Co-authored-by:Alex Warstadt <alexwarstadt@gmail.com>
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Quicktour part 1 * Update * All done * Typos Co-authored-by:
Thomas Wolf <thomwolf@users.noreply.github.com> * Address comments in quick tour * Update docs/source/quicktour.rst Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update from feedback Co-authored-by:
Thomas Wolf <thomwolf@users.noreply.github.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Thomas Wolf authored
* Cleaner warning when loading pretrained models This makes the logging messages more explicit when using the various `from_pretrained` methods. It also emits these messages with `logging.warning` because they flag a common source of silent mistakes. * Update src/transformers/modeling_utils.py Co-authored-by:
Julien Chaumond <chaumond@gmail.com> * Update src/transformers/modeling_utils.py Co-authored-by:
Julien Chaumond <chaumond@gmail.com> * style and quality Co-authored-by:
Julien Chaumond <chaumond@gmail.com>
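A small usage sketch, relying only on standard Python logging (not a transformers-specific API), of surfacing those warnings when loading a checkpoint whose head is newly initialized:

```python
import logging

# Standard Python logging: the library logs from module-level loggers such as
# "transformers.modeling_utils", so their level controls what a user sees.
logging.basicConfig(level=logging.INFO)
logging.getLogger("transformers.modeling_utils").setLevel(logging.WARNING)

from transformers import BertForSequenceClassification

# The classification head is not part of the pretrained checkpoint, so loading
# now emits an explicit warning about newly initialized weights.
model = BertForSequenceClassification.from_pretrained("bert-base-uncased")
```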
-
Lysandre Debut authored
* Have documentation fail on warning * Force ci failure * Revert "Force ci failure" This reverts commit f0a4666ec2eb4cd00a4da48af3357defc63324a0.
-
Sylvain Gugger authored
-
Adriano Diniz authored
-
Manuel Romero authored
* Create README.md @julien-c check out that dataset meta tag is right * Fix typo Co-authored-by:Julien Chaumond <chaumond@gmail.com>
-
Manuel Romero authored
-
Patrick von Platen authored
-
Thomas Wolf authored
* fix #5081 and improve backward compatibility (slightly) * add nlp to setup.cfg - style and quality * align default to previous default * remove test that doesn't generalize
-
Malte authored
Fix for https://github.com/huggingface/transformers/issues/3809
-
Iz Beltagy authored
* add support for gradient checkpointing in BERT * fix unit tests * isort * black * workaround for `torch.utils.checkpoint.checkpoint` not accepting bool * Revert "workaround for `torch.utils.checkpoint.checkpoint` not accepting bool" This reverts commit 5eb68bb804f5ffbfc7ba13c45a47717f72d04574. * workaround for `torch.utils.checkpoint.checkpoint` not accepting bool Co-authored-by:Lysandre Debut <lysandre@huggingface.co>
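A hedged usage sketch; the `gradient_checkpointing` config flag is assumed here to be what this PR adds for BERT, so check the released config for the exact attribute name.

```python
from transformers import BertConfig, BertForSequenceClassification

# Assumed flag name from this PR: activations are recomputed during the backward
# pass, trading extra compute for a much smaller memory footprint.
config = BertConfig.from_pretrained("bert-base-uncased", gradient_checkpointing=True)
model = BertForSequenceClassification.from_pretrained("bert-base-uncased", config=config)
```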
-
Joseph Liu authored
* Configure all models to use output_hidden_states as argument passed to forward() * Pass all tests * Remove cast_bool_to_primitive in TF Flaubert model * correct tf xlnet * add pytorch test * add tf test * Fix broken tests * Configure all models to use output_hidden_states as argument passed to forward() * Pass all tests * Remove cast_bool_to_primitive in TF Flaubert model * correct tf xlnet * add pytorch test * add tf test * Fix broken tests * Refactor output_hidden_states for mobilebert * Reset and remerge to master Co-authored-by:
Joseph Liu <joseph.liu@coinflex.com> Co-authored-by:
patrickvonplaten <patrick.v.platen@gmail.com>
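A minimal usage sketch of the new call-time argument (tuple-style outputs assumed, as in this release):

```python
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")

inputs = tokenizer("Hello world", return_tensors="pt")
# output_hidden_states is now a forward() argument, not only a config attribute.
outputs = model(**inputs, output_hidden_states=True)
hidden_states = outputs[-1]  # one tensor per layer, plus the embedding output
```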
-
Kevin Canwen Xu authored
* Add model cards for Microsoft's MiniLM * XLMRobertaTokenizer * format * Add thumbnail * finishing up
-
RafaelWO authored
* Fixed resize_token_embeddings for transfo_xl model * Fixed resize_token_embeddings for transfo_xl. Added custom methods to TransfoXLPreTrainedModel for resizing layers of the AdaptiveEmbedding. * Updated docstring * Fixed resizing cutoffs; added check for new size of embedding layer. * Added test for resize_token_embeddings * Fixed code quality * Fixed unchanged cutoffs in model.config * Added feature to move added tokens in tokenizer. * Fixed code quality * Added feature to move added tokens in tokenizer. * Fixed code quality * Fixed docstring, renamed sym to token. Co-authored-by:Rafael Weingartner <rweingartner.its-b2015@fh-salzburg.ac.at>
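A minimal sketch of the fixed path, assuming new tokens are simply appended to the vocabulary:

```python
from transformers import TransfoXLLMHeadModel, TransfoXLTokenizer

tokenizer = TransfoXLTokenizer.from_pretrained("transfo-xl-wt103")
model = TransfoXLLMHeadModel.from_pretrained("transfo-xl-wt103")

tokenizer.add_tokens(["<new_token_1>", "<new_token_2>"])
# With this fix, the AdaptiveEmbedding layers and cutoffs are resized as well.
model.resize_token_embeddings(len(tokenizer))
```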
-
Sylvain Gugger authored
* Update glossary * Update docs/source/glossary.rst Co-authored-by:Patrick von Platen <patrick.v.platen@gmail.com>
-
Patrick von Platen authored
* finish benchmark * fix isort * fix setup cfg * retab * fix time measuring of tf graph mode * fix tf cuda * clean code * better error message
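A small usage sketch of the benchmark utilities this touches, assuming the `PyTorchBenchmark` / `PyTorchBenchmarkArguments` entry points:

```python
from transformers import PyTorchBenchmark, PyTorchBenchmarkArguments

# Hypothetical minimal run: time inference for one model at one input shape.
args = PyTorchBenchmarkArguments(
    models=["bert-base-uncased"],
    batch_sizes=[8],
    sequence_lengths=[128],
)
benchmark = PyTorchBenchmark(args)
results = benchmark.run()
```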
-
Zihao Fu authored
fix bart doc
-
Mikael Souza authored
-
flozi00 authored
-
- 21 Jun, 2020 1 commit
-
Ilya Boytsov authored
Authored-by:i.boytsov <i.boytsov@MAC867.local>
-
- 20 Jun, 2020 7 commits
-
Tim Suchanek authored
-
Kevin Canwen Xu authored
-
Julien Chaumond authored
* SummarizationPipeline: init required task name * Update src/transformers/pipelines.py Co-authored-by:
Sam Shleifer <sshleifer@gmail.com> * Apply suggestions from code review Co-authored-by:
Sam Shleifer <sshleifer@gmail.com>
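Usage is unchanged; a minimal sketch with the default checkpoint:

```python
from transformers import pipeline

# The pipeline now registers its own task name ("summarization") at init time.
summarizer = pipeline("summarization")
summary = summarizer("Very long article text ...", max_length=60, min_length=10)
print(summary[0]["summary_text"])
```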
-
Kevin Canwen Xu authored
* Add BERT Loses Patience (Patience-based Early Exit) * update model archive * update format * sort import * flake8 * Add results * full results * align the table * refactor to inherit * default per gpu eval = 1 * Formatting * Formatting * isort * modify readme * Add check * Fix format * Fix format * Doc strings * ALBERT & BERT for sequence classification don't inherit from the original anymore * Remove incorrect comments * Remove incorrect comments * Remove incorrect comments * Sync up with new code * Sync up with new code * Add a test * Add a test * Add a test * Add a test * Add a test * Add a test * Finishing up!
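A minimal sketch of the patience rule itself (not the PR's code): exit as soon as `patience` consecutive internal classifiers agree on the prediction.

```python
import torch

def patience_based_exit(per_layer_logits, patience=3):
    """Sketch of Patience-based Early Exit: stop once `patience` consecutive
    internal classifiers produce the same prediction."""
    previous, streak = None, 0
    for layer, logits in enumerate(per_layer_logits):  # one logits tensor per layer
        prediction = logits.argmax(dim=-1)
        streak = streak + 1 if previous is not None and torch.equal(prediction, previous) else 1
        previous = prediction
        if streak >= patience:
            return prediction, layer  # early exit at this layer
    return previous, len(per_layer_logits) - 1  # no early exit: use the last layer
```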
-
Zhu Baohe authored
-
Kevin Canwen Xu authored
-
Lysandre authored
-
- 19 Jun, 2020 5 commits
-
Vasily Shamporov authored
* Add MobileBert * Quality + Conversion script * style * Update src/transformers/modeling_mobilebert.py * Links to S3 * Style * TFMobileBert Slight fixes to the pytorch MobileBert Style * MobileBertForMaskedLM (PT + TF) * MobileBertForNextSentencePrediction (PT + TF) * MobileBertFor{MultipleChoice, TokenClassification} (PT + TF) * Tests + Auto * Doc * Tests * Addressing @sgugger's comments * Addressing @patrickvonplaten's comments * Style * Style * Integration test * style * Model card Co-authored-by:Lysandre <lysandre.debut@reseau.eseo.fr> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
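A hedged usage sketch of the new model; `google/mobilebert-uncased` is assumed as the checkpoint name.

```python
from transformers import MobileBertForMaskedLM, MobileBertTokenizer

tokenizer = MobileBertTokenizer.from_pretrained("google/mobilebert-uncased")
model = MobileBertForMaskedLM.from_pretrained("google/mobilebert-uncased")

inputs = tokenizer("The capital of France is [MASK].", return_tensors="pt")
logits = model(**inputs)[0]
# Find the masked position and print the top predicted token.
mask_pos = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero()[0].item()
print(tokenizer.convert_ids_to_tokens(int(logits[0, mask_pos].argmax(-1))))
```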
-
Sam Shleifer authored
-
Erick Rocha Fonseca authored
-
Sam Shleifer authored
-
Sam Shleifer authored
-
- 18 Jun, 2020 1 commit
-
Sylvain Gugger authored
-