- 24 Jun, 2020 1 commit
-
-
Kevin Canwen Xu authored
* Fix PABEE division by zero error * patience=0 by default
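A minimal sketch of the kind of guard such a fix implies (names here are hypothetical, not the actual PABEE code): with `patience=0`, early exit is disabled, so per-example counters can stay at zero and must not be used as a divisor.

```python
def average_layers_per_example(total_layers_run: int, num_examples: int) -> float:
    """Hypothetical sketch: when patience=0 disables early exit, no exit
    statistics accumulate and the counter can be zero; guard the division."""
    if num_examples == 0:
        return 0.0
    return total_layers_run / num_examples
```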
-
- 23 Jun, 2020 11 commits
-
-
Sylvain Gugger authored
* Only put tensors on a device * Type hint and unpack list comprehension
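A minimal sketch of the idea (the helper name is illustrative): only `torch.Tensor` values in the input dict are moved to the device, so non-tensor entries pass through untouched.

```python
import torch

def prepare_inputs(inputs: dict, device: torch.device) -> dict:
    # Move only tensor values to the device; strings, ints, and other
    # non-tensor metadata pass through as-is.
    return {k: v.to(device) if isinstance(v, torch.Tensor) else v
            for k, v in inputs.items()}
```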
-
Sylvain Gugger authored
* Add version control menu * Constify things Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Apply suggestions from code review Co-authored-by:
Julien Chaumond <chaumond@gmail.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Julien Chaumond <chaumond@gmail.com>
-
Sam Shleifer authored
-
Julien Chaumond authored
-
Sam Shleifer authored
-
Thomas Wolf authored
-
Thomas Wolf authored
* Add return lengths * make pad a bit more flexible so it can be used as collate_fn * check all kwargs sent to encoding method are known * fix kwargs in encodings * New AddedToken class in Python This class lets you specify specific tokenization behaviors for some special tokens, used in particular for GPT2 and Roberta to control how whitespace is stripped around special tokens. * style and quality * switched to the huggingface tokenizers library for AddedTokens * up to tokenizers 0.8.0-rc3 - update API to use AddedToken state * style and quality * do not raise an error on additional or unused kwargs for tokenize() but only a warning * transfo-xl pretrained model requires torch * Update src/transformers/tokenization_utils.py Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
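A short sketch of the intended `AddedToken` usage (the token string is illustrative): `lstrip`/`rstrip` control whether surrounding whitespace is stripped when the special token is matched.

```python
from tokenizers import AddedToken
from transformers import GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
# lstrip=True strips the whitespace to the left of the token before matching,
# so "hello <special>" and "hello<special>" tokenize the special token alike.
tokenizer.add_tokens([AddedToken("<special>", lstrip=True, rstrip=False)])
```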
-
Patrick von Platen authored
* improve mem handling * improve mem for pos ax encodings
-
Sam Shleifer authored
-
Sam Shleifer authored
-
Sam Shleifer authored
-
- 22 Jun, 2020 28 commits
-
-
flozi00 authored
* [Modelcard] bart-squadv2 * Update README.md * Update README.md
-
flozi00 authored
-
Fran Martinez authored
* Create README.md * changes in model usage section * minor changes in output visualization * minor errata in readme
-
furunkel authored
* Create README.md * Update README.md
-
bogdankostic authored
-
Adriano Diniz authored
-
Adriano Diniz authored
-
Adriano Diniz authored
* Create README.md * Apply suggestions from code review Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Michaël Benesty authored
* Add link to new community notebook (optimization) related to https://github.com/huggingface/transformers/issues/4842#event-3469184635 This notebook is about benchmarking model training with and without the dynamic padding optimization. https://github.com/ELS-RD/transformers-notebook Using dynamic padding on MNLI provides a **4.7 times training time reduction**, with max pad length set to 512. The effect is strong because few examples in this dataset are >> 400 tokens. In real life it will depend on the dataset, but it always brings an improvement and, after more than 20 experiments listed in this [article](https://towardsdatascience.com/divide-hugging-face-transformers-training-time-by-2-or-more-21bf7129db9e?source=friends_link&sk=10a45a0ace94b3255643d81b6475f409), it seems not to hurt performance. Following advice from @patrickvonplaten I made the PR myself :-) * Update notebooks/README.md Co-authored-by:
Patrick von Platen <patrick.v.platen@gmail.com>
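A minimal sketch of dynamic padding via a collate function, assuming examples are already-tokenized dicts and using the more flexible `tokenizer.pad` mentioned in the tokenizer commit above (`tokenized_dataset` is a placeholder): each batch is padded to its own longest sequence rather than a global maximum length.

```python
from torch.utils.data import DataLoader
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

def collate_fn(batch):
    # Pad each batch to its own longest sequence, not a global max length.
    return tokenizer.pad(batch, padding=True, return_tensors="pt")

# loader = DataLoader(tokenized_dataset, batch_size=32, collate_fn=collate_fn)
```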
-
Lee Haau-Sing authored
* nyu-mll: roberta on smaller datasets * Update README.md * Update README.md Co-authored-by: Alex Warstadt <alexwarstadt@gmail.com>
-
Sylvain Gugger authored
-
Sylvain Gugger authored
* Quicktour part 1 * Update * All done * Typos Co-authored-by:
Thomas Wolf <thomwolf@users.noreply.github.com> * Address comments in quick tour * Update docs/source/quicktour.rst Co-authored-by:
Lysandre Debut <lysandre@huggingface.co> * Update from feedback Co-authored-by:
Thomas Wolf <thomwolf@users.noreply.github.com> Co-authored-by:
Lysandre Debut <lysandre@huggingface.co>
-
Thomas Wolf authored
* Cleaner warning when loading pretrained models This makes the logging messages more explicit when using the various `from_pretrained` methods. It also emits these messages as `logging.warning`, because this is a common source of silent mistakes. * Update src/transformers/modeling_utils.py Co-authored-by:
Julien Chaumond <chaumond@gmail.com> * Update src/transformers/modeling_utils.py Co-authored-by:
Julien Chaumond <chaumond@gmail.com> * style and quality Co-authored-by:
Julien Chaumond <chaumond@gmail.com>
-
Lysandre Debut authored
* Have documentation fail on warning * Force ci failure * Revert "Force ci failure" This reverts commit f0a4666ec2eb4cd00a4da48af3357defc63324a0.
-
Sylvain Gugger authored
-
Adriano Diniz authored
-
Manuel Romero authored
* Create README.md @julien-c check that the dataset meta tag is right * Fix typo Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Manuel Romero authored
-
Patrick von Platen authored
-
Thomas Wolf authored
* fix #5081 and improve backward compatibility (slightly) * add nlp to setup.cfg - style and quality * align default to previous default * remove test that doesn't generalize
-
Malte authored
Fix for https://github.com/huggingface/transformers/issues/3809
-
Iz Beltagy authored
* add support for gradient checkpointing in BERT * fix unit tests * isort * black * workaround for `torch.utils.checkpoint.checkpoint` not accepting bool * Revert "workaround for `torch.utils.checkpoint.checkpoint` not accepting bool" This reverts commit 5eb68bb804f5ffbfc7ba13c45a47717f72d04574. * workaround for `torch.utils.checkpoint.checkpoint` not accepting bool Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
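A sketch of the pattern, including the closure workaround for `torch.utils.checkpoint.checkpoint` not accepting bool arguments (the layer interface is simplified to return a single hidden-states tensor):

```python
from torch.utils.checkpoint import checkpoint

def run_encoder_layers(layers, hidden_states, attention_mask,
                       output_attentions=False, gradient_checkpointing=True):
    for layer in layers:
        if gradient_checkpointing:
            # checkpoint() only forwards tensor arguments, so the bool flag is
            # captured in a closure instead of being passed in directly.
            def custom_forward(hidden, mask):
                return layer(hidden, mask, output_attentions)

            hidden_states = checkpoint(custom_forward, hidden_states, attention_mask)
        else:
            hidden_states = layer(hidden_states, attention_mask, output_attentions)
    return hidden_states
```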
-
Joseph Liu authored
* Configure all models to use output_hidden_states as an argument passed to forward() * Pass all tests * Remove cast_bool_to_primitive in TF Flaubert model * correct tf xlnet * add pytorch test * add tf test * Fix broken tests * Refactor output_hidden_states for mobilebert * Reset and remerge to master Co-authored-by:
Joseph Liu <joseph.liu@coinflex.com> Co-authored-by:
patrickvonplaten <patrick.v.platen@gmail.com>
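A quick sketch of the resulting call pattern (the tuple indexing assumes no other optional outputs are requested): `output_hidden_states` can now be toggled per forward call instead of only via the model config.

```python
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")

inputs = tokenizer("Hello world", return_tensors="pt")
# Request hidden states for this call only, without touching the config.
outputs = model(**inputs, output_hidden_states=True)
all_hidden_states = outputs[-1]  # one tensor per layer, plus the embedding output
```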
-
Kevin Canwen Xu authored
* Add model cards for Microsoft's MiniLM * XLMRobertaTokenizer * format * Add thumbnail * finishing up
-
RafaelWO authored
* Fixed resize_token_embeddings for the transfo_xl model. Added custom methods to TransfoXLPreTrainedModel for resizing layers of the AdaptiveEmbedding. * Updated docstring * Fixed resizing cutoffs; added check for new size of embedding layer. * Added test for resize_token_embeddings * Fixed code quality * Fixed unchanged cutoffs in model.config * Added feature to move added tokens in tokenizer. * Fixed code quality * Fixed docstring, renamed sym to token. Co-authored-by: Rafael Weingartner <rweingartner.its-b2015@fh-salzburg.ac.at>
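The usual call pattern this fixes, sketched for Transformer-XL (the added token is illustrative): resizing now handles the `AdaptiveEmbedding` layers and their cutoffs.

```python
from transformers import TransfoXLLMHeadModel, TransfoXLTokenizer

tokenizer = TransfoXLTokenizer.from_pretrained("transfo-xl-wt103")
model = TransfoXLLMHeadModel.from_pretrained("transfo-xl-wt103")

tokenizer.add_tokens(["<new_token>"])
# Resizes the AdaptiveEmbedding layers (and their cutoffs) to the new vocab size.
model.resize_token_embeddings(len(tokenizer))
```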
-
Sylvain Gugger authored
* Update glossary * Update docs/source/glossary.rst Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Patrick von Platen authored
* finish benchmark * fix isort * fix setup cfg * retab * fix time measuring of tf graph mode * fix tf cuda * clean code * better error message
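A hedged sketch of the benchmark utilities this commit finishes, assuming the `PyTorchBenchmark`/`PyTorchBenchmarkArguments` API of that era:

```python
from transformers import PyTorchBenchmark, PyTorchBenchmarkArguments

args = PyTorchBenchmarkArguments(
    models=["bert-base-uncased"],
    batch_sizes=[8],
    sequence_lengths=[128],
)
benchmark = PyTorchBenchmark(args)
results = benchmark.run()  # reports inference time and memory per configuration
```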
-
Zihao Fu authored
fix bart doc
-