- 24 Jun, 2020 18 commits
- Setu Shah authored
- Sylvain Gugger authored
- Sai Saketh Aluru authored
  * Add dehatebert-mono-arabic readme card
  * Update dehatebert-mono-arabic model card
  * model cards for Hate-speech-CNERG models
- Lysandre Debut authored
  * Cleaning TensorFlow models
  * Update all classes style
  * Don't average loss
- Sylvain Gugger authored
- Ali Modarressi authored
- Sylvain Gugger authored
  * Try with the same command
  * Try like this
- Sylvain Gugger authored
- Patrick von Platen authored
  * fix use cache
  * add bart use cache
  * fix bart
  * finish bart
- Sylvain Gugger authored
- Patrick von Platen authored
- Patrick von Platen authored
  * add benchmark for all kinds of models (a usage sketch follows below)
  * improved import
  * delete bogus files
  * make style
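A minimal usage sketch of such a benchmark run, assuming the PyTorchBenchmark utilities in transformers; the model name and measurement settings are illustrative:

```python
from transformers import PyTorchBenchmark, PyTorchBenchmarkArguments

# Illustrative settings; any model identifier can be substituted.
args = PyTorchBenchmarkArguments(
    models=["bert-base-uncased"],
    batch_sizes=[8],
    sequence_lengths=[128, 512],
)
benchmark = PyTorchBenchmark(args)
results = benchmark.run()  # reports inference speed and memory usage
```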
- Sylvain Gugger authored
- Sylvain Gugger authored
- flozi00 authored
  * Create README.md
  * Update model_cards/a-ware/roberta-large-squad-classification/README.md
  Co-authored-by: Julien Chaumond <chaumond@gmail.com>
- Adriano Diniz authored
  Fix/add information in README.md
- ahotrod authored
  electra_large_discriminator_squad2_512 Question Answering LM
- Kevin Canwen Xu authored
  * Fix PABEE division by zero error (see the sketch below)
  * patience=0 by default
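A hypothetical guard illustrating the division-by-zero pattern being fixed; none of these names come from the actual PABEE code:

```python
def average_exit_layer(exit_layers: list) -> float:
    """Average transformer layer at which examples early-exited.

    With patience=0, early exit is effectively disabled and no exit
    statistics accumulate, so guard the denominator before dividing.
    """
    if not exit_layers:  # avoids ZeroDivisionError when nothing exited early
        return 0.0
    return sum(exit_layers) / len(exit_layers)
```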
- 23 Jun, 2020 11 commits
- Sylvain Gugger authored
  * Only put tensors on a device (see the sketch below)
  * Type hint and unpack list comprehension
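A minimal sketch of the "only put tensors on a device" idea (hypothetical helper, not the exact Trainer code): non-tensor values in a batch dict are passed through untouched, since only torch.Tensor has a .to() method.

```python
import torch

def prepare_inputs(inputs: dict, device: torch.device) -> dict:
    """Move only the tensor values of a batch onto the target device."""
    return {
        k: v.to(device) if isinstance(v, torch.Tensor) else v
        for k, v in inputs.items()
    }

# Usage: batch = prepare_inputs(batch, torch.device("cuda"))
```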
- Sylvain Gugger authored
  * Add version control menu
  * Constify things
  * Apply suggestions from code review
  Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
  Co-authored-by: Julien Chaumond <chaumond@gmail.com>
- Sam Shleifer authored
- Julien Chaumond authored
- Sam Shleifer authored
- Thomas Wolf authored
- Thomas Wolf authored
  * Add return lengths
  * make pad a bit more flexible so it can be used as collate_fn
  * check all kwargs sent to encoding method are known
  * fixing kwargs in encodings
  * New AddedToken class in Python: this class lets you specify specific tokenization behaviors for some special tokens, used in particular for GPT2 and RoBERTa to control how whitespace is stripped around special tokens (see the sketch below)
  * style and quality
  * switched to the huggingface tokenizers library for AddedTokens
  * up to tokenizers 0.8.0-rc3 - update API to use AddedToken state
  * style and quality
  * do not raise an error on additional or unused kwargs for tokenize(), only a warning
  * transfo-xl pretrained model requires torch
  * Update src/transformers/tokenization_utils.py
  Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
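A minimal sketch of the AddedToken behavior described above, assuming the tokenizers-library API this commit switches to; the token string is illustrative:

```python
from tokenizers import AddedToken
from transformers import GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")

# lstrip=True makes the new token absorb whitespace on its left, so
# "hello <special_token>" tokenizes the same as "hello<special_token>".
tokenizer.add_tokens([AddedToken("<special_token>", lstrip=True, rstrip=False)])
print(tokenizer.tokenize("hello <special_token>"))
```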
- Patrick von Platen authored
  * improve mem handling
  * improve mem for pos ax encodings
- Sam Shleifer authored
- Sam Shleifer authored
- Sam Shleifer authored
- 22 Jun, 2020 11 commits
- flozi00 authored
  * [Modelcard] bart-squadv2
  * Update README.md
  * Update README.md
- flozi00 authored
- Fran Martinez authored
  * Create README.md
  * changes in model usage section
  * minor changes in output visualization
  * minor errata in readme
- furunkel authored
  * Create README.md
  * Update README.md
- bogdankostic authored
- Adriano Diniz authored
- Adriano Diniz authored
- Adriano Diniz authored
  * Create README.md
  * Apply suggestions from code review
  Co-authored-by: Julien Chaumond <chaumond@gmail.com>
- Michaël Benesty authored
  * Add link to new community notebook (optimization) related to https://github.com/huggingface/transformers/issues/4842#event-3469184635
    This notebook is about benchmarking model training with and without the dynamic padding optimization (https://github.com/ELS-RD/transformers-notebook). Using dynamic padding on MNLI provides a **4.7x training time reduction**, with the max pad length set to 512. The effect is strong because few examples in this dataset are much longer than 400 tokens. In practice it will depend on the dataset, but it always brings an improvement and, after more than 20 experiments listed in this [article](https://towardsdatascience.com/divide-hugging-face-transformers-training-time-by-2-or-more-21bf7129db9q-21bf7129db9e?source=friends_link&sk=10a45a0ace94b3255643d81b6475f409), it seems not to hurt performance. Following advice from @patrickvonplaten I do the PR myself :-) (a sketch of the idea follows below)
  * Update notebooks/README.md
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
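A minimal sketch of the dynamic padding idea the notebook benchmarks (a hypothetical collate function, not the notebook's code): each batch is padded only to its own longest sequence instead of a fixed global length such as 512.

```python
import torch

def dynamic_padding_collate(batch, pad_token_id=0):
    """Pad a batch of examples to the length of its longest member."""
    max_len = max(len(ex["input_ids"]) for ex in batch)
    input_ids, attention_mask = [], []
    for ex in batch:
        ids = list(ex["input_ids"])
        n_pad = max_len - len(ids)
        input_ids.append(ids + [pad_token_id] * n_pad)
        attention_mask.append([1] * len(ids) + [0] * n_pad)
    return {
        "input_ids": torch.tensor(input_ids),
        "attention_mask": torch.tensor(attention_mask),
    }

# Usage: DataLoader(dataset, batch_size=32, collate_fn=dynamic_padding_collate)
```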
- Lee Haau-Sing authored
  * nyu-mll: roberta on smaller datasets
  * Update README.md
  * Update README.md
  Co-authored-by: Alex Warstadt <alexwarstadt@gmail.com>
- Sylvain Gugger authored