Commits · f6cb0f806efecb64df40c946dacaad0adad33d53 · chenpangpang / transformers

11 Aug, 2020 15 commits

[s2s] wmt download script use less ram (#6405) · f6cb0f80
Stas Bekman authored Aug 11, 2020

f6cb0f80
pl version: examples/requirements.txt is single source of truth (#6309) · 7c6a085e
Stas Bekman authored Aug 11, 2020

7c6a085e
Create Model Card File (#6357) · 1d1d5bec
Pranav Vadrevu authored Aug 11, 2020

1d1d5bec

Abed khooli authored Aug 11, 2020

* Create README.md

Model card for https://huggingface.co/akhooli/gpt2-small-arabic



* Update model_cards/akhooli/gpt2-small-arabic/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

00ce881c

switch Hindi-BERT to S3 README (#6396) · 3ae30787
Nick Doiron authored Aug 11, 2020

3ae30787

Create README.md (#6397) · 824e651e

Abed khooli authored Aug 11, 2020



* Create README.md

* Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md

* Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md

* Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md

* Update model_cards/akhooli/gpt2-small-arabic-poetry/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

824e651e

[Performance improvement] "Bad tokens ids" optimization (#6064) · 40478291

guillaume-be authored Aug 11, 2020

* Optimized banned token masking

* Avoid duplicate EOS masking if in bad_words_id

* Updated mask generation to handle empty banned token list

* Addition of unit tests for the updated bad_words_ids masking

* Updated timeout handling in `test_postprocess_next_token_scores_large_bad_words_list` unit test

* Updated timeout handling in `test_postprocess_next_token_scores_large_bad_words_list` unit test (timeout does not work on Windows)

* Moving Marian import to the test context to allow TF only environments to run

* Moving imports to torch_available test

* Updated operations device and test

* Updated operations device and test

* Added docstring and comment for in-place scores modification

* Moving test to own test_generation_utils, use of lighter models for testing

* removed unneded imports in test_modeling_common

* revert formatting change for ModelTesterMixin

* Updated caching, simplified eos token id test, removed unnecessary @require_torch

* formatting compliance

40478291

Warn if debug requested without TPU fixes (#6308) (#6390) · 87e124c2

David LaPalomento authored Aug 11, 2020



* Warn if debug requested without TPU fixes (#6308)
Check whether a PyTorch compatible TPU is available before attempting to print TPU metrics after training has completed. This way, users who apply `--debug` without reading the documentation aren't suprised by a stacktrace.

* Style
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

87e124c2

Fix tokenizer saving and loading error (#6026) · cdf1f7ed

Junyuan Zheng authored Aug 11, 2020



* fix tokenizer saving and loading bugs when adding AddedToken to additional special tokens

* Add tokenizer test

* Style

* Style 2
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

cdf1f7ed

testing utils: capturing std streams context manager (#6231) · 83984a61

Stas Bekman authored Aug 11, 2020

* testing utils: capturing std streams context manager

* style

* missing import

* add the origin of this code

83984a61

add pl_glue example test (#6034) · f6c0680d

Stas Bekman authored Aug 11, 2020

* add pl_glue example test

* for now just test that it runs, next validate results of eval or predict?

* complete the run_pl_glue test to validate the actual outcome

* worked on my machine, CI gets less accuracy - trying higher epochs

* match run_pl.sh hparms

* more epochs?

* trying higher lr

* for now just test that the script runs to a completion

* correct the comment

* if cuda is available, add --fp16 --gpus=1 to cover more bases

* style

f6c0680d

Feed forward chunking (#6024) · b25cec13

Pradhy729 authored Aug 11, 2020



* Chunked feed forward for Bert

This is an initial implementation to test applying feed forward chunking for BERT.
Will need additional modifications based on output and benchmark results.

* Black and cleanup

* Feed forward chunking in BertLayer class.

* Isort

* add chunking for all models

* fix docs

* Fix typo
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

b25cec13

Add TPU testing once again · 8a3db6b3
Lysandre authored Aug 11, 2020

8a3db6b3
Add missing docker arg for TPU CI. (#6393) · f65ac1fa
zcain117 authored Aug 10, 2020

f65ac1fa
[s2s] Script to save wmt data to disk (#6403) · b9ecd92e
Sam Shleifer authored Aug 10, 2020

b9ecd92e

10 Aug, 2020 12 commits

TF Longformer (#5764) · 00bb0b25

Patrick von Platen authored Aug 10, 2020



* improve names and tests longformer

* more and better tests for longformer

* add first tf test

* finalize tf basic op functions

* fix merge

* tf shape test passes

* narrow down discrepancies

* make longformer local attn tf work

* correct tf longformer

* add first global attn function

* add more global longformer func

* advance tf longformer

* finish global attn

* upload big model

* finish all tests

* correct false any statement

* fix common tests

* make all tests pass except keras save load

* fix some tests

* fix torch test import

* finish tests

* fix test

* fix torch tf tests

* add docs

* finish docs

* Update src/transformers/modeling_longformer.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/modeling_tf_longformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply Lysandres suggestions

* reverse to assert statement because function will fail otherwise

* applying sylvains recommendations

* Update src/transformers/modeling_longformer.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* Update src/transformers/modeling_tf_longformer.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

00bb0b25

[EncoderDecoderModel] add a `add_cross_attention` boolean to config (#6377) · 34259366
Patrick von Platen authored Aug 10, 2020
```
* correct encoder decoder model

* Apply suggestions from code review

* apply sylvains suggestions
```
34259366
Fix links for open in colab (#6391) · 06bc347c
Sylvain Gugger authored Aug 10, 2020

06bc347c
Colab button (#6389) · 3e0fe3cf
Sylvain Gugger authored Aug 10, 2020
```
* Add colab button

* Add colab link for tutorials
```
3e0fe3cf
Ci GitHub caching (#6382) · 79588e6f
Lysandre Debut authored Aug 10, 2020
```
* Cache Github Actions CI

* Remove useless file
```
79588e6f

Patch models (#6326) · b99098ab

Lysandre Debut authored Aug 10, 2020

* TFAlbertFor{TokenClassification, MultipleChoice}

* Patch models

* BERT and TF BERT info


s

* Update check_repo

b99098ab

Small docfile fixes (#6328) · 6028ed92
Sylvain Gugger authored Aug 10, 2020

6028ed92

refactor almost identical tests (#6339) · 1429b920

Stas Bekman authored Aug 10, 2020

* refactor almost identical tests

* important to add a clear assert error message

* make the assert error even more descriptive than the original bt

1429b920

correct pl link in readme (#6364) · 35eb96de
Rohit Gupta authored Aug 10, 2020

35eb96de
the test now works again (#6371) · 0830e795
Stas Bekman authored Aug 09, 2020

0830e795
Update modeling_tf_utils.py (#6372) · 3a556b0f
Alexander Measure authored Aug 10, 2020
```
fix typo: ckeckpoint->checkpoint
```
3a556b0f
Temporarily de-activate TPU CI · 1bbc54a8
Lysandre authored Aug 10, 2020

1bbc54a8

09 Aug, 2020 2 commits
- [model_cards] electra-base-turkish-cased-ner (#6350) · 6e8a3856
  M. Yusuf Sarıgöz authored Aug 09, 2020
```
* for electra-base-turkish-cased-ner

* Add metadata
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  6e8a3856
- [s2s] fix --gpus clarg collision (#6358) · 9a5ef837
  Sam Shleifer authored Aug 08, 2020
  
  9a5ef837
08 Aug, 2020 5 commits
- [GPT2] Correct typo in docs (#6352) · 1aec9916
  Patrick von Platen authored Aug 08, 2020
  
  1aec9916
- Add notebook on fine-tuning and interpreting Electra (#6321) · 9f57e39f
  elsanns authored Aug 08, 2020
```
Co-authored-by: eliska <3648991+elisans@users.noreply.github.com>
```
  9f57e39f
- [s2s] fix label_smoothed_nll_loss (#6344) · 9bed3554
  Suraj Patil authored Aug 08, 2020
  
  9bed3554
- [s2s] tiny QOL improvement: run_eval prints scores (#6341) · 99f73bcc
  Sam Shleifer authored Aug 08, 2020
  
  99f73bcc
- remove a TODO item to use a tiny model (#6338) · 322dffc6
  Stas Bekman authored Aug 07, 2020
```
as discussed with @sshleifer, removing this TODO to switch to a tiny model, since it won't be able to test the results of the evaluation (i.e. the results are meaningless).
```
  322dffc6
07 Aug, 2020 6 commits
- [CI] Self-scheduled runner also pins torch (#6332) · 1f8e8265
  Sam Shleifer authored Aug 07, 2020
  
  1f8e8265
- Add setup for TPU CI to run every hour. (#6219) · 1b8a7ffc
  zcain117 authored Aug 07, 2020
```
* Add setup for TPU CI to run every hour.

* Re-organize config.yml
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
```
  1b8a7ffc
- [examples] consistently use --gpus, instead of --n_gpu (#6315) · 6695450a
  Stas Bekman authored Aug 07, 2020
  
  6695450a
- Fix the tests for Electra (#6284) · 0e36e515
  Julien Plu authored Aug 07, 2020
```
* Fix the tests for Electra

* Apply style
```
  0e36e515
- Add a script to check all models are tested and documented (#6298) · 6ba540b7
  Sylvain Gugger authored Aug 07, 2020
```
* Add a script to check all models are tested and documented

* Apply suggestions from code review
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>

* Address comments
Co-authored-by: Kevin Canwen Xu <canwenxu@126.com>
```
  6ba540b7
- fix the slow tests doc (#6167) · e1638dce
  Stas Bekman authored Aug 07, 2020
```
remove unnecessary duplication wrt `RUN_SLOW=yes`
```
  e1638dce