Commits · 67ff1c314a61a2d5949b3bb48fa3ec7e9b697d7e · chenpangpang / transformers

08 Dec, 2020 9 commits

Templates overhaul 1 (#8993) · 67ff1c31
Lysandre Debut authored Dec 08, 2020

67ff1c31

Sylvain Gugger authored Dec 08, 2020



* Add new SQUAD example

* Same with a task-specific Trainer

* Address review comment.

* Small fixes

* Initial work for XLNet

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Final clean up and working XLNet script

* Test and debug

* Final working version

* Add new SQUAD example

* Same with a task-specific Trainer

* Address review comment.

* Small fixes

* Initial work for XLNet

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Final clean up and working XLNet script

* Test and debug

* Final working version

* Add tick

* Update README

* Address review comments
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

447808c8

Removed unused `encoder_hidden_states` and `encoder_attention_mask` (#8972) · 7809eb82

guillaume-be authored Dec 08, 2020

* Removed unused `encoder_hidden_states` and `encoder_attention_mask` from MobileBert

* Removed decoder tests for MobileBert

* Removed now unnecessary import

7809eb82

Fix interaction of return_token_type_ids and add_special_tokens (#8854) · b7cdd00f
Lysandre Debut authored Dec 08, 2020

b7cdd00f
Make `ModelOutput` pickle-able (#8989) · 04c446f7
Sylvain Gugger authored Dec 08, 2020

04c446f7
[model_card] remove bogus testing changes · 0d9e6ca9
Julien Chaumond authored Dec 08, 2020

0d9e6ca9

Optional layers (#8961) · bf7f79cd

Julien Plu authored Dec 08, 2020

* Apply on BERT and ALBERT

* Update TF Bart

* Add input processing to TF BART

* Add input processing for TF CTRL

* Add input processing to TF Distilbert

* Add input processing to TF DPR

* Add input processing to TF Electra

* Add deprecated arguments

* Add input processing to TF XLM

* remove unused imports

* Add input processing to TF Funnel

* Add input processing to TF GPT2

* Add input processing to TF Longformer

* Add input processing to TF Lxmert

* Apply style

* Add input processing to TF Mobilebert

* Add input processing to TF GPT

* Add input processing to TF Roberta

* Add input processing to TF T5

* Add input processing to TF TransfoXL

* Apply style

* Rebase on master

* Fix wrong model name

* Fix BART

* Apply style

* Put the deprecated warnings in the input processing function

* Remove the unused imports

* Raise an error when len(kwargs)>0

* test ModelOutput instead of TFBaseModelOutput

* Address Patrick's comments

* Address Patrick's comments

* Add boolean processing for the inputs

* Take into account the optional layers

* Add missing/unexpected weights in the other models

* Apply style

* rename parameters

* Apply style

* Remove useless

* Remove useless

* Remove useless

* Update num parameters

* Fix tests

* Address Patrick's comment

* Remove useless attribute

bf7f79cd

[training] SAVE_STATE_WARNING was removed in pytorch (#8979) · 9d7d0005

Stas Bekman authored Dec 07, 2020

* [training] SAVE_STATE_WARNING was removed in pytorch

FYI `SAVE_STATE_WARNING` has been removed 3 days ago: pytorch/pytorch#46813

Fixes: #8232

@sgugger

* style, but add () to prevent autoformatters from botching it

* switch to try/except

* cleanup

9d7d0005

Check table as independent script (#8976) · 2ae7388e
Lysandre Debut authored Dec 07, 2020

2ae7388e

07 Dec, 2020 11 commits

Copyright (#8970) · 00aa9dbc
Sylvain Gugger authored Dec 07, 2020
```
* Add copyright everywhere missing

* Style
```
00aa9dbc
add max_length to showcase the use of truncation (#8975) · c108d0b5
Navjot authored Dec 07, 2020

c108d0b5
Small fix to the run clm script (#8973) · 62d30e05
Sylvain Gugger authored Dec 07, 2020

62d30e05

transformers-cli: LFS multipart uploads (> 5GB) (#8663) · 28fa014a

Julien Chaumond authored Dec 07, 2020



* initial commit

* [cli] lfs commands

* Fix FileSlice

* Tweak to FileSlice

* [hf_api] Backport filetype arg from `datasets`

cc @lhoestq

* Silm down the CI while i'm working

* Ok let's try this in CI

* Update config.yml

* Do not try this at home

* one more try

* Update lfs.py

* Revert "Tweak to FileSlice"

This reverts commit d7e32c4b3500400486411e85a2b74e57fb6b52f5.

* Update test_hf_api.py

* Update test_hf_api.py

* Update test_hf_api.py

* CI still green?

* make CI green again?

* Update test_hf_api.py

* make CI red again?

* Update test_hf_api.py

* add CI style back

* Fix CI?

* oh my

* doc + switch back to real staging endpoint

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Pierric Cistac <Pierrci@users.noreply.github.com>

* Fix docblock + f-strings
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Pierric Cistac <Pierrci@users.noreply.github.com>

28fa014a

Create README.md (#8964) · 0bce7c55
Wietse de Vries authored Dec 07, 2020

0bce7c55
Update README.txt (#8957) · 7ccd973e
Nguyen Van Nha authored Dec 08, 2020

7ccd973e

> 30 files leads to hanging on --More-- · 37f4c24f

Stas Bekman authored Dec 07, 2020

cancel debug printing for now. As it can be seen lead to a failing test here:
https://app.circleci.com/pipelines/github/huggingface/transformers/16894/workflows/cc86f7a9-4020-45af-8ab3-c22f79b427cf/jobs/131924

37f4c24f

Use word_ids to get labels in run_ner (#8962) · 7f9ccffc
Sylvain Gugger authored Dec 07, 2020
```
* Use word_ids to get labels in run_ner

* Add sanity check
```
7f9ccffc
Remove sourcerer (#8965) · de6befd4
Clement authored Dec 07, 2020

de6befd4

Add TFGPT2ForSequenceClassification based on DialogRPT (#8714) · 483e1327

sandip authored Dec 07, 2020

* Add TFGPT2ForSequenceClassification based on DialogRPT

* Add TFGPT2ForSequenceClassification based on DialogRPT

* TFGPT2ForSequenceClassification based on DialogRPT-refactored code, implemented review comments and added input processing

* Add TFGPT2ForSequenceClassification based on DialogRPT

* TFGPT2ForSequenceClassification based on DialogRPT-refactored code, implemented review comments and added input processing

* code refactor for latest other TF PR

* code refactor

* code refactor

* Update modeling_tf_gpt2.py

483e1327

Fix QA pipeline on Windows (#8947) · 28c77ddf
Sylvain Gugger authored Dec 07, 2020

28c77ddf

06 Dec, 2020 1 commit

Add model card (#8948) · 72d6c9c6

Philip Tamimi-Sarnikowski authored Dec 06, 2020



* add model card

* lowercase identifier
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

72d6c9c6

05 Dec, 2020 2 commits

Fix typo for `modeling_bert` import resulting in ImportError (#8931) · ef93a254
Machel Reid authored Dec 05, 2020
```
Self-explanatory ;) - Hope it helps!
```
ef93a254

Don't pass in token_type_ids to BART for GLUE (#8929) · 8dfc8c72

Ethan Perez authored Dec 05, 2020

Without this fix, training a `BARTForSequenceClassification` model with `run_pl_glue.py` gives `TypeError: forward() got an unexpected keyword argument 'token_type_ids'`, because BART does not have token_type_ids. I've solved this issue in the same way as it's solved for the "distilbert" model, and I can train BART models on SNLI without errors now.

8dfc8c72

04 Dec, 2020 5 commits

[seq2seq] document the caveat of leaky native amp (#8930) · df311a5c

Stas Bekman authored Dec 04, 2020



* document the caveat of leaky native amp

* Update examples/seq2seq/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

df311a5c

[ci] skip doc jobs - circleCI is not reliable - disable skip for now (#8926) · 73c51f7f
Stas Bekman authored Dec 04, 2020
```
* disable skipping, but leave logging for the future
```
73c51f7f
Fix TF T5 only encoder model with booleans (#8925) · 71688a88
Lysandre Debut authored Dec 04, 2020

71688a88

Better booleans handling in the TF models (#8777) · dcd3046f

Julien Plu authored Dec 04, 2020

* Apply on BERT and ALBERT

* Update TF Bart

* Add input processing to TF BART

* Add input processing for TF CTRL

* Add input processing to TF Distilbert

* Add input processing to TF DPR

* Add input processing to TF Electra

* Add deprecated arguments

* Add input processing to TF XLM

* Add input processing to TF Funnel

* Add input processing to TF GPT2

* Add input processing to TF Longformer

* Add input processing to TF Lxmert

* Apply style

* Add input processing to TF Mobilebert

* Add input processing to TF GPT

* Add input processing to TF Roberta

* Add input processing to TF T5

* Add input processing to TF TransfoXL

* Apply style

* Rebase on master

* Bug fix

* Retry to bugfix

* Retry bug fix

* Fix wrong model name

* Try another fix

* Fix BART

* Fix input precessing

* Apply style

* Put the deprecated warnings in the input processing function

* Remove the unused imports

* Raise an error when len(kwargs)>0

* test ModelOutput instead of TFBaseModelOutput

* Bug fix

* Address Patrick's comments

* Address Patrick's comments

* Address Sylvain's comments

* Add boolean processing for the inputs

* Apply style

* Missing optional

* Fix missing some input proc

* Update the template

* Fix missing inputs

* Missing input

* Fix args parameter

* Trigger CI

* Trigger CI

* Trigger CI

* Address Patrick's and Sylvain's comments

* Replace warn by warning

* Trigger CI

* Fix XLNET

* Fix detection

dcd3046f

[s2s finetune_trainer] add instructions for distributed training (#8884) · 4c3d98dd
Stas Bekman authored Dec 03, 2020

4c3d98dd

03 Dec, 2020 7 commits

Patch model parallel test (#8920) · aa60b230
Lysandre Debut authored Dec 03, 2020
```
* Patch model parallel test

* Remove line

* Remove `ci_*` from scheduled branches
```
aa60b230

Put Transformers on Conda (#8918) · 0c5615af

Lysandre Debut authored Dec 03, 2020



* conda

* Guide

* correct tag

* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/installation.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Sylvain's comments
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

0c5615af

Tweak wording + Add badge w/ number of models on the hub (#8914) · 9ad61943

Julien Chaumond authored Dec 03, 2020

* Add badge w/ number of models on the hub

* try to apease @sgugger 😇



* not sure what this `c` was about [ci skip]

* Fix script and move stuff around

* Fix doc styling error
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

9ad61943

Fix move when the two cache folders exist (#8917) · 6ed7e32f
Sylvain Gugger authored Dec 03, 2020

6ed7e32f
Avoid erasing the attention mask when double padding (#8915) · 8453201c
Sylvain Gugger authored Dec 03, 2020

8453201c
Don't warn that models aren't available if Flax is available. (#8841) · 0deece9c
Skye Wanderman-Milne authored Dec 03, 2020

0deece9c
[model_cards] lm-head was deprecated · 2b7fc9a0
Julien Chaumond authored Dec 03, 2020
```
(and wasn't needed here anyways as it was added automatically)
```
2b7fc9a0

02 Dec, 2020 5 commits

[PyTorch] Refactor Resize Token Embeddings (#8880) · 443f67e8

Patrick von Platen authored Dec 02, 2020

* fix resize tokens

* correct mobile_bert

* move embedding fix into modeling_utils.py

* refactor

* fix lm head resize

* refactor

* break lines to make sylvain happy

* add news tests

* fix typo

* improve test

* skip bart-like for now

* check if base_model = get(...) is necessary

* clean files

* improve test

* fix tests

* revert style templates

* Update templates/adding_a_new_model/cookiecutter-template-{{cookiecutter.modelname}}/modeling_{{cookiecutter.lowercase_modelname}}.py

443f67e8

Update README.md (#8906) · e52f9c0a
Devangi Purkayastha authored Dec 02, 2020

e52f9c0a
Fix typo in docstring (#8905) · 801b2cb3
ryota-mo authored Dec 03, 2020

801b2cb3

[trainer] improve code readability (#8903) · 7e1cb00c

Stas Bekman authored Dec 02, 2020

* [trainer] improve code

This PR:
- removes redundant code 
```
self.model = model if model is not None else None
```
and
```
self.model = model
```
are the same.

* separate attribute assignment from code logic - which simplifies things further.

* whitespace

7e1cb00c

Warning about too long input for fast tokenizers too (#8799) · a8c3f9aa

Nicolas Patry authored Dec 02, 2020

* Warning about too long input for fast tokenizers too

If truncation is not set in tokenizers, but the tokenization is too long
for the model (`model_max_length`), we used to trigger a warning that

The input would probably fail (which it most likely will).

This PR re-enables the warning for fast tokenizers too and uses common
code for the trigger to make sure it's consistent across.

* Checking for pair of inputs too.

* Making the function private and adding it's doc.

* Remove formatting ?? in odd place.

* Missed uppercase.

a8c3f9aa