Commits · a449ffcbd2887b936e6b70a89e533a0bb713743a · chenpangpang / transformers

22 Jan, 2021 5 commits
- Fix test (#9755) · a449ffcb
  Julien Plu authored Jan 22, 2021
  
  a449ffcb
- Add `report_to` training arguments to control the reporting integrations used (#9735) · 82d46feb
  Sylvain Gugger authored Jan 22, 2021
  
  82d46feb
- Fixes to run_seq2seq and instructions (#9734) · 411c5821
  Sylvain Gugger authored Jan 22, 2021
```
* Fixes to run_seq2seq and instructions

* Add more defaults for summarization
```
  411c5821
- Fix some TF slow tests (#9728) · d7c31abf
  Julien Plu authored Jan 22, 2021
```
* Fix saved model tests + fix a graph issue in longformer

* Apply style
```
  d7c31abf
- examples: fix XNLI url (#9741) · 08b22722
  Stefan Schweter authored Jan 22, 2021
  
  08b22722
21 Jan, 2021 11 commits

Fix memory regression in Seq2Seq example (#9713) · 5f80c15e

Sylvain Gugger authored Jan 21, 2021

* Fix memory regression in Seq2Seq example

* Fix test and properly deal with -100

* Easier condition with device safety

* Patch for MBartTokenzierFast

5f80c15e

Fix TF s2s models (#9478) · a7dabfb3

Julien Plu authored Jan 21, 2021

* Fix Seq2Seq models for serving

* Apply style

* Fix lonfgormer

* Fix mBart/Pegasus/Blenderbot

* Apply style

* Add a main intermediate layer

* Apply style

* Remove import

* Apply tf.function to Longformer

* Fix utils check_copy

* Update S2S template

* Fix BART + Blenderbot

* Fix BlenderbotSmall

* Fix BlenderbotSmall

* Fix BlenderbotSmall

* Fix MBart

* Fix Marian

* Fix Pegasus + template

* Apply style

* Fix common attributes test

* Forgot to fix the LED test

* Apply Patrick's comment on LED Decoder

a7dabfb3

Changing model default for TableQuestionAnsweringPipeline. (#9729) · 23e5a36e

Nicolas Patry authored Jan 21, 2021

* Changing model default for TableQuestionAnsweringPipeline.

- Discussion: https://discuss.huggingface.co/t/table-question-answering-is-not-an-available-task-under-pipeline/3284/6

* Updating slow tests that were out of sync.

23e5a36e

Fix mixed precision in TF models (#9163) · 3f290e6c

Julien Plu authored Jan 21, 2021

* Fix Gelu precision

* Fix gelu_fast

* Naming

* Fix usage and apply style

* add TF gelu approximate version

* add TF gelu approximate version

* add TF gelu approximate version

* Apply style

* Fix albert

* Remove the usage of the Activation layer

3f290e6c

fix T5 head mask in model_parallel (#9726) · 248fa1ae
Suraj Patil authored Jan 21, 2021
```
* fix head mask in model_parallel

* pass correct head mask
```
248fa1ae
finish (#9721) · ca422e3d
Patrick von Platen authored Jan 21, 2021

ca422e3d
reduce led memory (#9723) · c8ea582e
Patrick von Platen authored Jan 21, 2021

c8ea582e

Allow text generation for ProphetNetForCausalLM (#9707) · fb36c273

guillaume-be authored Jan 21, 2021

* Moved ProphetNetForCausalLM's parent initialization after config update

* Added unit tests for generation for ProphetNetForCausalLM

fb36c273

Temporarily deactivate TPU tests while we work on fixing them (#9720) · 910aa896
Lysandre Debut authored Jan 21, 2021

910aa896

fix typo (#9708) · 6a346f03

Muennighoff authored Jan 21, 2021



* fix typo
Co-authored-by: Suraj Patil <surajp815@gmail.com>

6a346f03

[trainer] no --deepspeed and --sharded_ddp together (#9712) · 4a20b7c4

Stas Bekman authored Jan 20, 2021



* no --deepspeed and --sharded_ddp together

* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

4a20b7c4

20 Jan, 2021 17 commits

Add missing new line · 7acfa95a
Sylvain Gugger authored Jan 20, 2021

7acfa95a

Adds flashcards to Glossary & makes small corrections (#8949) · 5a307ece

Darigov Research authored Jan 20, 2021

* fix: Makes small typo corrections & standardises glossary

* feat: Adds introduction & links to transformer flashcards

* feat: Adds attribution & adjustments requested in #8949

* feat: Adds flashcards to community.md

* refactor: Removes flashcards from glossary

5a307ece

Fix WAND_DISABLED test (#9703) · 3cd91e81

Sylvain Gugger authored Jan 20, 2021

* Fix WAND_DISABLED test

* Remove duplicate import

* Make a test that actually works...

* Fix style

3cd91e81

Fix style · 2a703773
Sylvain Gugger authored Jan 20, 2021

2a703773
fix the backward for deepspeed (#9705) · cd5565be
Stas Bekman authored Jan 20, 2021

cd5565be

Fix Trainer and Args to mention AdamW, not Adam. (#9685) · 538245b0

Gunjan Chhablani authored Jan 20, 2021

* Fix Trainer and Args to mention AdamW, not Adam.

* Update the docs for Training Arguments.

* Change arguments adamw_* to adam_*

* Fixed links to AdamW in TrainerArguments docs

* Fix line length in Training Args docs.

538245b0

Add notebook (#9696) · 88583d49
NielsRogge authored Jan 20, 2021

88583d49

Add DeBERTa head models (#9691) · d1370d29

NielsRogge authored Jan 20, 2021

* Add DebertaForMaskedLM, DebertaForTokenClassification, DebertaForQuestionAnswering

* Add docs and fix quality

* Fix Deberta not having pooler

d1370d29

Fix Funnel Transformer conversion script (#9683) · a7b62fec
Sylvain Gugger authored Jan 20, 2021

a7b62fec

Add t5 convert to transformers-cli (#9654) · 8940c766

acul3 authored Jan 20, 2021

* Update run_mlm.py

* add t5 model to transformers-cli convert

* update rum_mlm.py same as master

* update converting model docs

* update converting model docs

* Update convert.py

* Trigger notification

* update import sorted

* fix typo t5

8940c766

Fix template (#9697) · 7251a473
Julien Plu authored Jan 20, 2021

7251a473

New TF embeddings (cleaner and faster) (#9418) · 14042d56

Julien Plu authored Jan 20, 2021



* Create new embeddings + add to BERT

* Add Albert

* Add DistilBert

* Add Albert + Electra + Funnel

* Add Longformer + Lxmert

* Add last models

* Apply style

* Update the template

* Remove unused imports

* Rename attribute

* Import embeddings in their own model file

* Replace word_embeddings per weight

* fix naming

* Fix Albert

* Fix Albert

* Fix Longformer

* Fix Lxmert Mobilebert and MPNet

* Fix copy

* Fix template

* Update the get weights function

* Update src/transformers/modeling_tf_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/electra/modeling_tf_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* address Sylvain's comments
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

14042d56

Fix label datatype in TF Trainer (#9616) · 12f0d7e8
Julien Plu authored Jan 20, 2021
```
* Fix label datatype

* Apply style
```
12f0d7e8
Add a community page to the docs (#9682) · 76f36e18
Sylvain Gugger authored Jan 20, 2021

76f36e18
Use datasets squad_v2 metric in run_qa (#9677) · 582f516a
Sylvain Gugger authored Jan 20, 2021

582f516a
make RepetitionPenaltyLogitsProcessor faster (#9600) · a98173cc
LSinev authored Jan 20, 2021

a98173cc
Restrain tokenizer.model_max_length default (#9681) · a1ad16a4
Sylvain Gugger authored Jan 20, 2021
```
* Restrain tokenizer.model_max_length default

* Fix indent
```
a1ad16a4

19 Jan, 2021 7 commits

Fix model templates and use less than 119 chars (#9684) · 7e662e6a
Sylvain Gugger authored Jan 19, 2021
```
* Fix model templates and use less than 119 chars

* Missing new line
```
7e662e6a

Add separated decoder_head_mask for T5 Models (#9634) · 2ebbbf55

Daniel Stancl authored Jan 19, 2021

* Add decoder_head_mask for PyTorch T5 model

* Add decoder_head_mask args into T5Model and T5ForConditionalGeneration

* Slightly change the order of input args to be in accordance
with the convention from BART-based models introduced within the PR #9569.

* Make style for modeling_t5.py

* Add decoder_head_mask for TF T5 models

* Separate head_mask and decoder_head_mask args in TF T5 models

* Slightly change the order of input args to follow convention
of BART-based models updated in PR #9569

* Update test_forward_signature tests/test_modeling_tf_common.py
w.r.t. the changed order of input args

* Add FutureWarnings for T5 and TFT5 models

* Add FutureWarnings for T5 and TFT5 models warning a user that
input argument `head_mask` was split into two arguments -
`head_mask` and `decoder_head_mask`

* Add default behaviour - `decoder_head_mask` is set to copy
`head_mask`

* Fix T5 modeling and FutureWarning

* Make proper usage of head_mask and decoder_head_mask
in cross_attention

* Fix conditions for raising FutureWarning

* Reformat FutureWarning in T5 modeling

* Refactor the warning message

2ebbbf55

New run_seq2seq script (#9605) · e4c06ed6

Sylvain Gugger authored Jan 19, 2021



* New run_seq2seq script

* Add tests

* Mark as slow

* Update examples/seq2seq/run_seq2seq.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/data/data_collator.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Update src/transformers/data/data_collator.py
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Address review comments
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

e4c06ed6

Fix TF Flaubert and XLM (#9661) · fa876aee
Julien Plu authored Jan 19, 2021
```
* Fix Flaubert and XLM

* Fix Flaubert and XLM

* Apply style
```
fa876aee

Update integrations.py (#9652) · 11ec7490

max yue authored Jan 20, 2021

File "/share/apps/anaconda3/envs/my_env/lib/python3.7/site-packages/transformers/integrations.py", line 419, in __init__
self._SummaryWriter = SummaryWriter
UnboundLocalError: local variable 'SummaryWriter' referenced before assignment

11ec7490

Update `past_key_values` in GPT-2 (#9596) · b020a736

Yusuke Mori authored Jan 20, 2021



* Update past_key_values in gpt2 (#9391)

* Update generation_utils, and rename some items

* Update modeling_gpt2 to avoid an error in gradient_checkpointing

* Remove 'reorder_cache' from util and add variations to XLNet, TransfoXL, GPT-2

* Change the location of '_reorder_cache' in modeling files

* Add '_reorder_cache' in modeling_ctrl

* Fix a bug of my last commit in CTRL

* Add '_reorder_cache' to GPT2DoubleHeadsModel

* Manage 'use_cache' in config of test_modeling_gpt2

* Clean up the doc string

* Update src/transformers/models/gpt2/modeling_gpt2.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Fix the doc string (GPT-2, CTRL)

* improve gradient_checkpointing_behavior
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

b020a736

Fix old Seq2SeqTrainer (#9675) · 97b787fb
Sylvain Gugger authored Jan 19, 2021

97b787fb