Commits · 077c99bb5fcd7e1f595985f16d0173c210924e11 · chenpangpang / transformers

22 Oct, 2020 16 commits

add zero shot pipeline tags & examples (#7983) · 077c99bb

Joe Davison authored Oct 22, 2020



* add zero shot pipeline tags

* rm default and fix yaml format

* rm DS_Store

* add bart large default

* don't add more typos
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* add multiple multilingual examples

* improve multilingual examples for single-label
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

077c99bb

Only log total_flos at the end of training (#7981) · 06fc3954
Sylvain Gugger authored Oct 22, 2020
```
* Only log total_flos at the end of training

* Fix test
```
06fc3954

FillMaskPipeline: support passing top_k on __call__ (#7971) · ff65beaf

Julien Chaumond authored Oct 22, 2020

* FillMaskPipeline: support passing top_k on __call__

Also move from topk to top_k

* migrate to new param name in tests

* Review from @sgugger

ff65beaf

New run glue script (#7917) · 2e5052d4

Sylvain Gugger authored Oct 22, 2020



* Start simplification

* More progress

* Finished script

* Address comments and update tests instructions

* Wrong test

* Accept files as inputs and fix test

* Update src/transformers/trainer_utils.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Fix labels and add combined score

* Add special labels

* Update TPU command

* Revert to old label strategy

* Use model labels

* Fix for STT-B

* Styling

* Apply suggestions from code review
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

* Code styling

* Fix review comments
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

2e5052d4

Fixing the "translation", "translation_XX_to_YY" pipelines. (#7975) · 18ce6b8f

Nicolas Patry authored Oct 22, 2020



* Actually make the "translation", "translation_XX_to_YY" task behave correctly.

Background:
- Currently "translation_cn_to_ar" does not work. (only 3 pairs are
supported)
- Some models, contain in their config the correct values for the (src,
tgt) pair they can translate. It's usually just one pair, and we can
infer it automatically from the `model.config.task_specific_params`. If
it's not defined we can still probably load the TranslationPipeline
nevertheless.

Proposed fix:
- A simplified version of what could become more general which is
a `parametrized` task. "translation" + (src, tgt) in this instance
it what we need in the general case. The way we go about it for now
is simply parsing "translation_XX_to_YY". If cases of parametrized task arise
we should preferably go in something closer to what `datasets` propose
which is having a secondary argument `task_options`? that will be close
to what that task requires.
- Should be backward compatible in all cases for instance
`pipeline(task="translation_en_to_de") should work out of the box.
- Should provide a warning when a specific translation pair has been
selected on behalf of the user using
`model.config.task_specific_params`.

* Update src/transformers/pipelines.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

18ce6b8f

Remove the else branch adding 0 to the hidden state if token_type_embeds is None. (#7977) · 901e9b8e
Funtowicz Morgan authored Oct 22, 2020
```
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
```
901e9b8e

[PretrainedConfig] Fix save pretrained config for edge case (#7943) · f34372a9

Patrick von Platen authored Oct 22, 2020



* fix config save

* add test

* add config class variable and another test

* line break

* fix fsmt and typo

* god am I making many errors today :-/

* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

f34372a9

adding text classification with DistilBERT/tf notebook (#7964) · cc2e312c

Peter Bayerle authored Oct 22, 2020



Looking at the current community notebooks, it seems that few are targeted for absolute beginners and even fewer are written with TensorFlow. This notebook describes absolutely everything a beginner would need to know, including how to save/load their model and use it for new predictions (this is often omitted in tutorials)
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

cc2e312c

# Add whole word mask support for lm fine-tune (#7925) · a16e568f

wlhgtc authored Oct 22, 2020



* ADD: add whole word mask proxy for both eng and chinese

* MOD: adjust format

* MOD: reformat code

* MOD: update import

* MOD: fix bug

* MOD: add import

* MOD: fix bug

* MOD: decouple code and update readme

* MOD: reformat code

* Update examples/language-modeling/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* change wwm to whole_word_mask

* reformat code

* reformat

* format

* Code quality

* ADD: update chinese ref readme

* MOD: small changes

* MOD: small changes2

* update readme
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

a16e568f

[fsmt test] basic config test with online model + super tiny model (#7860) · 64b4d25c
Stas Bekman authored Oct 22, 2020
```
* basic config test with online model

* typo

* style

* better test
```
64b4d25c
Disable inference API for t5-11b (#7978) · 3479787e
Julien Chaumond authored Oct 22, 2020

3479787e
[model_card] t5-11b move disclaimer to top of page · a7db81c3
Julien Chaumond authored Oct 22, 2020
```
cc @Narsil @patrickvonplaten
```
a7db81c3
support relative path for best_model_checkpoint (#7973) · f774b2e8
Haebin Shin authored Oct 22, 2020

f774b2e8

[testing] slow tests should be marked as slow (#7895) · 83481056

Stas Bekman authored Oct 22, 2020



* slow tests should be slow

* exception note

* style

* integrate LysandreJik's notes with some expansions

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* another slow test

* fix link, and prose

* clarify.

* note from Sam

* typo
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

83481056

Herbert tokenizer auto load (#7968) · 95792a94
rmroczkowski authored Oct 22, 2020

95792a94

added qg evaluation notebook (#7958) · 4abb7ffc

zolekode authored Oct 22, 2020



* added qg evaluation notebook

* Update notebooks/README.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

4abb7ffc

21 Oct, 2020 24 commits

[seq2seq testing] multigpu test run via subprocess (#7281) · 8b381733
Stas Bekman authored Oct 21, 2020
```
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
```
8b381733
[model_cards] camembert: dataset = oscar · f8d3695e
Julien Chaumond authored Oct 21, 2020
```
Hat/tip @pjox
```
f8d3695e
fix 'encode_plus' docstring for 'special_tokens_mask' (0s and 1s were reversed) (#7949) · 16da8771
Evan Pete Walsh authored Oct 21, 2020
```
* fix docstring for 'special_tokens_mask'

* revert auto formatter changes

* revert another auto format

* revert another auto format
```
16da8771
fix test (#7947) · 52decab3
Patrick von Platen authored Oct 21, 2020

52decab3
[ProphetNet] Correct Doc string example (#7944) · 9b6610f7
Patrick von Platen authored Oct 21, 2020
```
* correct xlm prophetnet auto model and examples

* fix line-break docs
```
9b6610f7

TensorBoard/Wandb/optuna/raytune integration improvements. (#7935) · e174bfeb

François Lagunas authored Oct 21, 2020

Improved TensorBoard and Wandb integration, as well as optuna and ray/tune support, with minor modifications to trainer core code.

e174bfeb

Add AI-SOCO models (#7867) · bf162ce8
Ali Hamdi Ali Fadel authored Oct 21, 2020

bf162ce8

Create README.md (#7857) · 58fb25f2

Fangyu Liu authored Oct 21, 2020



* Create README.md

model card for cambridgeltl/BioRedditBERT-uncased.

* Update model_cards/cambridgeltl/BioRedditBERT-uncased/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

58fb25f2

Model card for German BERT fine-tuned for LER/NER (#7855) · 2b07ec78
Manuel Romero authored Oct 21, 2020

2b07ec78
Create README.md (#7819) · 35d2ad5b
MichalPleban authored Oct 21, 2020

35d2ad5b

Create README.md (#7625) · bdda4f22

Wuwei Lan authored Oct 21, 2020



* Create README.md

* Update model_cards/lanwuwei/GigaBERT-v3-Arabic-and-English/README.md

* Update model_cards/lanwuwei/GigaBERT-v3-Arabic-and-English/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

bdda4f22

Add missing comma (#7870) · 8e237496
Manuel Romero authored Oct 21, 2020

8e237496
Create README.md (#7899) · 3eaa007d
Manuel Romero authored Oct 21, 2020

3eaa007d

[model_cards] move hatmimoha/arabic-ner to correct location · 758572ca

Julien Chaumond authored Oct 21, 2020

see https://github.com/huggingface/transformers/commit/16d3cc187ded95946231956460e9004a236474e2 and https://github.com/huggingface/transformers/pull/7836

758572ca

[multiple models] skip saving/loading deterministic state_dict keys (#7878) · 57516c0c

Stas Bekman authored Oct 21, 2020

* make the save_load special key tests common

* handle mbart

* cleaner solution

* fix

* move test_save_load_missing_keys back into fstm for now

* restore

* style

* add marian

* add pegasus

* blenderbot

* revert - no static embed

57516c0c

update model cards of Illuin models (#7930) · 006a1648
quentinheinrich authored Oct 21, 2020

006a1648

model card for arabic-ner model (#7836) · 16d3cc18

hatmimoha authored Oct 21, 2020



* Create README.md

README file for the Arabic NER model

* Update README.md

* Update README.md

* Update hatmimoha/arabic-ner/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

16d3cc18

Add TFBartForConditionalGeneration (#5411) · 82984215

Sam Shleifer authored Oct 21, 2020



* half done

* doc improvement

* Cp test file

* brokedn

* broken test

* undo some mess

* ckpt

* borked

* Halfway

* 6 passing

* boom boom

* Much progress but still 6

* boom boom

* merged master

* 10 passing

* boom boom

* Style

* no t5 changes

* 13 passing

* Integration test failing, but not gibberish

* Frustrated

* Merged master

* 4 fail

* 4 fail

* fix return_dict

* boom boom

* Still only 4

* prepare method

* prepare method

* before delete classif

* Skip tests to avoid adding boilerplate

* boom boom

* fast tests passing

* style

* boom boom

* Switch to supporting many input types

* remove FIXMENORM

* working

* Fixed past_key_values/decoder_cached_states confusion

* new broken test

* Fix attention mask kwarg name

* undo accidental

* Style and reviewers

* style

* Docs and common tests

* Cleaner assert messages

* copy docs

* style issues

* Sphinx fix

* Simplify caching logic

* test does not require torch

* copy _NoLayerEmbedTokens

* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update tests/test_modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Line length and dont document None

* Add pipeline test coverage

* assert msg

* At parity

* Assert messages

* mark slow

* Update compile test

* back in init

* Merge master

* Fix tests
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

82984215

Update README.md · 5cd9e2cb
Patrick von Platen authored Oct 21, 2020

5cd9e2cb
Create README.md · 220b5f97
Patrick von Platen authored Oct 21, 2020

220b5f97
Update README.md · 8ffd7fb1
Patrick von Platen authored Oct 21, 2020

8ffd7fb1
Update README.md · 613ab364
Patrick von Platen authored Oct 21, 2020

613ab364
Update README.md · f7eb17dc
Patrick von Platen authored Oct 21, 2020

f7eb17dc
[ProphetNet] Add Question Generation Model + Test (#7942) · 29792864
Patrick von Platen authored Oct 21, 2020
```
* new prophetnet model

* correct name

* make style
```
29792864