Commits · f20aec1de5c8bb3279db9848003498be71ebe9c6 · chenpangpang / transformers

26 Oct, 2020 8 commits
- fsmt slow test uses lists (#8031) · f20aec1d
  Sam Shleifer authored Oct 26, 2020
  
  f20aec1d
- [docs] [testing] distributed training (#7993) · 101186bc
  Stas Bekman authored Oct 26, 2020
```
* distributed training

* fix

* fix formatting

* wording
```
  101186bc
- Add mixed precision evaluation (#8036) · c153bcc5
  luyug authored Oct 26, 2020
```
* Add mixed precision evaluation

* use original flag
```
  c153bcc5
- Minor typo fixes to the tokenizer summary (#8045) · 9aa28266
  Samuel authored Oct 26, 2020
```
Minor typo fixes to the tokenizer summary
```
  9aa28266
- Remove codecov.yml · 829b9f8c
  Lysandre authored Oct 26, 2020
  
  829b9f8c
- [tokenizers] Fixing #8001 - Adding tests on tokenizers serialization (#8006) · 79eb3915
  Thomas Wolf authored Oct 26, 2020
```
* fixing #8001

* make T5 tokenizer serialization more robust - style
```
  79eb3915
- [model_cards] bert-base-danish Fixup · 7087d9b1
  Julien Chaumond authored Oct 26, 2020
```
#8030
```
  7087d9b1
- Fixup #8025 · efc4a21f
  Julien Chaumond authored Oct 26, 2020
```
Close #8030
```
  efc4a21f
25 Oct, 2020 1 commit
- [Model Card] DJSammy/bert-base-danish-uncased_BotXO,ai (#8025) · 5148f433
  Sam Longenbach authored Oct 25, 2020
```
* Create README.md

* Update README.md
```
  5148f433
24 Oct, 2020 2 commits

[doc prepare_seq2seq_batch] fix docs (#8013) · 38f6739c
Suraj Patil authored Oct 25, 2020

38f6739c

Create model card for pre-trained NLI models. (#7864) · 00602f78

Yixin Nie authored Oct 24, 2020



* Create README.md

* Update model_cards/ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Add Meta information for dataset identifier.
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

00602f78

23 Oct, 2020 11 commits

[Examples] Allow EncoderDecoderModels to be trained with Seq2Seq (#7809) · 3c682ea1

Patrick von Platen authored Oct 23, 2020

* Make Seq2Seq Trainer more similar to Trainer

* fix typo

* fix seq2seq trainer

* remove from tests

* remove lock

* remove train files

* delete test files

* correct typo

* check at init

* make sure trainer is not slowed down on TPU

* correct isort

* remove use cache

* fix use cache

* add last use chache = false

3c682ea1

Create model card for bert-italian-cased-finetuned-pos (#8003) · 59b5953d

Sacha Arbonel authored Oct 23, 2020



* Create README.md

* Update model_cards/sachaarbonel/bert-italian-cased-finetuned-pos/README.md

* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

59b5953d

Add model cards for DynaBERT (#7999) · 6e07c1f4
Zhiqi Huang authored Oct 23, 2020

6e07c1f4
Create README.md (#7997) · 43fdafef
Zhiqi Huang authored Oct 23, 2020

43fdafef
Added model cards for Tagalog ELECTRA models (#7996) · 627e8137
Blaise Cruz authored Oct 23, 2020
```
Co-authored-by: Jan Christian Blaise Cruz <jcblaise@Blaises-MacBook-Pro.local>
```
627e8137

model card for German Sentence Embeddings V2 (#7952) · 9865e1fe

Philip May authored Oct 23, 2020

* model card German Sentence Embeddings V2

- for German RoBERTa for Sentence Embeddings V2
- marked old as outdated

* small correction

* small improvement in description

* small spelling fix

* spelling fix

* add evaluation results

* spearman explanation

* add number of trials

9865e1fe

Handling longformer model_type (#7990) · d39da5a2

Ethan Perez authored Oct 23, 2020

Updating the run_squad training script to handle the "longformer" `model_type`. The longformer is trained in the same was as RoBERTa, so I've added the "longformer" `model_type` (that's the right hugginface name for the LongFormer model, right?) everywhere there was a "roberta" `model_type` reference. The longformer (like RoBERTa) doesn't use `token_type_ids` (as I understand from looking at the [longformer notebook](https://github.com/patil-suraj/Notebooks/blob/master/longformer_qa_training.ipynb), which is what gets updated after this change.

This fix might be related to [this issue](https://github.com/huggingface/transformers/issues/7249) with SQuAD training when using run_squad.py

d39da5a2

Fix BatchEncoding.word_to_tokens for removed tokens (#7939) · 5e323017
Anthony MOI authored Oct 23, 2020

5e323017
[Reformer] remove reformer pad_token_id (#7991) · 4acfd1a8
Patrick von Platen authored Oct 23, 2020
```
* remove reformer pad_token_id

* fix pegasus
```
4acfd1a8

[tests|tokenizers] Refactoring pipelines test backbone - Small tokenizers... · 3a40cdf5

Thomas Wolf authored Oct 23, 2020


[tests|tokenizers] Refactoring pipelines test backbone - Small tokenizers improvements - General tests speedups (#7970)

* WIP refactoring pipeline tests - switching to fast tokenizers

* fix dialog pipeline and fill-mask

* refactoring pipeline tests backbone

* make large tests slow

* fix tests (tf Bart inactive for now)

* fix doc...

* clean up for merge

* fixing tests - remove bart from summarization until there is TF

* fix quality and RAG

* Add new translation pipeline tests - fix JAX tests

* only slow for dialog

* Fixing the missing TF-BART imports in modeling_tf_auto

* spin out pipeline tests in separate CI job

* adding pipeline test to CI YAML

* add slow pipeline tests

* speed up tf and pt join test to avoid redoing all the standalone pt and tf tests

* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* Update src/transformers/pipelines.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/pipelines.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/testing_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add require_torch and require_tf in is_pt_tf_cross_test
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

3a40cdf5

Handle the case when title is None (#7941) · 88b3a91e
Lalit Pagaria authored Oct 23, 2020

88b3a91e

22 Oct, 2020 18 commits

[s2s trainer] tests to use distributed on multi-gpu machine (#7965) · 023f0f37
Stas Bekman authored Oct 22, 2020

023f0f37
change zero shot widget default example (#7992) · 64b24bb3
Joe Davison authored Oct 22, 2020

64b24bb3
Move NoLayerEmbedTokens (#7945) · 0397619a
Sam Shleifer authored Oct 22, 2020
```
* Move NoLayerEmbedTokens

* TFWrappedEmbeddings

* Add comment
```
0397619a
[gh ci] less output ( --durations=50) (#7989) · 5ac07513
Sam Shleifer authored Oct 22, 2020

5ac07513
Reload checkpoint (#7984) · 5ae935d2
Sylvain Gugger authored Oct 22, 2020
```
* Fix checkpoint loading in Trainer

* Fix typo
```
5ae935d2
Fix documentation redirect · 467573dd
Lysandre authored Oct 22, 2020

467573dd

add zero shot pipeline tags & examples (#7983) · 077c99bb

Joe Davison authored Oct 22, 2020



* add zero shot pipeline tags

* rm default and fix yaml format

* rm DS_Store

* add bart large default

* don't add more typos
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* add multiple multilingual examples

* improve multilingual examples for single-label
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

077c99bb

Only log total_flos at the end of training (#7981) · 06fc3954
Sylvain Gugger authored Oct 22, 2020
```
* Only log total_flos at the end of training

* Fix test
```
06fc3954

FillMaskPipeline: support passing top_k on __call__ (#7971) · ff65beaf

Julien Chaumond authored Oct 22, 2020

* FillMaskPipeline: support passing top_k on __call__

Also move from topk to top_k

* migrate to new param name in tests

* Review from @sgugger

ff65beaf

New run glue script (#7917) · 2e5052d4

Sylvain Gugger authored Oct 22, 2020



* Start simplification

* More progress

* Finished script

* Address comments and update tests instructions

* Wrong test

* Accept files as inputs and fix test

* Update src/transformers/trainer_utils.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Fix labels and add combined score

* Add special labels

* Update TPU command

* Revert to old label strategy

* Use model labels

* Fix for STT-B

* Styling

* Apply suggestions from code review
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

* Code styling

* Fix review comments
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

2e5052d4

Fixing the "translation", "translation_XX_to_YY" pipelines. (#7975) · 18ce6b8f

Nicolas Patry authored Oct 22, 2020



* Actually make the "translation", "translation_XX_to_YY" task behave correctly.

Background:
- Currently "translation_cn_to_ar" does not work. (only 3 pairs are
supported)
- Some models, contain in their config the correct values for the (src,
tgt) pair they can translate. It's usually just one pair, and we can
infer it automatically from the `model.config.task_specific_params`. If
it's not defined we can still probably load the TranslationPipeline
nevertheless.

Proposed fix:
- A simplified version of what could become more general which is
a `parametrized` task. "translation" + (src, tgt) in this instance
it what we need in the general case. The way we go about it for now
is simply parsing "translation_XX_to_YY". If cases of parametrized task arise
we should preferably go in something closer to what `datasets` propose
which is having a secondary argument `task_options`? that will be close
to what that task requires.
- Should be backward compatible in all cases for instance
`pipeline(task="translation_en_to_de") should work out of the box.
- Should provide a warning when a specific translation pair has been
selected on behalf of the user using
`model.config.task_specific_params`.

* Update src/transformers/pipelines.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

18ce6b8f

Remove the else branch adding 0 to the hidden state if token_type_embeds is None. (#7977) · 901e9b8e
Funtowicz Morgan authored Oct 22, 2020
```
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>
```
901e9b8e

[PretrainedConfig] Fix save pretrained config for edge case (#7943) · f34372a9

Patrick von Platen authored Oct 22, 2020



* fix config save

* add test

* add config class variable and another test

* line break

* fix fsmt and typo

* god am I making many errors today :-/

* Update src/transformers/configuration_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

f34372a9

adding text classification with DistilBERT/tf notebook (#7964) · cc2e312c

Peter Bayerle authored Oct 22, 2020



Looking at the current community notebooks, it seems that few are targeted for absolute beginners and even fewer are written with TensorFlow. This notebook describes absolutely everything a beginner would need to know, including how to save/load their model and use it for new predictions (this is often omitted in tutorials)
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

cc2e312c

# Add whole word mask support for lm fine-tune (#7925) · a16e568f

wlhgtc authored Oct 22, 2020



* ADD: add whole word mask proxy for both eng and chinese

* MOD: adjust format

* MOD: reformat code

* MOD: update import

* MOD: fix bug

* MOD: add import

* MOD: fix bug

* MOD: decouple code and update readme

* MOD: reformat code

* Update examples/language-modeling/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update examples/language-modeling/run_language_modeling.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* change wwm to whole_word_mask

* reformat code

* reformat

* format

* Code quality

* ADD: update chinese ref readme

* MOD: small changes

* MOD: small changes2

* update readme
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>

a16e568f

[fsmt test] basic config test with online model + super tiny model (#7860) · 64b4d25c
Stas Bekman authored Oct 22, 2020
```
* basic config test with online model

* typo

* style

* better test
```
64b4d25c
Disable inference API for t5-11b (#7978) · 3479787e
Julien Chaumond authored Oct 22, 2020

3479787e
[model_card] t5-11b move disclaimer to top of page · a7db81c3
Julien Chaumond authored Oct 22, 2020
```
cc @Narsil @patrickvonplaten
```
a7db81c3