- 26 Oct, 2020 25 commits
-
Sylvain Gugger authored
* Fixes in preparation for doc styling
* More fixes
* Better syntax
* Fixes
* Style
* More fixes
* More fixes
-
Philip May authored
* mc for new cross lingual sentence model
* bold text
* url spelling fix
* more url spelling fixes
* slight thanks change
* small improvements in text
* multilingual word exchange
* change colab link
* xval fold number
* add model links
* line break in model names
* Update README.md
* Update README.md
* new examples link
* new examples link
* add evaluation dataset name
* add more about multi lingual
* typo fix
* typo
* typos
* hyperparameter typos
* hyperparameter typo
* add metadata
* add metadata
* Update README.md
* typo fix
* Small improvement
-
Lysandre Debut authored
-
Sam Shleifer authored
-
Stas Bekman authored
-
Lysandre Debut authored
-
Patrick von Platen authored
* make sure padding is implemented for models without padding tokens as well (a sketch of the idea follows this entry)
* add better error message
* add better warning
* remove results files
* Update examples/seq2seq/seq2seq_trainer.py
* remove unnecessary copy line
* correct usage of labels
* delete test files
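A minimal sketch of the padding logic described above, assuming PyTorch; the function name and error message are illustrative, not the exact patch:

    import torch

    def pad_tensors_to_max_len(tensor: torch.Tensor, max_length: int, pad_token_id) -> torch.Tensor:
        # Right-pad generated token ids to a common length so predictions
        # and labels can be compared elementwise during evaluation.
        if pad_token_id is None:
            # Models without a pad token should fail with a clear message
            # rather than being padded with an arbitrary id.
            raise ValueError(
                "Padding requires a pad_token_id; set it on the model config "
                "or tokenizer before evaluating with generation."
            )
        padded = tensor.new_full((tensor.shape[0], max_length), pad_token_id)
        padded[:, : tensor.shape[-1]] = tensor
        return padded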
-
mohammadreza-Banaei73 authored
`--wwm` can't be used as an argument to run_language_modeling.py and should be changed to `--whole_word_mask`.
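For reference, a minimal sketch of what the `--whole_word_mask` flag enables, assuming a BERT-style tokenizer (the model name and masking probability below are illustrative):

    from transformers import AutoTokenizer, DataCollatorForWholeWordMask

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

    # Whole-word masking masks every word-piece of a selected word together,
    # instead of masking individual sub-tokens independently.
    data_collator = DataCollatorForWholeWordMask(
        tokenizer=tokenizer, mlm_probability=0.15
    )
    # The script passes a collator like this to the Trainer via
    # Trainer(..., data_collator=data_collator).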
-
Joe Davison authored
-
Yusuke Mori authored
-
Samuel authored
* Fix minor typos
  Fix minor typos in the docs.
* Update docs/source/preprocessing.rst
  Clearer data structure description.
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Joe Davison authored
-
noise-field authored
* Add MLflow integration class
  Add integration code for MLflow in integrations.py, along with the code that checks that MLflow is installed.
* Add MLflowCallback import
  Add import of MLflowCallback in trainer.py.
* Handle model argument
  Allow the callback to handle the model argument and store model config items as hyperparameters.
* Log parameters to MLflow in batches
  MLflow cannot log more than a hundred parameters at once, so the parameters are split into batches of 100 items and logged batch by batch (sketched after this entry).
* Fix style
* Add docs on MLflow callback
* Fix issue with unfinished runs
  The "fluent" API used in the MLflow integration allows only one run to be active at any given moment. If a Trainer is disposed of before training finishes and a new one is created, MLflow will refuse to log the results for the next trainer.
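A minimal sketch of the batching idea, assuming the standard `mlflow` fluent API; the helper name and the 100-item constant mirror the description above:

    import mlflow

    MAX_PARAMS_PER_BATCH = 100  # MLflow rejects log_params calls larger than this

    def log_params_in_batches(params: dict) -> None:
        # Split the flattened hyperparameters into chunks of at most 100
        # items and log each chunk with a separate log_params call.
        items = list(params.items())
        for i in range(0, len(items), MAX_PARAMS_PER_BATCH):
            mlflow.log_params(dict(items[i : i + MAX_PARAMS_PER_BATCH]))

    # Usage: call inside an active run (mlflow.start_run()); MLflow stores
    # parameter values as strings.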
-
Lysandre Debut authored
-
Sylvain Gugger authored
-
Sam Shleifer authored
-
suliuzh authored
-
Sam Shleifer authored
-
Stas Bekman authored
* distributed training
* fix
* fix formatting
* wording
-
luyug authored
* Add mixed precision evaluation
* use original flag
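As an illustration of mixed-precision evaluation (not the exact patch), a sketch in PyTorch; `model` and `dataloader` are placeholders, and batches are assumed to contain labels so the model returns a loss:

    import torch

    def evaluate_fp16(model, dataloader, device="cuda"):
        # Run forward passes under automatic mixed precision; no GradScaler
        # is needed because no gradients are computed during evaluation.
        model.eval()
        losses = []
        with torch.no_grad():
            for batch in dataloader:
                batch = {k: v.to(device) for k, v in batch.items()}
                with torch.cuda.amp.autocast():
                    outputs = model(**batch)
                losses.append(outputs.loss.float())
        return torch.stack(losses).mean().item()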
-
Samuel authored
Minor typo fixes to the tokenizer summary
-
Lysandre authored
-
Thomas Wolf authored
* fixing #8001
* make T5 tokenizer serialization more robust - style
-
Julien Chaumond authored
#8030
-
Julien Chaumond authored
Close #8030
-
- 25 Oct, 2020 1 commit
-
Sam Longenbach authored
* Create README.md
* Update README.md
-
- 24 Oct, 2020 2 commits
-
Suraj Patil authored
-
Yixin Nie authored
* Create README.md
* Update model_cards/ynie/roberta-large-snli_mnli_fever_anli_R1_R2_R3-nli/README.md
  Co-authored-by: Julien Chaumond <chaumond@gmail.com>
* Add Meta information for dataset identifier.
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
- 23 Oct, 2020 11 commits
-
Patrick von Platen authored
* Make Seq2Seq Trainer more similar to Trainer
* fix typo
* fix seq2seq trainer
* remove from tests
* remove lock
* remove train files
* delete test files
* correct typo
* check at init
* make sure trainer is not slowed down on TPU
* correct isort
* remove use cache
* fix use cache
* add last use_cache = False
-
Sacha Arbonel authored
* Create README.md
* Update model_cards/sachaarbonel/bert-italian-cased-finetuned-pos/README.md
* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Zhiqi Huang authored
-
Zhiqi Huang authored
-
Blaise Cruz authored
Co-authored-by: Jan Christian Blaise Cruz <jcblaise@Blaises-MacBook-Pro.local>
-
Philip May authored
* model card German Sentence Embeddings V2
  - for German RoBERTa for Sentence Embeddings V2
  - marked old as outdated
* small correction
* small improvement in description
* small spelling fix
* spelling fix
* add evaluation results
* spearman explanation
* add number of trials
-
Ethan Perez authored
Updating the run_squad training script to handle the "longformer" `model_type`. The Longformer is trained in the same way as RoBERTa, so I've added the "longformer" `model_type` (that's the right Hugging Face name for the Longformer model, right?) everywhere there was a "roberta" `model_type` reference. The Longformer (like RoBERTa) doesn't use `token_type_ids` (as I understand from looking at the [longformer notebook](https://github.com/patil-suraj/Notebooks/blob/master/longformer_qa_training.ipynb)), which is what gets updated by this change. This fix might be related to [this issue](https://github.com/huggingface/transformers/issues/7249) with SQuAD training when using run_squad.py.
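A hedged sketch of the kind of edit described: run_squad.py drops `token_type_ids` for model types that ignore them, and "longformer" joins that list (the exact list and helper below are illustrative, not the literal patch):

    # Model types whose tokenizers emit token_type_ids the model ignores;
    # "longformer" is added alongside "roberta" because it reuses RoBERTa's
    # single-segment embedding scheme.
    NO_SEGMENT_ID_MODELS = {"roberta", "camembert", "bart", "longformer"}

    def build_inputs(model_type, input_ids, attention_mask, token_type_ids):
        inputs = {
            "input_ids": input_ids,
            "attention_mask": attention_mask,
            "token_type_ids": token_type_ids,
        }
        if model_type in NO_SEGMENT_ID_MODELS:
            del inputs["token_type_ids"]  # these models don't use segment ids
        return inputs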
-
Anthony MOI authored
-
Patrick von Platen authored
* remove reformer pad_token_id
* fix pegasus
-
Thomas Wolf authored
[tests|tokenizers] Refactoring pipelines test backbone - Small tokenizers improvements - General tests speedups (#7970)
* WIP refactoring pipeline tests - switching to fast tokenizers
* fix dialog pipeline and fill-mask
* refactoring pipeline tests backbone
* make large tests slow
* fix tests (tf Bart inactive for now)
* fix doc...
* clean up for merge
* fixing tests - remove bart from summarization until there is TF
* fix quality and RAG
* Add new translation pipeline tests - fix JAX tests
* only slow for dialog
* Fixing the missing TF-BART imports in modeling_tf_auto
* spin out pipeline tests in separate CI job
* adding pipeline test to CI YAML
* add slow pipeline tests
* speed up tf and pt join test to avoid redoing all the standalone pt and tf tests
* Update src/transformers/tokenization_utils_base.py
  Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
* Update src/transformers/pipelines.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/pipelines.py
  Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/testing_utils.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* add require_torch and require_tf in is_pt_tf_cross_test
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Lalit Pagaria authored
-
- 22 Oct, 2020 1 commit
-
Stas Bekman authored
-