Commits · e174bfeb340d3d3468d9c8eebce95c42aa2dcf84 · chenpangpang / transformers

21 Oct, 2020 19 commits

TensorBoard/Wandb/optuna/raytune integration improvements. (#7935) · e174bfeb

François Lagunas authored Oct 21, 2020

Improved TensorBoard and Wandb integration, as well as optuna and ray/tune support, with minor modifications to trainer core code.

e174bfeb

Add AI-SOCO models (#7867) · bf162ce8
Ali Hamdi Ali Fadel authored Oct 21, 2020

bf162ce8

Create README.md (#7857) · 58fb25f2

Fangyu Liu authored Oct 21, 2020



* Create README.md

model card for cambridgeltl/BioRedditBERT-uncased.

* Update model_cards/cambridgeltl/BioRedditBERT-uncased/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

58fb25f2

Model card for German BERT fine-tuned for LER/NER (#7855) · 2b07ec78
Manuel Romero authored Oct 21, 2020

2b07ec78
Create README.md (#7819) · 35d2ad5b
MichalPleban authored Oct 21, 2020

35d2ad5b

Create README.md (#7625) · bdda4f22

Wuwei Lan authored Oct 21, 2020



* Create README.md

* Update model_cards/lanwuwei/GigaBERT-v3-Arabic-and-English/README.md

* Update model_cards/lanwuwei/GigaBERT-v3-Arabic-and-English/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

bdda4f22

Add missing comma (#7870) · 8e237496
Manuel Romero authored Oct 21, 2020

8e237496
Create README.md (#7899) · 3eaa007d
Manuel Romero authored Oct 21, 2020

3eaa007d

[model_cards] move hatmimoha/arabic-ner to correct location · 758572ca

Julien Chaumond authored Oct 21, 2020

see https://github.com/huggingface/transformers/commit/16d3cc187ded95946231956460e9004a236474e2 and https://github.com/huggingface/transformers/pull/7836

758572ca

[multiple models] skip saving/loading deterministic state_dict keys (#7878) · 57516c0c

Stas Bekman authored Oct 21, 2020

* make the save_load special key tests common

* handle mbart

* cleaner solution

* fix

* move test_save_load_missing_keys back into fstm for now

* restore

* style

* add marian

* add pegasus

* blenderbot

* revert - no static embed

57516c0c

update model cards of Illuin models (#7930) · 006a1648
quentinheinrich authored Oct 21, 2020

006a1648

model card for arabic-ner model (#7836) · 16d3cc18

hatmimoha authored Oct 21, 2020



* Create README.md

README file for the Arabic NER model

* Update README.md

* Update README.md

* Update hatmimoha/arabic-ner/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

16d3cc18

Add TFBartForConditionalGeneration (#5411) · 82984215

Sam Shleifer authored Oct 21, 2020



* half done

* doc improvement

* Cp test file

* broken

* broken test

* undo some mess

* ckpt

* borked

* Halfway

* 6 passing

* boom boom

* Much progress but still 6

* boom boom

* merged master

* 10 passing

* boom boom

* Style

* no t5 changes

* 13 passing

* Integration test failing, but not gibberish

* Frustrated

* Merged master

* 4 fail

* 4 fail

* fix return_dict

* boom boom

* Still only 4

* prepare method

* prepare method

* before delete classif

* Skip tests to avoid adding boilerplate

* boom boom

* fast tests passing

* style

* boom boom

* Switch to supporting many input types

* remove FIXMENORM

* working

* Fixed past_key_values/decoder_cached_states confusion

* new broken test

* Fix attention mask kwarg name

* undo accidental

* Style and reviewers

* style

* Docs and common tests

* Cleaner assert messages

* copy docs

* style issues

* Sphinx fix

* Simplify caching logic

* test does not require torch

* copy _NoLayerEmbedTokens

* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update tests/test_modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/modeling_tf_bart.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Line length and dont document None

* Add pipeline test coverage

* assert msg

* At parity

* Assert messages

* mark slow

* Update compile test

* back in init

* Merge master

* Fix tests
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

82984215

Update README.md · 5cd9e2cb
Patrick von Platen authored Oct 21, 2020

5cd9e2cb
Create README.md · 220b5f97
Patrick von Platen authored Oct 21, 2020

220b5f97
Update README.md · 8ffd7fb1
Patrick von Platen authored Oct 21, 2020

8ffd7fb1
Update README.md · 613ab364
Patrick von Platen authored Oct 21, 2020

613ab364
Update README.md · f7eb17dc
Patrick von Platen authored Oct 21, 2020

f7eb17dc
[ProphetNet] Add Question Generation Model + Test (#7942) · 29792864
Patrick von Platen authored Oct 21, 2020

* new prophetnet model

* correct name

* make style

29792864

20 Oct, 2020 14 commits

PPL guide minor code snippet fix (#7938) · 13842e41
Joe Davison authored Oct 20, 2020

13842e41
[s2s] create doc for pegasus/fsmt replication (#7934) · 0e24e4c1
Stas Bekman authored Oct 20, 2020

0e24e4c1
Respect the 119 line chars (#7928) · 96f4828a
Lysandre Debut authored Oct 20, 2020

96f4828a
Docs for v3.4.0 · ef0ac063
Lysandre authored Oct 20, 2020

ef0ac063
Release: v3.4.0 · eb0e0ce2
Lysandre authored Oct 20, 2020

eb0e0ce2
Update README.md · 02640486
Patrick von Platen authored Oct 20, 2020

02640486
add summary (#7927) · ffd675b4
Patrick von Platen authored Oct 20, 2020

ffd675b4

labels and decoder_input_ids to Glossary (#7906) · 5547b40b

Lysandre Debut authored Oct 20, 2020



* labels and decoder_input_ids to Glossary

* Formatting fixes

* Update docs/source/glossary.rst
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* sam's comments
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

5547b40b

Add note for WikiSplit · f3312515
Patrick von Platen authored Oct 20, 2020

f3312515
Fix EncoderDecoder WikiSplit Example · 0724c0f3
Patrick von Platen authored Oct 20, 2020

0724c0f3

[flax] fix repo_check (#7914) · ca37db05

Stas Bekman authored Oct 20, 2020

* [flax] fix repo_check

Unless this is actually a problem, this adds `modeling_flax_utils` to the ignore list; otherwise the check currently expects a 'tests/test_modeling_flax_utils.py' for it.
For context, please see: https://github.com/huggingface/transformers/pull/3722#issuecomment-712360415

* fix 2 more issues

* merge https://github.com/huggingface/transformers/pull/7919/

ca37db05

Fix bug in _sorted_checkpoints (#7880) · 048dd6cf

Shai Erera authored Oct 20, 2020

I'm using transformers 3.3.1 and ran a training script with `--save_total_limit 3`. I hit the exception below and, after debugging, found that the code wrongly tries to index into the `best_model_checkpoint` *str* rather than the `sorted_checkpoints` list. Without the fix I got this exception:

```
Traceback (most recent call last):
  File "/<HOME>/.conda/envs/transformers/lib/python3.7/site-packages/transformers/trainer.py", line 921, in _save_training
    self._rotate_checkpoints(use_mtime=True)
  File "/<HOME>/.conda/envs/transformers/lib/python3.7/site-packages/transformers/trainer.py", line 1283, in _rotate_checkpoints
    checkpoints_sorted = self._sorted_checkpoints(use_mtime=use_mtime)
  File "/<HOME>/.conda/envs/transformers/lib/python3.7/site-packages/transformers/trainer.py", line 1274, in _sorted_checkpoints
    checkpoints_sorted[best_model_index],
TypeError: 'str' object does not support item assignment
```
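
The failure mode described above can be reproduced outside the trainer. The sketch below is an illustration only, not the code from `trainer.py`: the function names are hypothetical, and the checkpoint names are made up. It shows a swap whose second target indexes into a list element (a `str`) instead of the list, next to a swap that targets two list slots.

```python
from typing import List


def move_best_checkpoint_last(checkpoints_sorted: List[str], best_model_checkpoint: str) -> List[str]:
    """Return a copy of the list with the best checkpoint swapped into the last slot."""
    result = list(checkpoints_sorted)
    best_model_index = result.index(best_model_checkpoint)
    # Both assignment targets are slots of the list itself.
    result[best_model_index], result[-1] = result[-1], result[best_model_index]
    return result


def buggy_swap(checkpoints_sorted: List[str], best_model_checkpoint: str) -> None:
    """Hypothetical reproduction: the second target indexes into a str element."""
    result = list(checkpoints_sorted)
    best_model_index = result.index(best_model_checkpoint)
    # result[best_model_index] is a str, so assigning to its [-1] item fails.
    result[best_model_index], result[best_model_index][-1] = (
        result[-1],
        result[best_model_index],
    )


checkpoints = ["checkpoint-100", "checkpoint-200", "checkpoint-300"]
print(move_best_checkpoint_last(checkpoints, "checkpoint-100"))

try:
    buggy_swap(checkpoints, "checkpoint-100")
except TypeError as err:
    print(f"TypeError: {err}")
```

Keeping the best checkpoint in the last slot presumably protects it from rotation when old checkpoints are deleted; that intent is inferred from the description, not confirmed here.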

048dd6cf

Add Flax dummy objects (#7918) · 6d4f8bd0
Sylvain Gugger authored Oct 20, 2020

6d4f8bd0

[testing] rename skip targets + docs (#7863) · 3e31e7f9

Stas Bekman authored Oct 20, 2020



* rename skip targets + docs

* fix quotes

* style

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* small improvements

* fix
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

3e31e7f9

19 Oct, 2020 7 commits

[EncoderDecoder] Fix Typo (#7915) · c912ba5f
Patrick von Platen authored Oct 19, 2020

* fix encoder decoder models

* add .gitignore

c912ba5f
Raise error when using AMP on non-CUDA device (#7869) · 55bcd0cb
Bram Vanroy authored Oct 19, 2020

* Raise error when using AMP on non-CUDA device

* make style

* make style

55bcd0cb
fix t5 training docstring (#7911) · e3d2bee8
Patrick von Platen authored Oct 19, 2020

e3d2bee8

`decoder_config` used before initialisation (#7903) · df1ddced

Ayub Subhaniya authored Oct 19, 2020

Seeing an error when passing `decoder_config` as a parameter while initializing an encoder-decoder model from pretrained.
Fixed "UnboundLocalError: local variable 'decoder_config' referenced before assignment".
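
For readers unfamiliar with this error class, here is a minimal sketch of how it typically arises. It is not the transformers code: `load_default_config`, `build_decoder_buggy`, and `build_decoder_fixed` are hypothetical names used only for illustration. A local variable is assigned in one branch only, so the other path reaches its first use without an assignment.

```python
from typing import Optional


def load_default_config() -> dict:
    # Hypothetical stand-in for loading a configuration from a checkpoint.
    return {"is_decoder": False}


def build_decoder_buggy(config: Optional[dict] = None) -> dict:
    if config is None:
        decoder_config = load_default_config()
    # When the caller supplies `config`, decoder_config was never assigned here.
    decoder_config["is_decoder"] = True
    return decoder_config


def build_decoder_fixed(config: Optional[dict] = None) -> dict:
    # Assign on every path, so the name is always bound before use.
    decoder_config = load_default_config() if config is None else dict(config)
    decoder_config["is_decoder"] = True
    return decoder_config


try:
    build_decoder_buggy({"hidden_size": 8})
except UnboundLocalError as err:
    print(f"UnboundLocalError: {err}")

print(build_decoder_fixed({"hidden_size": 8}))
```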

df1ddced

Allow Custom Dataset in RAG Retriever (#7763) · 033f29c6

Quentin Lhoest authored Oct 19, 2020

* add CustomHFIndex

* typo in config

* update tests

* add custom dataset example

* clean script

* update test data

* minor in test

* docs

* docs

* style

* fix imports

* allow to pass the indexed dataset directly

* update tests

* use multiset DPR

* address thom and patrick's comments

* style

* update dpr tokenizer

* add output_dir flag in use_own_knowledge_dataset.py

* allow custom datasets in examples/rag/finetune.py

* add test for custom dataset in distributed rag retriever

033f29c6

Trainer with Iterable Dataset (#7858) · a09fe140

Julien Rossi authored Oct 19, 2020

* fix 5990

* accommodate iterable dataset without predefined length
* set it as 1 use case: provide max_steps, and NO num_epochs
* Is a merge of master and PR 5995

* fix trainer test under TF

* fix only for torch
* TF trainer untouched
* trainer tests are skipped when no torch

* address comments

* fix quality checks

* remove torch.dataset from test_trainer

* unnecessary inheritance
* RegressionDataset implements all needed methods __len__ and __getitem__

* fix quality checks

* restore RegressionDataset

* was wrongly under is_torch_available()
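
The use case named above (a dataset with no predefined length, bounded by `max_steps` rather than a number of epochs) can be sketched in plain Python. This is an assumption-laden illustration, not the Trainer's implementation: `stream_examples` and `train_for_max_steps` are hypothetical names, and the loop body is a placeholder for a real training step.

```python
from itertools import islice
from typing import Iterable, Iterator


def stream_examples() -> Iterator[int]:
    """Stand-in for an iterable dataset: it yields items but defines no __len__."""
    n = 0
    while True:
        yield n
        n += 1


def train_for_max_steps(dataset: Iterable[int], max_steps: int) -> int:
    """Consume at most max_steps items and return the number of steps taken."""
    if max_steps <= 0:
        raise ValueError("max_steps must be positive when the dataset has no length")
    steps = 0
    for _example in islice(dataset, max_steps):
        steps += 1  # a real loop would run a forward/backward pass here
    return steps


print(train_for_max_steps(stream_examples(), max_steps=5))
```

Because the stream never reports a length, the step budget is the only stopping condition, which is why an epoch count cannot be used in this setting.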

a09fe140

ProphetNet (#7157) · 2422cda0

Weizhen authored Oct 19, 2020



* add new model prophetnet

prophetnet modified

modify codes as suggested v1

add prophetnet test files

* still bugs, because of changed output formats of encoder and decoder

* move prophetnet into the latest version

* clean integration tests

* clean tokenizers

* add xlm config to init

* correct typo in init

* further refactoring

* continue refactor

* save parallel

* add decoder_attention_mask

* fix use_cache vs. past_key_values

* fix common tests

* change decoder output logits

* fix xlm tests

* make common tests pass

* change model architecture

* add tokenizer tests

* finalize model structure

* no weight mapping

* correct n-gram stream attention mask as discussed with qweizhen

* remove unused import

* fix index.rst

* fix tests

* delete unnecessary code

* add fast integration test

* rename weights

* final weight remapping

* save intermediate

* Descriptions for Prophetnet Config File

* finish all models

* finish new model outputs

* delete unnecessary files

* refactor encoder layer

* add dummy docs

* code quality

* fix tests

* add model pages to doctree

* further refactor

* more refactor, more tests

* finish code refactor and tests

* remove unnecessary files

* further clean up

* add docstring template

* finish tokenizer doc

* finish prophetnet

* fix copies

* fix typos

* fix tf tests

* fix fp16

* fix tf test 2nd try

* fix code quality

* add test for each model

* merge new tests to branch

* Update model_cards/microsoft/prophetnet-large-uncased-cnndm/README.md
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* Update model_cards/microsoft/prophetnet-large-uncased-cnndm/README.md
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* Update src/transformers/modeling_prophetnet.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* Update utils/check_repo.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* apply sams and sylvains comments

* make style

* remove unnecessary code

* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update README.md
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/configuration_prophetnet.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* implement lysandres comments

* correct docs

* fix isort

* fix tokenizers

* fix copies
Co-authored-by: weizhen <weizhen@mail.ustc.edu.cn>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

2422cda0