- 18 Nov, 2020 15 commits
-
-
Vishal Singh authored
Modified the Model in Action section. The class `AutoModelWithLMHead` is deprecated, so it was replaced with `AutoModelForSeq2SeqLM` for encoder-decoder models. Also removed a duplicate eos token.
-
smanjil authored
* replace performance table with markdown
* Update model_cards/smanjil/German-MedBERT/README.md

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
hhou435 authored
-
Sylvain Gugger authored
-
Sylvain Gugger authored
-
Patrick von Platen authored
* improve summary
* small fixes
* cleaned line length
* correct "" formatting
* apply Sylvain's suggestions
-
Nicola De Cao authored
* Adding PrefixConstrainedLogitsProcessor
* fixing RAG and style_doc
* fixing black (v20 instead of v19)
* Improving doc in generation_logits_process.py
* Improving docs and typing in generation_utils.py
* docs improvement
* adding test and fixing doc typo
* fixing doc_len
* isort on test
* fixed test
* improve docstring a bit

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
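The constrained-generation idea can be sketched without the transformers API: a user callback decides which token ids are legal after the current prefix, and every other token's score is masked to -inf. Function name and shapes below are illustrative, not the actual `PrefixConstrainedLogitsProcessor` signature.

```python
import math

def prefix_constrained_mask(scores, prefix_ids, allowed_fn):
    # Mask every token the callback does not allow for this prefix to -inf,
    # so softmax assigns it zero probability (illustrative sketch only).
    allowed = set(allowed_fn(prefix_ids))
    return [s if tok in allowed else -math.inf
            for tok, s in enumerate(scores)]

# Toy vocabulary of 5 tokens; after prefix [0], only tokens 1 and 2 are legal.
masked = prefix_constrained_mask([0.5, 1.0, 2.0, 3.0, 4.0], [0],
                                 lambda prefix: [1, 2])
# masked → [-inf, 1.0, 2.0, -inf, -inf]
```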
-
Julien Plu authored
* New TF loading weights
* apply style
* Better naming
* Largely comment the loading method
* Apply style
* Address Patrick's comments
* Remove useless line of code
* Update Docstring
* Address Sylvain's and Lysandre's comments
* Simplify the names computation
* Typos
-
Ratthachat (Jung) authored
(one line typo)
-
Stas Bekman authored
-
Stas Bekman authored
-
Michał Pogoda authored
The multiline string informing about a missing PyTorch/TensorFlow installation was missing a space.
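This bug class is easy to reproduce: Python joins adjacent string literals with no separator, so a multiline message needs an explicit space at each break. The strings below are illustrative, not the actual error message.

```python
# Adjacent string literals concatenate with no separator.
broken = (
    "PyTorch was not found."
    "Install it to use this model."  # renders as "found.Install"
)
fixed = (
    "PyTorch was not found. "  # trailing space restores the word break
    "Install it to use this model."
)
```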
-
Sylvain Gugger authored
-
Benjamin Minixhofer authored
* make tr_loss regular float
* Revert "make tr_loss regular float" (this reverts commit c9d7ccfaf0c4387187b0841694f01ec0ffd5f4ba)
* reset loss at each logging step
* keep track of total loss with _total_loss_scalar
* add remaining tr_loss at the end
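The logging scheme those items describe can be sketched in plain Python: a float `tr_loss` accumulates between logs and is reset at each logging step, while a separate scalar keeps the grand total and absorbs the leftover at the end. Names mirror the commit message; the loop is a simplification, not Trainer's actual code.

```python
def run_training(losses, logging_steps):
    tr_loss = 0.0            # running loss, reset at each logging step
    total_loss_scalar = 0.0  # grand total across the whole run
    logged = []
    for step, loss in enumerate(losses, start=1):
        tr_loss += loss
        if step % logging_steps == 0:
            logged.append(tr_loss / logging_steps)  # average since last log
            total_loss_scalar += tr_loss
            tr_loss = 0.0
    total_loss_scalar += tr_loss  # add remaining tr_loss at the end
    return logged, total_loss_scalar

logged, total = run_training([1.0, 3.0, 2.0], logging_steps=2)
# logged → [2.0], total → 6.0
```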
-
cronoik authored
-
- 17 Nov, 2020 14 commits
-
-
Sylvain Gugger authored
-
Caitlin Ostroff authored
* Add Harry Potter Model
* Update model_cards/ceostroff/harry-potter-gpt2-fanfiction/README.md
* Update model_cards/ceostroff/harry-potter-gpt2-fanfiction/README.md
* Update model_cards/ceostroff/harry-potter-gpt2-fanfiction/README.md

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Sylvain Gugger authored
* Remove old deprecated arguments
* Remove needless imports
* Fix tests

Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
-
Lysandre Debut authored
* Tokenizers should be framework agnostic
* Run the slow tests
* Not testing
* Fix documentation
* Apply suggestions from code review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
-
Sylvain Gugger authored
-
Stas Bekman authored
-
Sylvain Gugger authored
* First fixes
* Fix imports and add init
* Fix typo
* Move init to final dest
* Fix tokenization import
* More fixes
* Styling
-
Julien Chaumond authored
* <small>tiny typo</small>
* Tokenizers: ability to load from model subfolder
* use subfolder for local files as well
* Uniformize model shortcut name => model id
* from s3 => from huggingface.co

Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>
-
Sylvain Gugger authored
-
sgugger authored
-
Patrick von Platen authored
* add docs
* make style
-
Patrick von Platen authored
* add mt5 and t5v1_1 model
* fix tests
* correct some imports
* add tf model
* finish tf t5
* improve examples
* fix copies
* clean doc
-
fajri91 authored
-
Sylvain Gugger authored
* Put models in subfolders
* Styling
* Fix imports in tests
* More fixes in test imports
* Sneaky hidden imports
* Fix imports in doc files
* More sneaky imports
* Finish fixing tests
* Fix examples
* Fix path for copies
* More fixes for examples
* Fix dummy files
* More fixes for example
* More model import fixes
* Is this why you're unhappy GitHub?
* Fix imports in convert command
-
- 16 Nov, 2020 10 commits
-
-
Julien Plu authored
* Fix mixed precision issue for GPT2
* Forgot one cast
* oops
* Forgotten casts
-
Sylvain Gugger authored
* Use the CI to identify failing tests
* Remove from all examples and tests
* More default switch
* Fixes
* More test fixes
* More fixes
* Last fixes hopefully
* Run on the real suite
* Fix slow tests
-
Sylvain Gugger authored
-
LSinev authored
* Fix passing token_type_ids during GPT2DoubleHeadsModel.generate() if used, and for GPT2LMHeadModel too
* Update tests to check token_type_ids usage in GPT2 models
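The fix can be illustrated in isolation: at each generation step the model consumes one new token, so `token_type_ids` must be extended by repeating the last segment id. This is a simplified stand-in for what `prepare_inputs_for_generation` handles, not the actual GPT2 code.

```python
def extend_token_type_ids(token_type_ids):
    # Each newly generated token belongs to the same segment as the last
    # input token, so append the final segment id once per step.
    return token_type_ids + [token_type_ids[-1]]

ids = [0, 0, 1, 1]
for _ in range(2):  # two generation steps
    ids = extend_token_type_ids(ids)
# ids → [0, 0, 1, 1, 1, 1]
```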
-
Yusuke Mori authored
* Simply insert T5Tokenizer's prepare_seq2seq_batch
* Update/Add some 'import'
* fix RuntimeError caused by '.view'
* Moves .view related error avoidance from seq2seq_trainer to inside prophetnet
* Update test_tokenization_prophetnet.py
* Format the test code with black
* Re-format the test code
* Update test_tokenization_prophetnet.py
* Add importing require_torch in the test code
* Add importing BatchEncoding in the test code
* Re-format the test code on Colab
-
Stas Bekman authored
* [doc] typo fix @sgugger
* Update src/transformers/modeling_utils.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Branden Chan authored
-
Mehrdad Farahani authored
-
Mehrdad Farahani authored
-
zhezhaoa authored
* Create README.md
* Update model_cards/uer/chinese_roberta_L-2_H-128/README.md

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
- 15 Nov, 2020 1 commit
-
-
Thomas Wolf authored
[breaking|pipelines|tokenizers] Adding slow-fast tokenizers equivalence tests pipelines - Removing sentencepiece as a required dependency (#8073)
* Fixing roberta for slow-fast tests
* WIP getting equivalence on pipelines
* slow-to-fast equivalence - working on question-answering pipeline
* optional FAISS tests
* Pipeline Q&A
* Move pipeline tests to their own test job again
* update tokenizer to add sequence id methods
* update to tokenizers 0.9.4
* set sentencepiece as optional
* clean up squad
* clean up pipelines to use sequence_ids
* style/quality
* wording
* Switch to use_fast = True by default
* update tests for use_fast at True by default
* fix rag tokenizer test
* removing protobuf from required dependencies
* fix NER test for use_fast = True by default
* fixing example tests (Q&A examples use slow tokenizers for now)
* protobuf in main deps extras["sentencepiece"] and example deps
* fix protobuf install test
* try to fix seq2seq by switching to slow tokenizers for now
* Update src/transformers/tokenization_utils_base.py

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-