- 17 Nov, 2020 14 commits
- Sylvain Gugger authored
- Caitlin Ostroff authored
* Add Harry Potter Model
* Update model_cards/ceostroff/harry-potter-gpt2-fanfiction/README.md
* Update model_cards/ceostroff/harry-potter-gpt2-fanfiction/README.md
* Update model_cards/ceostroff/harry-potter-gpt2-fanfiction/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
- Sylvain Gugger authored
* Remove old deprecated arguments
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
* Remove needless imports
* Fix tests
Co-authored-by: LysandreJik <lysandre.debut@reseau.eseo.fr>
- Lysandre Debut authored
* Tokenizers should be framework agnostic
* Run the slow tests
* Not testing
* Fix documentation
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
- Sylvain Gugger authored
- Stas Bekman authored
- Sylvain Gugger authored
* First fixes
* Fix imports and add init
* Fix typo
* Move init to final dest
* Fix tokenization import
* More fixes
* Styling
- Julien Chaumond authored
* <small>tiny typo</small>
* Tokenizers: ability to load from model subfolder
* use subfolder for local files as well
* Uniformize model shortcut name => model id
* from s3 => from huggingface.co
Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>
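A minimal sketch of the subfolder loading added in the commit above, assuming the feature is exposed as a `subfolder` keyword on `from_pretrained`; the repository id and folder name below are hypothetical placeholders, not real artifacts.

```python
# Hypothetical sketch: load tokenizer files from a subfolder of a model repo
# rather than its root. Repo id and "tokenizer" folder name are placeholders.
from transformers import AutoTokenizer

# Fetches tokenizer files from <repo>/tokenizer/ on huggingface.co.
tokenizer = AutoTokenizer.from_pretrained("some-user/some-model", subfolder="tokenizer")

# Per the commit, the same argument is meant to work for local files,
# e.g. ./some-model/tokenizer/.
local_tokenizer = AutoTokenizer.from_pretrained("./some-model", subfolder="tokenizer")
```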
- Sylvain Gugger authored
- sgugger authored
- Patrick von Platen authored
* add docs
* make style
- Patrick von Platen authored
* add mt5 and t5v1_1 model
* fix tests
* correct some imports
* add tf model
* finish tf t5
* improve examples
* fix copies
* clean doc
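A minimal usage sketch for the newly added mT5 model described above; the `google/mt5-small` checkpoint name and the prompt are assumptions for illustration, not taken from the commit message.

```python
# Minimal sketch: run the newly added mT5 model for conditional generation.
# Checkpoint name and prompt are illustrative assumptions.
from transformers import MT5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("google/mt5-small")
model = MT5ForConditionalGeneration.from_pretrained("google/mt5-small")

inputs = tokenizer("summarize: mT5 is a multilingual variant of T5.", return_tensors="pt")
summary_ids = model.generate(**inputs, max_length=20)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```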
- fajri91 authored
- Sylvain Gugger authored
* Put models in subfolders
* Styling
* Fix imports in tests
* More fixes in test imports
* Sneaky hidden imports
* Fix imports in doc files
* More sneaky imports
* Finish fixing tests
* Fix examples
* Fix path for copies
* More fixes for examples
* Fix dummy files
* More fixes for examples
* More model import fixes
* Is this why you're unhappy GitHub?
* Fix imports in convert command
- 16 Nov, 2020 10 commits
- Julien Plu authored
* Fix mixed precision issue for GPT2
* Forgot one cast
* oops
* Forgotten casts
- Sylvain Gugger authored
* Use the CI to identify failing tests
* Remove from all examples and tests
* More default switch
* Fixes
* More test fixes
* More fixes
* Last fixes hopefully
* Use the CI to identify failing tests
* Remove from all examples and tests
* More default switch
* Fixes
* More test fixes
* More fixes
* Last fixes hopefully
* Run on the real suite
* Fix slow tests
- Sylvain Gugger authored
- LSinev authored
* Fix passing token_type_ids during GPT2DoubleHeadsModel.generate() if used, and for GPT2LMHeadModel too
* Update tests to check token_type_ids usage in GPT2 models
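A rough sketch of the scenario this fix targets: passing `token_type_ids` through `generate()` for a GPT-2 model. The prompt and the all-zero segment ids are illustrative assumptions, not part of the commit.

```python
# Rough sketch: forward token_type_ids through generate() for GPT-2,
# the behavior the fix above addresses. Values are illustrative only.
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

inputs = tokenizer("Hello, how are", return_tensors="pt")
# Mark every prompt token as segment 0; after the fix, generate() is expected
# to keep extending these ids for newly generated tokens.
token_type_ids = torch.zeros_like(inputs["input_ids"])

output = model.generate(
    inputs["input_ids"],
    token_type_ids=token_type_ids,
    max_length=20,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(output[0]))
```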
- Yusuke Mori authored
* Simply insert T5Tokenizer's prepare_seq2seq_batch
* Update/Add some 'import'
* Fix RuntimeError caused by '.view'
* Move .view-related error avoidance from seq2seq_trainer to inside prophetnet
* Update test_tokenization_prophetnet.py
* Format the test code with black
* Re-format the test code
* Update test_tokenization_prophetnet.py
* Add importing require_torch in the test code
* Add importing BatchEncoding in the test code
* Re-format the test code on Colab
- Stas Bekman authored
* [doc] typo fix @sgugger
* Update src/transformers/modeling_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- Branden Chan authored
- Mehrdad Farahani authored
- Mehrdad Farahani authored
- zhezhaoa authored
* Create README.md
* Update model_cards/uer/chinese_roberta_L-2_H-128/README.md
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
- 15 Nov, 2020 1 commit
- Thomas Wolf authored
[breaking|pipelines|tokenizers] Adding slow-fast tokenizers equivalence tests pipelines - Removing sentencepiece as a required dependency (#8073)
* Fixing roberta for slow-fast tests
* WIP getting equivalence on pipelines
* slow-to-fast equivalence - working on question-answering pipeline
* optional FAISS tests
* Pipeline Q&A
* Move pipeline tests to their own test job again
* update tokenizer to add sequence id methods
* update to tokenizers 0.9.4
* set sentencepiece as optional
* clean up squad
* clean up pipelines to use sequence_ids
* style/quality
* wording
* Switch to use_fast = True by default
* update tests for use_fast at True by default
* fix rag tokenizer test
* removing protobuf from required dependencies
* fix NER test for use_fast = True by default
* fixing example tests (Q&A examples use slow tokenizers for now)
* protobuf in main deps extras["sentencepiece"] and example deps
* fix protobuf install test
* try to fix seq2seq by switching to slow tokenizers for now
* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/tokenization_utils_base.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
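A minimal sketch of the `use_fast = True` default change in the merge above; the checkpoint name is an assumption, and `is_fast` is used only to show which backend was loaded.

```python
# Minimal sketch: AutoTokenizer now returns a "fast" (Rust-backed) tokenizer
# by default; pass use_fast=False to get the slow Python implementation.
from transformers import AutoTokenizer

fast_tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
print(fast_tokenizer.is_fast)  # True under the new default

slow_tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased", use_fast=False)
print(slow_tokenizer.is_fast)  # False: the slow Python tokenizer
```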
- 13 Nov, 2020 11 commits
- Julien Plu authored
* Update some tests
* Small update
* Apply style
* Use max_position_embeddings
* Create a fake attribute
* Create a fake attribute
* Update wrong name
* Wrong TransfoXL model file
* Keep the common tests agnostic
- Patrick von Platen authored
* fix load weights
* delete line
- Joe Davison authored
- Julien Chaumond authored
* More doc tweaks
* Update model_sharing.rst
* make style
* missing newline
* Add email tip
Co-authored-by: Pierric Cistac <pierric@huggingface.co>
- LysandreJik authored
- Lysandre Debut authored
* Model templates
* TensorFlow
* Remove pooler
* CI
* Tokenizer + Refactoring
* Encoder-Decoder
* Let's go testing
* Encoder-Decoder in TF
* Let's go testing in TF
* Documentation
* README
* Fixes
* Better names
* Style
* Update docs
* Choose to skip either TF or PT
* Code quality fixes
* Add to testing suite
* Update file path
* Cookiecutter path
* Update `transformers` path
* Handle rebasing
* Remove seq2seq from model templates
* Remove s2s config
* Apply Sylvain and Patrick comments
* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Last fixes from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
- Patrick von Platen authored
* fix bug
* T5 refactor
* refactor tf
* apply Sylvain's suggestions
- Sylvain Gugger authored
- Sylvain Gugger authored
- Branden Chan authored
* Update README.md
* Update README.md
- Sylvain Gugger authored
- 12 Nov, 2020 4 commits
- Julien Plu authored
* Add pretraining loss computation for TF Bert pretraining
* Fix labels creation
* Fix T5 model
* restore T5 kwargs
* try a generic fix for pretraining models
* Apply style
* Override the prepare method for the BERT tests
- Julien Plu authored
- Julien Plu authored
- Forrest Iandola authored