Commits · 82f7bbbd9388f2b4b399f5e5d34cfeeefb3cd69d · chenpangpang / transformers

10 Jul, 2020 15 commits

Update README.md (#5617) · 82f7bbbd
Bashar Talafha authored Jul 10, 2020
```
* Update README.md

* Update README.md
```
82f7bbbd
Create README.md (#5572) · bf497376
Manuel Romero authored Jul 10, 2020

bf497376
Create README.md for electra-base-squad2 (#5574) · 3653d01f
kolk authored Jul 10, 2020

3653d01f
Add freshly trained `base` version (#5621) · aa69c81f
Txus authored Jul 10, 2020

aa69c81f

Fixed use of memories in XLNet (caching for language generation + warning when... · 227e0a40

Teven authored Jul 10, 2020

Fixed use of memories in XLNet (caching for language generation + warning when loading improper memoryless model) (#5632)

* Pytorch gpu => cpu proper device

* Memoryless XLNet warning + fixed memories during generation

* Revert "Pytorch gpu => cpu proper device"

This reverts commit 93489b36

* made black happy

* TF generation with memories

* dim => axis

* added padding_text to TF XL models

* Added comment, added TF

227e0a40

Create README.md (#5638) · 3b7b6465
Manuel Romero authored Jul 10, 2020

3b7b6465
Create model card (#5655) · 0039b965
Manuel Romero authored Jul 10, 2020
```
Create model card for T5-small fine-tuned on SQUAD v2
```
0039b965
Create README.md - Model card (#5657) · 46982d61
Nils Reimers authored Jul 10, 2020
```
Model card for sentence-transformers/bert-base-nli-cls-token
```
46982d61
Create README.md - Model card (#5658) · c483803d
Nils Reimers authored Jul 10, 2020
```
Model card for sentence-transformers/bert-base-nli-max-tokens
```
c483803d

Change model outputs types to self-document outputs (#5438) · edfd82f5

Sylvain Gugger authored Jul 10, 2020

* [WIP] Proposal for model outputs

* All Bert models

* Make CI green maybe?

* Fix ONNX test

* Isolate ModelOutput from pt and tf

* Formatting

* Add Electra models

* Auto-generate docstrings from outputs

* Add TF outputs

* Add some BERT models

* Revert TF side

* Remove last traces of TF changes

* Fail with a clear error message

* Add Albert and work through Bart

* Add CTRL and DistilBert

* Formatting

* Progress on Bart

* Renames and finish Bart

* Formatting

* Fix last test

* Add DPR

* Finish Electra and add FlauBERT

* Add GPT2

* Add Longformer

* Add MMBT

* Add MobileBert

* Add GPT

* Formatting

* Add Reformer

* Add Roberta

* Add T5

* Add Transformer XL

* Fix test

* Add XLM + fix XLMForTokenClassification

* Style + XLMRoberta

* Add XLNet

* Formatting

* Add doc of return_tuple arg

edfd82f5

Create Model card for RoBERTa-hindi-guj-san (#5661) · fa265230
Suraj Parmar authored Jul 10, 2020

fa265230
Improvements to PretrainedConfig documentation (#5642) · b2747af5
Sylvain Gugger authored Jul 10, 2020
```
* Update PretrainedConfig doc

* Formatting

* Small fixes

* Forgotten args and more cleanup
```
b2747af5
[model_card] BART for ELI5 · bfacb2e3
Julien Chaumond authored Jul 10, 2020
```
cc @yjernite
```
bfacb2e3
Create README.md (#5652) · 2e6bb0e9
Nils Reimers authored Jul 10, 2020

2e6bb0e9
[model_card] Add meta + fix link to image · 552e4591
Julien Chaumond authored Jul 10, 2020
```
(hotlinking to image works on GitHub but not on external sites)

cc @bashartalafha
```
552e4591

09 Jul, 2020 10 commits

Fixed TextGenerationPipeline on torch + GPU (#5629) · 02a0b430

Teven authored Jul 09, 2020

* Pytorch gpu => cpu proper device

* Memoryless XLNet warning + fixed memories during generation

* Revert "Memoryless XLNet warning + fixed memories during generation"

This reverts commit 3d3251ff

* Took the operations on the generated_sequence out of the ensure_device scope

02a0b430

Add forum link in the docs (#5637) · 760f726e
Sylvain Gugger authored Jul 09, 2020

760f726e
fix 404 (#5616) · bfeaae22
Stas Bekman authored Jul 09, 2020

bfeaae22
Should check that torch TPU is available (#5636) · b25f7802
Lysandre Debut authored Jul 09, 2020

b25f7802
More explicit error when failing to tensorize overflowing tokens (#5633) · 3cc23eee
Lysandre Debut authored Jul 09, 2020

3cc23eee
Update stable doc · b9d8af07
Lysandre authored Jul 09, 2020

b9d8af07
Correct extension (#5631) · 1158e565
Lysandre Debut authored Jul 09, 2020

1158e565
Update stable doc · 5c82bf68
Lysandre authored Jul 09, 2020

5c82bf68

Test XLA examples (#5583) · 0533cf47

Lysandre Debut authored Jul 09, 2020

* Test XLA examples

* Style

* Using `require_torch_tpu`

* Style

* No need for pytest

0533cf47

QA pipeline BART compatible (#5496) · 3bd55199

Funtowicz Morgan authored Jul 09, 2020



* Ensure padding and question cannot have higher probs than context.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Add bart the the list of tokenizers adding two <sep> tokens for squad_convert_example_to_feature
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Format.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Addressing @patrickvonplaten comments.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Addressing @patrickvonplaten comments about masking non-context element when generating the answer.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Addressing @sshleifer comments.
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Make sure we mask CLS after handling impossible answers
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

* Mask in the correct vectors ...
Signed-off-by: Morgan Funtowicz <funtowiczmo@gmail.com>

3bd55199

08 Jul, 2020 9 commits

doc fixes (#5613) · fa5423b1
Stas Bekman authored Jul 08, 2020

fa5423b1

Add newly trained `calbert-tiny-uncased` (#5599) · 7d0ef004

Txus authored Jul 08, 2020



* Create README.md

Add newly trained `calbert-tiny-uncased` (complete rewrite with SentencePiece)

* Add Exbert link

* Apply suggestions from code review
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

7d0ef004

Fix Inconsistent NER Grouping (Pipeline) (#4987) · 0cc4eae0

Lorenzo Ampil authored Jul 09, 2020



* Add B I handling to grouping

* Add fix to include separate entity as last token

* move last_idx definition outside loop

* Use first entity in entity group as reference for entity type

* Add test cases

* Take out extra class accidentally added

* Return tf ner grouped test to original

* Take out redundant last entity

* Get last_idx safely
Co-authored-by: ColleterVi <36503688+ColleterVi@users.noreply.github.com>

* Fix first entity comment

* Create separate functions for group_sub_entities and group_entities (splitting call method to testable functions)

* Take out unnecessary last_idx

* Remove additional forward pass test

* Move token classification basic tests to separate class

* Move token classification basic tests back to monocolumninputtestcase

* Move base ner tests to nerpipelinetests

* Take out unused kwargs

* Add back mandatory_keys argument

* Add unitary tests for group_entities in _test_ner_pipeline

* Fix last entity handling

* Fix grouping fucntion used

* Add typing to group_sub_entities and group_entities
Co-authored-by: ColleterVi <36503688+ColleterVi@users.noreply.github.com>

0cc4eae0

create model cards for qg models (#5610) · 82ce8488
Suraj Patil authored Jul 09, 2020

82ce8488
Create README.md (#5601) · d6b6ab11
Bashar Talafha authored Jul 08, 2020

d6b6ab11
Update benchmark notebook (#5603) · 40d98ebf
Patrick von Platen authored Jul 08, 2020
```
* Créé avec Colaboratory

* delete old file
```
40d98ebf
Update question template (#5585) · 281e3948
Sylvain Gugger authored Jul 08, 2020

281e3948

[Benchmark] Add benchmarks for TF Training (#5594) · f82a2a5e

Patrick von Platen authored Jul 08, 2020

* tf_train

* adapt timing for tpu

* fix timing

* fix timing

* fix timing

* fix timing

* update notebook

* add tests

f82a2a5e

Add DeeBERT (entropy-based early exiting for *BERT) (#5477) · cfbb9829

Ji Xin authored Jul 07, 2020

* Add deebert code

* Add readme of deebert

* Add test for deebert

Update test for Deebert

* Update DeeBert (README, class names, function refactoring); remove requirements.txt

* Format update

* Update test

* Update readme and model init methods

cfbb9829

07 Jul, 2020 6 commits

Guide to fixed-length model perplexity evaluation (#5449) · b4b33fdf

Joe Davison authored Jul 07, 2020

* add first draft ppl guide

* upload imgs

* expand on strides

* ref typo

* rm superfluous past var

* add tokenization disclaimer

b4b33fdf

readme for benchmark (#5363) · fde217c6
Patrick von Platen authored Jul 07, 2020

fde217c6
mbart.prepare_translation_batch: pass through kwargs (#5581) · d6eab530
Sam Shleifer authored Jul 07, 2020

d6eab530

Add mbart-large-cc25, support translation finetuning (#5129) · 353b8f1e

Sam Shleifer authored Jul 07, 2020

improve unittests for finetuning, especially w.r.t testing frozen parameters
fix freeze_embeds for T5
add streamlit setup.cfg

353b8f1e

Create xlm-roberta-large-finetuned-conll03-german-README.md · 14149244
Julien Chaumond authored Jul 07, 2020
```
cc @BramVanroy
```
14149244

[Almost all TF models] TF clean up: add missing CLM / MLM loss; fix T5 naming... · 4dc65591

Patrick von Platen authored Jul 07, 2020


[Almost all TF models] TF clean up: add missing CLM / MLM loss; fix T5 naming and keras compile (#5395)

* add first version of clm tf

* make style

* add more tests for bert

* update tf clm loss

* fix tests

* correct tf ner script

* add mlm loss

* delete bogus file

* clean tf auto model + add tests

* finish adding clm loss everywhere

* fix training in distilbert

* fix flake8

* save intermediate

* fix tf t5 naming

* remove prints

* finish up

* up

* fix tf gpt2

* fix new test utils import

* fix flake8

* keep backward compatibility

* Update src/transformers/modeling_tf_albert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_roberta.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_mobilebert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_distilbert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply sylvains suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

4dc65591