Commits · d97d06d05f3349f81716268df244d45b037518ef · chenpangpang / transformers

28 Dec, 2020 2 commits

Fix TF T5 (#9301) · d97d06d0
Julien Plu authored Dec 28, 2020
```
* Fix T5

* Fix test

* Fix test
```
d97d06d0

[Seq2Seq Templates] Correct some TF-serving errors and add gradient checkpointing to PT by default. (#9334) · 83fdd252

Patrick von Platen authored Dec 28, 2020

* correct tests

* correct shape and get_tf_activation

* more correction tf

* add gradient checkpointing to templates

* correct typo

83fdd252

27 Dec, 2020 1 commit

push (#9320) · 8e74eca7
Patrick von Platen authored Dec 27, 2020

8e74eca7

25 Dec, 2020 2 commits

[GPT2] Correct gradient checkpointing (#9308) · 61443cd7

Patrick von Platen authored Dec 25, 2020

* correct gpt2

* fix gpt2

* fix use_cache ordering

* correct past tolerance

* fix for all cases

* style

61443cd7

add translation example (#9303) · 21fc6766

Vasudev Gupta authored Dec 25, 2020

* Created using Colaboratory

* mbart-training examples add

* link add

* Update description

Co-authored-by: Suraj Patil <surajp815@gmail.com>

21fc6766

24 Dec, 2020 8 commits

[Bart doc] Fix outdated statement (#9299) · 52b3a05e
Patrick von Platen authored Dec 24, 2020
```
* fix bart doc

* fix docs
```
52b3a05e
Update tokenization_utils_base.py (#9293) · 7777db15
Bram Vanroy authored Dec 24, 2020
```
Missing "s" typo
```
7777db15

fix typo in modeling_encoder_decoder.py (#9297) · 71963a66

Daniele Sartiano authored Dec 24, 2020

* Update modeling_encoder_decoder.py

Fixed typo.

* typo

Co-authored-by: Suraj Patil <surajp815@gmail.com>

71963a66

Proposed Fix : [RagSequenceForGeneration] generate "without" input_ids (#9220) · f3a3b91d

Ratthachat (Jung) authored Dec 24, 2020

* Create modeling_tf_dpr.py

* Add TFDPR

* Add back TFPegasus, TFMarian, TFMBart, TFBlenderBot

The last commit accidentally deleted these 4 lines, so I restored them

* Add TFDPR

* Add TFDPR

* clean up some comments, add TF input-style doc string

* Add TFDPR

* Make return_dict=False as default

* Fix return_dict bug (in .from_pretrained)

* Add get_input_embeddings()

* Create test_modeling_tf_dpr.py

The current version already passes all 27 tests!
Please see the test run at:
https://colab.research.google.com/drive/1czS_m9zy5k-iSJbzA_DP1k1xAAC_sdkf?usp=sharing

* fix quality

* delete init weights

* run fix copies

* fix repo consis

* del config_class, load_tf_weights

They should be 'pytorch only'

* add config_class back

after removing it, the tests failed ... so in the end only "use_tf_weights = None" is removed, on Lysandre's suggestion

* newline after .. note::

* import tf, np (Necessary for ModelIntegrationTest)

* slow_test from_pretrained with from_pt=True

At the moment we don't have TF weights (since we don't have an official TF model)
Previously, I did not run the slow tests, so I missed this bug

* Add simple TFDPRModelIntegrationTest

Note that this is just a test that TF and PyTorch give approximately the same output.
However, I could not test against the official DPR repo's output yet

* upload correct tf model

* remove position_ids as missing keys

* fix RagSeq generate with context_input_ids


* apply style

* delete unused lines

* Add test_rag_sequence_generate_batch_from_context_input_ids

* Readability improved

* styling

* Stylize

* typos

* add check_model_generate_from_context_input_ids

* make style

* Apply suggestions from code review

* make style2

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: patrickvonplaten <patrick@huggingface.co>

f3a3b91d

enable cache by default (#9296) · 2a18b709
Suraj Patil authored Dec 24, 2020

2a18b709
Fix typo in file_utils.py (#9289) · 6189ae99
Jungwhan authored Dec 24, 2020

6189ae99
allow integer device for BatchEncoding (#9271) · 222dbdb2
Jethro Kuan authored Dec 24, 2020
```
Fixes #9244
Co-authored-by: Jethro Kuan <jethro.kuan@bytedance.com>
```
222dbdb2

[Templates] Adapt Bert (#9284) · 6c091abe

Patrick von Platen authored Dec 24, 2020

* adapt templates

* adapt config

* add test as well

* fix output type

* fix cache false naming

* finish tests

* last fix

6c091abe

23 Dec, 2020 6 commits

Add caching mechanism to BERT, RoBERTa (#9183) · 88ef8893

Suraj Patil authored Dec 23, 2020

* add past_key_values

* add use_cache option

* make mask before cutting ids

* adjust position_ids according to past_key_values

* flatten past_key_values

* fix positional embeds

* fix _reorder_cache

* set use_cache to false when not decoder, fix attention mask init

* add test for caching

* add past_key_values for Roberta

* fix position embeds

* add caching test for roberta

* add doc

* make style

* doc, fix attention mask, test

* small fixes

* address Patrick's comments

* input_ids shouldn't start with pad token

* use_cache only when decoder

* make consistent with bert

* make copies consistent

* add use_cache to encoder

* add past_key_values to tapas attention

* apply suggestions from code review

* make copies consistent

* add attn mask in tests

* remove copied from longformer

* apply suggestions from code review

* fix bart test

* nit

* simplify model outputs

* fix doc

* fix output ordering

88ef8893
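
The bullets "adjust position_ids according to past_key_values" and "make mask before cutting ids" describe bookkeeping that any key/value cache needs: once earlier tokens are cached, only the new tokens are fed to the model, so their positions must start after the cached length while the attention mask still spans cached plus new tokens. The following is a minimal, library-free sketch of that idea. The function names are hypothetical and this is not the code from the commit.

```python
from typing import List


def position_ids_with_cache(new_len: int, past_len: int = 0) -> List[int]:
    """Positions for `new_len` newly fed tokens when `past_len` tokens are cached.

    Hypothetical helper for illustration; not part of transformers.
    """
    if new_len < 0 or past_len < 0:
        raise ValueError("lengths must be non-negative")
    # New tokens continue counting where the cache left off.
    return list(range(past_len, past_len + new_len))


def attention_mask_with_cache(new_len: int, past_len: int = 0) -> List[int]:
    """All-ones mask that covers both the cached and the new positions."""
    if new_len < 0 or past_len < 0:
        raise ValueError("lengths must be non-negative")
    return [1] * (past_len + new_len)


# First pass: nothing cached, five prompt tokens.
prompt_positions = position_ids_with_cache(5)
# Next decoding step: five tokens cached, one new token fed.
step_positions = position_ids_with_cache(1, past_len=5)
step_mask = attention_mask_with_cache(1, past_len=5)
```

In this sketch the single new token gets the position that follows the cached tokens, and the mask length equals the cached length plus the new length.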

Adapt to new name of `label_smoothing_factor` training arg (#9282) · a1cb6e98
Sylvain Gugger authored Dec 23, 2020

a1cb6e98

Minor documentation revisions from copyediting (#9266) · bcc87c63

Connor Brinton authored Dec 23, 2020

* typo: Revise "checkout" to "check out"

* typo: Change "seemlessly" to "seamlessly"

* typo: Close parentheses in "Using the tokenizer"

* typo: Add closing parenthesis to supported models aside

* docs: Treat ``position_ids`` as plural

Alternatively, the word "argument" could be added to make the subject singular.

* docs: Remove comma, making subordinate clause

* docs: Remove comma separating verb and direct object

* docs: Fix typo ("next" -> "text")

* docs: Reverse phrase order to simplify sentence

* docs: "quicktour" -> "quick tour"

* docs: "to throw" -> "from throwing"

* docs: Remove disruptive newline in padding/truncation section

* docs: "show exemplary" -> "show examples of"

* docs: "much harder as" -> "much harder than"

* docs: Fix typo "seach" -> "search"

* docs: Fix subject-verb disagreement in WordPiece description

* docs: Fix style in preprocessing.rst

bcc87c63

[Seq2Seq Templates] Fix check_repo.py templates file (#9277) · d5db6c37
Patrick von Platen authored Dec 23, 2020
```
* add enc dec pt model to check repo

* fix indent
```
d5db6c37

Fix param error (#9273) · 4bafc43b

Xu Song authored Dec 23, 2020

TypeError: forward() got an unexpected keyword argument 'token_type_ids'

4bafc43b
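
The commit message above quotes only the error, not the model or the fix. As a generic illustration of how this kind of `TypeError` arises in Python, the sketch below calls a function with a keyword it does not declare, then filters the inputs against the function's signature. The `forward` function and the `encoded` dictionary are hypothetical stand-ins, not code from the repository.

```python
import inspect


def forward(input_ids, attention_mask=None):
    """Hypothetical stand-in for a forward() that has no token_type_ids parameter."""
    return len(input_ids)


# Inputs shaped like a tokenizer's output (illustrative values only).
encoded = {
    "input_ids": [101, 2023, 102],
    "token_type_ids": [0, 0, 0],
    "attention_mask": [1, 1, 1],
}

message = ""
try:
    forward(**encoded)  # token_type_ids is not accepted by forward()
except TypeError as err:
    message = str(err)

# One possible workaround: pass only the keywords the callee declares.
accepted = set(inspect.signature(forward).parameters)
filtered = {name: value for name, value in encoded.items() if name in accepted}
result = forward(**filtered)
```

Filtering by signature is only one way to avoid the error; the actual commit may have changed the call site or the inputs instead.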

Fix gpt2 document (#9272) · 58e8a761
Xu Song authored Dec 23, 2020

58e8a761

22 Dec, 2020 10 commits

Model Templates for Seq2Seq (#9251) · cbe63949

Patrick von Platen authored Dec 22, 2020

* adapt cookie cutter

* fix copy past statement

* delete copy statements for now

* remove unused import from template

* make doc rst

* correct config docstring

* correct training

* correct inputs processing tf enc dec

* make style

* adapt templates

* clean tabs

* correct tensor -> Tensor naming

* correct indent

* correct templates

* fix the test

* break lines to avoid > 119

* Apply suggestions from code review

cbe63949

Revert renaming in finetune_trainer (#9262) · e6c1f1ca
Sylvain Gugger authored Dec 22, 2020

e6c1f1ca
Add speed metrics to all example scripts + template (#9260) · ab177588
Sylvain Gugger authored Dec 22, 2020

ab177588
[hf_api] Fix incorrect typing · 5b5f7dd0
Julien Chaumond authored Dec 22, 2020

5b5f7dd0

Fix TF BART for saved model creation (#9252) · 1558d191

Julien Plu authored Dec 22, 2020

* Fix TF BART for saved model creation

* Apply style

* Update src/transformers/models/bart/modeling_tf_bart.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/bart/modeling_tf_bart.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Rework the fix

* Fix condition

* Apply style

* Fix condition

* Fix shape_list

* Apply Patrick's solution

* Apply Patrick's solution

* Rebase

* make tests pass

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>

1558d191

Fix link to bertabs/README.md (#9255) · 37d6fb5d
Manuel Romero authored Dec 22, 2020

37d6fb5d
Fix link to old language modeling script (#9254) · 189c1b91
Manuel Romero authored Dec 22, 2020

189c1b91

Seq2seq trainer (#9241) · 490b39e6

Sylvain Gugger authored Dec 22, 2020

* Add label smoothing in Trainer

* Add options for scheduler and Adafactor in Trainer

* Put Seq2SeqTrainer in the main lib

* Apply suggestions from code review

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Address review comments and adapt scripts

* Documentation

* Move test not using script to tests folder

Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

490b39e6
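
The first bullet of this commit mentions label smoothing. As a reminder of what that technique computes in general, here is a small pure-Python sketch for a single example: the target distribution puts `1 - epsilon` on the true class and spreads `epsilon` uniformly over all classes, so the loss is a weighted sum of the usual negative log-likelihood and the mean negative log-probability. The function names and the default `epsilon` are assumptions for illustration; this is not the Trainer's implementation.

```python
import math
from typing import List


def log_softmax(logits: List[float]) -> List[float]:
    """Numerically stable log-softmax over one list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def label_smoothed_nll(logits: List[float], target: int, epsilon: float = 0.1) -> float:
    """Cross-entropy against a smoothed target for a single example.

    Hypothetical helper; epsilon is the smoothing factor.
    """
    if not 0.0 <= epsilon < 1.0:
        raise ValueError("epsilon must be in [0, 1)")
    log_probs = log_softmax(logits)
    nll = -log_probs[target]                    # standard negative log-likelihood
    uniform = -sum(log_probs) / len(log_probs)  # loss against the uniform distribution
    return (1.0 - epsilon) * nll + epsilon * uniform


plain = label_smoothed_nll([5.0, 0.0, 0.0], target=0, epsilon=0.0)
smoothed = label_smoothed_nll([5.0, 0.0, 0.0], target=0, epsilon=0.1)
```

With `epsilon=0.0` the function reduces to the ordinary negative log-likelihood. For a confident, correct prediction the smoothed loss is larger than the plain one, which is the intended regularizing effect.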

Fix script that checks objects are documented (#9259) · 1fc71191
Sylvain Gugger authored Dec 22, 2020

1fc71191

[EncoderDecoder] Make tests more aggressive (#9256) · e9d77ccd

Patrick von Platen authored Dec 22, 2020

* add tests

* make style and fix bart bug

* fix bart past key value edge case

* correct tf bart test

* fix gpt2 tf

* fix t5 test

e9d77ccd

21 Dec, 2020 9 commits

Update the README of the text classification example (#9237) · ec07da65

Sylvain Gugger authored Dec 21, 2020

* Update the README of the text classification example

* Update examples/README.md

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Adapt comment from review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

ec07da65

Adding performer fine-tuning research example (#9239) · 4eef5889
Teven authored Dec 21, 2020
```
* added run_mlm_performer.py research example

* make style

* make style

* Added a README!
```
4eef5889
[MPNet] Add slow to fast tokenizer converter (#9233) · 9a12b969
Patrick von Platen authored Dec 21, 2020
```
* add converter

* delete unnecessary comments
```
9a12b969
add base model classes to bart subclassed models (#9230) · f4432b7e
Suraj Patil authored Dec 21, 2020
```
* add base model classes to bart subclassed models

* add doc
```
f4432b7e
Fixed beam search generation for GPT2 and T5 (#9219) · 08abdabd
TobiasNorlund authored Dec 21, 2020

08abdabd
Fix TF template (#9234) · 161a6461
Julien Plu authored Dec 21, 2020

161a6461

Improve BERT-like models performance with better self attention (#9124) · 5a8a4eb1

Julien Plu authored Dec 21, 2020

* Improve BERT-like models attention layers

* Apply style

* Put back error raising instead of assert

* Update template

* Fix copies

* Apply raising valueerror in MPNet

* Restore the copy check for the Intermediate layer in Longformer

* Update longformer

5a8a4eb1

fix warning (#9231) · 6b034309
Patrick von Platen authored Dec 21, 2020

6b034309

[RAG] Add Ray implementation for distributed retrieval (#9197) · a4b21cdd

Amog Kamsetty authored Dec 21, 2020

* wip

* wip

* wip

* wip

* wip

* wip

* wip

* wip

* uncomment

* uncomment

* wip

* updates

* add docstring

* updates

* fix arg

* fixes

* add unit tests

* update readme

* update readme

* update finetune script

* update test

* add test

* add ray to test dependencies

* separate ray and ray tune

* formatting

* shutdown ray at end of test

* fix tests

* formatting

* formatting

* even more formatting

* address comments

* formatting

* add files

* Update examples/research_projects/rag/test_distributed_retriever.py

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* address comments

* addressing comments

Co-authored-by: Ubuntu <ubuntu@ip-172-31-21-208.us-west-2.compute.internal>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

a4b21cdd

20 Dec, 2020 1 commit

better logging and help (#9203) · f38c4ad3
Stas Bekman authored Dec 20, 2020

f38c4ad3

19 Dec, 2020 1 commit

Added TF TransfoXL Sequence Classification (#9169) · e0e255be

sandip authored Dec 19, 2020

* TF Transfoxl seq classification

* Update test_modeling_tf_transfo_xl.py

Added num_labels to config level

* TF Transfoxl seq classification

* Update test_modeling_tf_transfo_xl.py

Added num_labels to config level

* code refactor

* code refactor

* code refactor

e0e255be