Commits · 3a08cc1ce7248f5badbdd3c85d679b2f5068485a · chenpangpang / transformers

29 Nov, 2020 2 commits

Minor docs typo fixes (#8797) · 3a08cc1c

Guy Rosin authored Nov 29, 2020



* Fix minor typos

* Additional typos

* Style fix
Co-authored-by: guyrosin <guyrosin@assist-561.cs.technion.ac.il>

3a08cc1c

[Pegasus] Refactor Tokenizer (#8731) · 5ced23dc

Patrick von Platen authored Nov 29, 2020

* refactor

* further refactor

* fix the rest tomorrow

* save intermediate

* finish slow tokenizer

* make more tests pass

* finish refactor

* fix comment

* clean further

* fix name

* fix naming

* Update src/transformers/models/reformer/tokenization_reformer.py

* Apply suggestions from code review

* Apply suggestions from code review

* refactor

* fix init tokenizers

* refactor

* improve convert

* refactor

* correct convert slow tokenizer

* final fix for Pegasus Tok

* remove ipdb

* improve links

5ced23dc

28 Nov, 2020 1 commit
- fix mt5 config (#8832) · 36b60ce9
  Patrick von Platen authored Nov 28, 2020
  
  36b60ce9
27 Nov, 2020 12 commits

Model parallel tests should return, not pass in non model parallel settings. (#8825) · 18c32eeb
Lysandre Debut authored Nov 27, 2020

18c32eeb
Temporarily deactivate model generation · edbff1fd
LysandreJik authored Nov 27, 2020

edbff1fd
suggest a numerical limit of 50MB for determining @slow (#8824) · 00ea4565
Stas Bekman authored Nov 27, 2020

00ea4565

BART & FSMT: fix decoder not returning hidden states from the last layer (#8597) · 0a921b64

Max Del authored Nov 27, 2020



* Fix decoder not returning hidden states from the last layer

* Resolve conflict

* Change the way to gather hidden states

* Add decoder hidden states test

* Make pytest and black happy

* Remove redundant line

* remove new line
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

0a921b64

Add barthez model (#8393) · 81fe0bf0

Moussa Kamal Eddine authored Nov 27, 2020



* Add init barthez

* Add barthez model, tokenizer and docs

BARThez is a pre-trained french seq2seq model that uses BART objective.

* Apply suggestions from code review docs typos
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Add license

* Change URLs scheme

* Remove barthez model keep tokenizer

* Fix style

* Fix quality

* Update tokenizer

* Add fast tokenizer

* Add fast tokenizer test
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

81fe0bf0

Fix setup.py (#8798) · b0f2dbc5
Julien Plu authored Nov 27, 2020
```
enforce unix newline encoding regardless of OS creating the file
```
b0f2dbc5
Create README.md (#8729) · 03bddc37
Manuel Romero authored Nov 27, 2020
```
* Create README.md

* Fix model path
```
03bddc37

Extend typing to path-like objects in `PretrainedConfig` and `PreTrainedModel` (#8770) · f9a2a9e3

Giovanni Compagnoni authored Nov 27, 2020

* update configuration_utils.py typing to allow pathlike objects when sensible

* update modeling_utils.py typing to allow pathlike objects when sensible

* black

* update tokenization_utils_base.py typing to allow pathlike objects when sensible

* update tokenization_utils_fast.py typing to allow pathlike objects when sensible

* update configuration_auto.py typing to allow pathlike objects when sensible

* update configuration_auto.py docstring to allow pathlike objects when sensible

* update tokenization_auto.py docstring to allow pathlike objects when sensible

* black

f9a2a9e3

Fix dpr<>bart config for RAG (#8808) · a7d46a06

Patrick von Platen authored Nov 27, 2020

* correct dpr test and bert pos fault

* fix dpr bert config problem

* fix layoutlm

* add config to dpr as well

a7d46a06

[Flax test] Add require pytorch to flix flax test (#8816) · a2cf3759
Patrick von Platen authored Nov 27, 2020
```
* try flax fix

* same for roberta
```
a2cf3759

Update README.md (#8815) · e3ef62bc

mdermentzi authored Nov 27, 2020

The tokenizer called at the input_ids of example 2 is currently encoding text_1. I think this should be changed to text_2.

e3ef62bc

[FlaxBert] Fix non-broadcastable attention mask for batched forward-passes (#8791) · f8eda599

Kristian Holsheimer authored Nov 27, 2020

* [FlaxBert] Fix non-broadcastable attention mask for batched forward-passes

* [FlaxRoberta] Fix non-broadcastable attention mask

* Use jax.numpy instead of ordinary numpy (otherwise not jit-able)

* Partially revert "Use jax.numpy ..."

* Add tests for batched forward passes

* Avoid unnecessary OOMs due to preallocation of GPU memory by XLA

* Auto-fix style

* Re-enable GPU memory preallocation but with mem fraction < 1/paralleism

f8eda599

26 Nov, 2020 6 commits
- typo (#8810) · cb7602b3
  Stas Bekman authored Nov 26, 2020
  
  cb7602b3
- potpurri of small fixes (#8807) · ddf3c646
  Stas Bekman authored Nov 26, 2020
  
  ddf3c646
- Fix PPLM (#8779) · 52708d26
  chutaklee authored Nov 27, 2020
```
* Fix pplm

* fix style

* make style
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  52708d26
- Revert "finetune.py: specifying generation min_length (#8478)" (#8805) · 8f07f5c4
  Patrick von Platen authored Nov 26, 2020
```
This reverts commit 5aa361f3.
```
  8f07f5c4
- Create README.md (#8760) · 66e9608b
  Manuel Romero authored Nov 26, 2020
  
  66e9608b
- finetune.py: specifying generation min_length (#8478) · 5aa361f3
  Daniel Khashabi authored Nov 25, 2020
  
  5aa361f3
25 Nov, 2020 5 commits

Create README.md (#8752) · 30e7f7e5
joangines authored Nov 26, 2020

30e7f7e5

[XLNet] Fix mems behavior (#8567) · 2a6fbe6a

Patrick von Platen authored Nov 25, 2020

* fix mems in xlnet

* fix use_mems

* fix use_mem_len

* fix use mems

* clean docs

* fix tf typo

* make xlnet tf for generation work

* fix tf test

* refactor use cache

* add use cache for missing models

* correct use_cache in generate

* correct use cache in tf generate

* fix tf

* correct getattr typo

* make sylvain happy

* change in docs as well

* do not apply to cookie cutter statements

* fix tf test

* make pytorch model fully backward compatible

2a6fbe6a

Return correct Bart hidden state tensors (#8747) · 369f1d77

Joe Davison authored Nov 25, 2020



* bart output hidden states upstream

* same w/ decoder

* add tests

* fix prophetnet

* fix gpt2 and ctrl

* fix fstm and skip test for reformer and longformer

* fix all models
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

369f1d77

Fix QA argument handler (#8765) · 138f45c1

Lysandre Debut authored Nov 25, 2020



* Fix QA argument handler

* Attempt to get a better fix for QA (#8768)
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

138f45c1

Big model table (#8774) · 4821ea5a

Sylvain Gugger authored Nov 25, 2020



* First draft

* Styling

* With all changes staged

* Update docs/source/index.rst
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

* Styling
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

4821ea5a

24 Nov, 2020 10 commits

Create README.md (#8761) · 90d5ab3b
Manuel Romero authored Nov 24, 2020

90d5ab3b

New TF model inputs (#8602) · 29d49924

Julien Plu authored Nov 24, 2020

* Apply on BERT and ALBERT

* Update TF Bart

* Add input processing to TF BART

* Add input processing for TF CTRL

* Add input processing to TF Distilbert

* Add input processing to TF DPR

* Add input processing to TF Electra

* Add input processing for TF Flaubert

* Add deprecated arguments

* Add input processing to TF XLM

* remove unused imports

* Add input processing to TF Funnel

* Add input processing to TF GPT2

* Add input processing to TF Longformer

* Add input processing to TF Lxmert

* Apply style

* Add input processing to TF Mobilebert

* Add input processing to TF GPT

* Add input processing to TF Roberta

* Add input processing to TF T5

* Add input processing to TF TransfoXL

* Apply style

* Rebase on master

* Bug fix

* Retry to bugfix

* Retry bug fix

* Fix wrong model name

* Try another fix

* Fix BART

* Fix input precessing

* Apply style

* Put the deprecated warnings in the input processing function

* Remove the unused imports

* Raise an error when len(kwargs)>0

* test ModelOutput instead of TFBaseModelOutput

* Bug fix

* Address Patrick's comments

* Address Patrick's comments

* Address Sylvain's comments

* Add the new inputs in new Longformer models

* Update the template with the new input processing

* Remove useless assert

* Apply style

* Trigger CI

29d49924

[core] implement support for run-time dependency version checking (#8645) · 82d443a7

Stas Bekman authored Nov 24, 2020



* implement support for run-time dependency version checking

* try not escaping !

* use findall that works on py36

* small tweaks

* autoformatter worship

* simplify

* shorter names

* add support for non-versioned checks

* add deps

* revert

* tokenizers not required, check version only if installed

* make a proper distutils cmd and add make target

* tqdm must be checked before tokenizers

* workaround the DistributionNotFound peculiar setup

* handle the rest of packages in setup.py

* fully sync setup.py's install_requires - to check them all

* nit

* make install_requires more readable

* typo

* Update setup.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* restyle

* add types

* simplify

* simplify2
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

82d443a7

fix rag index names in eval_rag.py example (#8730) · a7d73cfd
Quentin Lhoest authored Nov 24, 2020

a7d73cfd

added instructions for syncing upstream master with forked master via PR (#8745) · 8d4ed7e9

Binoy Dalal authored Nov 24, 2020



* added instructions for syncing upstream master with forked master via PR

* expand to add a note to why this is requested
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

8d4ed7e9

MT5 should have an autotokenizer (#8743) · e09e54fd

Lysandre Debut authored Nov 24, 2020

* MT5 should have an autotokenizer

* Different configurations should be able to point to same tokenizers

e09e54fd

Fix slow tests v2 (#8746) · 6fdd0bb2

Lysandre Debut authored Nov 24, 2020

* Fix BART test

* Fix MBART tests

* Remove erroneous line from yaml

* Update tests/test_modeling_bart.py

* Quality

6fdd0bb2

Support various BERT relative position embeddings (2nd) (#8276) · 2c83b3c3

zhiheng-huang authored Nov 24, 2020



* Support BERT relative position embeddings

* Fix typo in README.md

* Address review comment

* Fix failing tests

* [tiny] Fix style_doc.py check by adding an empty line to configuration_bert.py

* make fix copies

* fix configs of electra and albert and fix longformer

* remove copy statement from longformer

* fix albert

* fix electra

* Add bert variants forward tests for various position embeddings

* [tiny] Fix style for test_modeling_bert.py

* improve docstring

* [tiny] improve docstring and remove unnecessary dependency

* [tiny] Remove unused import

* re-add to ALBERT

* make embeddings work for ALBERT

* add test for albert
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

2c83b3c3

[EsperBERTo] Fix URLs to assets · 9e71aa2f
Julien Chaumond authored Nov 24, 2020

9e71aa2f
Model parallel documentation (#8741) · 02f48b9b
Lysandre Debut authored Nov 23, 2020
```
* Add parallelize methods to the .rst files

* Correct format
```
02f48b9b

23 Nov, 2020 4 commits

TF BERT test update · 7f2c0091
LysandreJik authored Nov 23, 2020

7f2c0091
Update TF BERT test · e1b7e10d
LysandreJik authored Nov 23, 2020

e1b7e10d

Add early stopping callback to pytorch trainer (#8581) · 8ffc01a7

Colin Brochtrup authored Nov 23, 2020

* Add early stopping patience and minimum threshold metric must improve to prevent early stopping to pytorch trainer

* Add early stopping test

* Set patience counter to 0 if best metric not defined yet

* Make early stopping a callback. Add callback event for updating the best metric for early stopping callback to trigger on.

* Run make style

* make funciton name sensible

* Improve new argument docstring wording and hope that flakey CI test passes.

* Use on_evaluation callback instead of custom. Remove some debug printing

* Move early stopping arguments and state into early stopping callback

* Run make style

* Remove old code

* Fix docs formatting. make style went rogue on me.

* Remove copied attributes and fix variable

* Add assertions on training arguments instead of mutating them. Move comment out of public docs.

* Make separate test for early stopping callback. Add test of invalid arguments.

* Run make style... I remembered before CI this time!

* appease flake8

* Add EarlyStoppingCallback to callback docs

* Make docstring EarlyStoppingCallabck match other callbacks.

* Fix typo in docs

8ffc01a7

Fix max length in run_plm script (#8738) · 367f497d
Sylvain Gugger authored Nov 23, 2020

367f497d