Commits · 7fd1febf38bd01ad413abc56ed06700a9675c143 · chenpangpang / transformers

10 Sep, 2020 2 commits

Add "Leveraging Pretrained Checkpoints for Generation" Seq2Seq models. (#6594) · 7fd1febf

Patrick von Platen authored Sep 10, 2020

* add conversion script

* improve conversion script

* make style

* add tryout files

* fix

* update

* add causal bert

* better names

* add tokenizer file as well

* finish causal_bert

* fix small bugs

* improve generate

* change naming

* renaming

* renaming

* renaming

* remove leftover files

* clean files

* add fix tokenizer

* finalize

* correct slow test

* update docs

* small fixes

* fix link

* adapt check repo

* apply sams and sylvains recommendations

* fix import

* implement Lysandres recommendations

* fix logger warn

7fd1febf

Albert pretrain datasets/ datacollator (#6168) · 762cba3b

Yu Liu authored Sep 10, 2020



* add dataset for albert pretrain

* datacollator for albert pretrain

* naming, comprehension, file reading change

* data cleaning is no needed after this modification

* delete prints

* fix a bug

* file structure change

* add tests for albert datacollator

* remove random seed

* add back len and get item function

* sample file for testing and test code added

* format change for black

* more format change

* Style

* var assignment issue resolve

* add back wrongly deleted DataCollatorWithPadding in init file

* Style
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

762cba3b

09 Sep, 2020 3 commits
- Batch encore plus and overflowing tokens fails when non existing overflowing... · 15478c12
  Lysandre Debut authored Sep 09, 2020
```
Batch encore plus and overflowing tokens fails when non existing overflowing tokens for a sequence (#6677)

* Patch and test

* Fix tests
```
  15478c12
- [from_pretrained] Allow tokenizer_type ≠ model_type (#6995) · ed71c21d
  Julien Chaumond authored Sep 09, 2020
  
  ed71c21d
- adding TRANSFORMERS_VERBOSITY env var (#6961) · d0963486
  Stas Bekman authored Sep 09, 2020
```
* introduce TRANSFORMERS_VERBOSITY env var + test + test helpers

* cleanup

* remove helper function
```
  d0963486
08 Sep, 2020 1 commit

Funnel transformer (#6908) · d155b38d

Sylvain Gugger authored Sep 08, 2020



* Initial model

* Fix upsampling

* Add special cls token id and test

* Formatting

* Test and fist FunnelTokenizerFast

* Common tests

* Fix the check_repo script and document Funnel

* Doc fixes

* Add all models

* Write doc

* Fix test

* Initial model

* Fix upsampling

* Add special cls token id and test

* Formatting

* Test and fist FunnelTokenizerFast

* Common tests

* Fix the check_repo script and document Funnel

* Doc fixes

* Add all models

* Write doc

* Fix test

* Fix copyright

* Forgot some layers can be repeated

* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/transformers/modeling_funnel.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Address review comments

* Update src/transformers/modeling_funnel.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Address review comments

* Update src/transformers/modeling_funnel.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* Slow integration test

* Make small integration test

* Formatting

* Add checkpoint and separate classification head

* Formatting

* Expand list, fix link and add in pretrained models

* Styling

* Add the model in all summaries

* Typo fixes
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

d155b38d

07 Sep, 2020 1 commit

feat: allow prefix for any generative model (#5885) · 995a958d

Boris Dayma authored Sep 07, 2020



* feat: allow padding_text for any generative model

* docs(pipelines.py): correct typo

* Update src/transformers/pipelines.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* feat: rename padding_text to prefix

* fix: cannot tokenize empty text

* fix: pass prefix arg to pipeline

* test: add prefix to text-generetation pipeline

* style: fix style

* style: clean code and variable name more explicit

* set arg docstring to optional
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

995a958d

04 Sep, 2020 1 commit
- fix (#6946) · e3990d13
  Patrick von Platen authored Sep 04, 2020
  
  e3990d13
03 Sep, 2020 1 commit

Adding the LXMERT pretraining model (MultiModal languageXvision) to... · ea2c6f1a

Antonio V Mendoza authored Sep 03, 2020


Adding the LXMERT pretraining model (MultiModal  languageXvision)  to HuggingFace's suite of models (#5793)

* added template files for LXMERT and competed the configuration_lxmert.py

* added modeling, tokization, testing, and finishing touched for lxmert [yet to be tested]

* added model card for lxmert

* cleaning up lxmert code

* Update src/transformers/modeling_lxmert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/modeling_tf_lxmert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/modeling_tf_lxmert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/modeling_lxmert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* tested torch lxmert, changed documtention, updated outputs, and other small fixes

* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* renaming, other small issues, did not change TF code in this commit

* added lxmert question answering model in pytorch

* added capability to edit number of qa labels for lxmert

* made answer optional for lxmert question answering

* add option to return hidden_states for lxmert

* changed default qa labels for lxmert

* changed config archive path

* squshing 3 commits: merged UI + testing improvments + more UI and testing

* changed some variable names for lxmert

* TF LXMERT

* Various fixes to LXMERT

* Final touches to LXMERT

* AutoTokenizer order

* Add LXMERT to index.rst and README.md

* Merge commit test fixes + Style update

* TensorFlow 2.3.0 sequential model changes variable names

Remove inherited test

* Update src/transformers/modeling_tf_pytorch_utils.py

* Update docs/source/model_doc/lxmert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/model_doc/lxmert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_lxmert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* added suggestions

* Fixes

* Final fixes for TF model

* Fix docs
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

ea2c6f1a

02 Sep, 2020 3 commits

test_tf_common: remove un_used mixin class parameters (#6866) · 4ebb52af
Puneetha Pai authored Sep 02, 2020

4ebb52af

[testing] fix ambiguous test (#6898) · e71f32c0

Stas Bekman authored Sep 02, 2020

Since `generate()` does:
```
        num_beams = num_beams if num_beams is not None else self.config.num_beams
```
This test fails if `model.config.num_beams > 1` (which is the case in the model I'm porting).

This fix makes the test setup unambiguous by passing an explicit `num_beams=1` to `generate()`.

Thanks.

e71f32c0

[pipelines] Text2TextGenerationPipeline (#6744) · 4230d30f

Suraj Patil authored Sep 02, 2020

* add Text2TextGenerationPipeline

* remove max length warning

* remove comments

* remove input_length

* fix typo

* add tests

* use TFAutoModelForSeq2SeqLM

* doc

* typo

* add the doc below TextGenerationPipeline

* doc nit

* style

* delete comment

4230d30f

01 Sep, 2020 1 commit

[Generate] Facilitate PyTorch generate using `ModelOutputs` (#6735) · afc4ece4

Patrick von Platen authored Sep 01, 2020

* fix generate for GPT2 Double Head

* fix gpt2 double head model

* fix  bart / t5

* also add for no beam search

* fix no beam search

* fix encoder decoder

* simplify t5

* simplify t5

* fix t5 tests

* fix BART

* fix transfo-xl

* fix conflict

* integrating sylvains and sams comments

* fix tf past_decoder_key_values

* fix enc dec test

afc4ece4

31 Aug, 2020 2 commits

Fix marian slow test (#6854) · 8af1970e
Sam Shleifer authored Aug 31, 2020

8af1970e

Dataset and DataCollator for BERT Next Sentence Prediction (NSP) task (#6644) · 2de7ee03

Huang Lianzhe authored Aug 31, 2020



* add datacollator and dataset for next sentence prediction task

* bug fix (numbers of special tokens & truncate sequences)

* bug fix (+ dict inputs support for data collator)

* add padding for nsp data collator; renamed cached files to avoid conflict.

* add test for nsp data collator

* Style
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

2de7ee03

30 Aug, 2020 1 commit
- [tests] fix typos in inputs (#6818) · 563485bf
  Stas Bekman authored Aug 30, 2020
  
  563485bf
29 Aug, 2020 1 commit
- Pegasus finetune script: add --adafactor (#6811) · 0f58903b
  Sam Shleifer authored Aug 29, 2020
  
  0f58903b
28 Aug, 2020 5 commits

t5 model should make decoder_attention_mask (#6800) · 3cac867f
Sam Shleifer authored Aug 28, 2020

3cac867f
Fix style (#6803) · 20f77864
Sam Shleifer authored Aug 28, 2020

20f77864

prepare_seq2seq_batch makes labels/ decoder_input_ids made later. (#6654) · 9336086a

Sam Shleifer authored Aug 28, 2020

* broken test

* batch parity

* tests pass

* boom boom

* boom boom

* split out bart tokenizer tests

* fix tests

* boom boom

* Fixed dataset bug

* Fix marian

* Undo extra

* Get marian working

* Fix t5 tok tests

* Test passing

* Cleanup

* better assert msg

* require torch

* Fix mbart tests

* undo extra decoder_attn_mask change

* Fix import

* pegasus tokenizer can ignore src_lang kwargs

* unused kwarg test cov

* boom boom

* add todo for pegasus issue

* cover one word translation edge case

* Cleanup

* doc

9336086a

Transformer-XL: Improved tokenization with sacremoses (#6322) · cb276b41

RafaelWO authored Aug 28, 2020



* Improved tokenization with sacremoses

 * The TransfoXLTokenizer is now using sacremoses for tokenization
 * Added tokenization of comma-separated and floating point numbers.
 * Removed prepare_for_tokenization() from tokenization_transfo_xl.py because punctuation is handled by sacremoses
 * Added corresponding tests
 * Removed test comapring TransfoXLTokenizer and TransfoXLTokenizerFast
 * Added deprecation warning to TransfoXLTokenizerFast

* isort change
Co-authored-by: Teven <teven.lescao@gmail.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

cb276b41

[transformers-cli] fix logger getter (#6777) · 92ac2fa7
Stas Bekman authored Aug 27, 2020

92ac2fa7

27 Aug, 2020 4 commits

Format · 42fddacd
Lysandre authored Aug 27, 2020

42fddacd
[test schedulers] adjust to test the first step's reading (#6429) · dbfe34f2
Stas Bekman authored Aug 27, 2020
```
* [test schedulers] small improvement

* cleanup
```
dbfe34f2
[testing] replace hardcoded paths to allow running tests from anywhere (#6523) · e6b811f0
Stas Bekman authored Aug 27, 2020
```
* [testing] replace hardcoded paths to allow running tests from anywhere

* fix the merge conflict
```
e6b811f0

Add AdaFactor optimizer from fairseq (#6722) · 971d1802

Nikolai Yakovenko authored Aug 27, 2020



* AdaFactor optimizer ported from fairseq. Tested for T5 finetuning and MLM -- reduced memory consumption compared to ADAM.

* update PR fixes, add basic test

* bug -- incorrect params in test

* bugfix -- import Adafactor into test

* bugfix -- removed accidental T5 include

* resetting T5 to master

* bugfix -- include Adafactor in __init__

* longer loop for adafactor test

* remove double error class declare

* lint

* black

* isort

* Update src/transformers/optimization.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

* single docstring

* Cleanup docstring
Co-authored-by: Nikolai Y <nikolai.yakovenko@point72.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>

971d1802

26 Aug, 2020 4 commits

[model_cards] Fix tiny typos · 3242e4d9
Julien Chaumond authored Aug 26, 2020

3242e4d9

[TF Longformer] Improve Speed for TF Longformer (#6447) · 858b7d58

Patrick von Platen authored Aug 26, 2020

* add tf graph compile tests

* fix conflict

* remove more tf transpose statements

* fix conflicts

* fix comment typos

* move function to class function

* fix black

* fix black

* make style

858b7d58

Black 20 release · a75c64d8
Lysandre authored Aug 26, 2020

a75c64d8

Centralize logging (#6434) · 77abd1e7

Lysandre Debut authored Aug 26, 2020



* Logging

* Style

* hf_logging > utils.logging

* Address @thomwolf's comments

* Update test

* Update src/transformers/benchmark/benchmark_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Revert bad change
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

77abd1e7

25 Aug, 2020 3 commits
- T5Tokenizer adds EOS token if not already added (#5866) · 62449570
  Sam Shleifer authored Aug 25, 2020
```
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
```
  62449570
- Fix pegasus-xsum integration test (#6726) · e11d923b
  Sam Shleifer authored Aug 25, 2020
  
  e11d923b
- More tests to Trainer (#6699) · abc02021
  Sylvain Gugger authored Aug 25, 2020
```
* More tests to Trainer

* Add warning in the doc
```
  abc02021
24 Aug, 2020 1 commit
- Update repo to isort v5 (#6686) · a5737779
  Sylvain Gugger authored Aug 24, 2020
```
* Run new isort

* More changes

* Update CI, CONTRIBUTING and benchmarks
```
  a5737779
20 Aug, 2020 6 commits
- Regression test for pegasus bugfix (#6606) · 5bf4465e
  Sam Shleifer authored Aug 20, 2020
  
  5bf4465e
- One last threshold to raise · 86c07e63
  sgugger authored Aug 20, 2020
  
  86c07e63
- Move threshold up for flaky test with Electra (#6622) · e8af90c0
  Sylvain Gugger authored Aug 20, 2020
```
* Move threshold up for flaky test with Electra

* Update above as well
```
  e8af90c0
- [Tests] fix attention masks in Tests (#6621) · 505f2d74
  Patrick von Platen authored Aug 20, 2020
```
* fix distilbert

* fix typo
```
  505f2d74
- Add tests for Reformer tokenizer (#6485) · c9454507
  Denisa Roberts authored Aug 20, 2020
  
  c9454507
- Add tests to Trainer (#6605) · 573bdb0a
  Sylvain Gugger authored Aug 20, 2020
```
* Add tests to Trainer

* Test if removing long breaks everything

* Remove ugly hack

* Fix distributed test

* Use float for number of epochs
```
  573bdb0a