"test/vscode:/vscode.git/clone" did not exist on "d26e40ba83848f3f8c9d9d753cb9c51075d1685c"
- 10 Sep, 2020 8 commits
-
Stas Bekman authored
-
Julien Chaumond authored
-
Ashwin Geet Dsa authored
* fix to ensure that tensors returned after tokenization are of type Long
Co-authored-by: Ashwin Geet Dsa <adsa@grvingt-6.nancy.grid5000.fr>
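A minimal sketch of the invariant this fix enforces (model name and input are illustrative, not from the PR): PyTorch embedding lookups require int64 (Long) indices, so tokenizer outputs must not come back as int32.
```python
import torch
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
enc = tokenizer("hello world", return_tensors="pt")

# nn.Embedding requires int64 indices; an int32 tensor would raise at lookup.
input_ids = enc["input_ids"].to(torch.long)
assert input_ids.dtype == torch.long
```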
-
Sylvain Gugger authored
* Add TF Funnel Transformer
* Proper dummy input
* Formatting
* Update src/transformers/modeling_tf_funnel.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Address review comments
* One review comment forgotten
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Patrick von Platen authored
* add conversion script
* improve conversion script
* make style
* add tryout files
* fix
* update
* add causal bert
* better names
* add tokenizer file as well
* finish causal_bert
* fix small bugs
* improve generate
* change naming
* renaming
* renaming
* renaming
* remove leftover files
* clean files
* add fix tokenizer
* finalize
* correct slow test
* update docs
* small fixes
* fix link
* adapt check repo
* apply Sam's and Sylvain's recommendations
* fix import
* implement Lysandre's recommendations
* fix logger warn
-
Yu Liu authored
* add dataset for albert pretrain
* datacollator for albert pretrain
* naming, comprehension, file reading change
* data cleaning is not needed after this modification
* delete prints
* fix a bug
* file structure change
* add tests for albert datacollator
* remove random seed
* add back len and get item function
* sample file for testing and test code added
* format change for black
* more format change
* Style
* var assignment issue resolve
* add back wrongly deleted DataCollatorWithPadding in init file
* Style
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
-
Johann C. Rocholl authored
1. Swapped `missing_keys` and `unexpected_keys`. 2. A copy-and-paste error caused these warnings to say "from TF 2.0" when it's actually "from PyTorch".
-
Stas Bekman authored
-
- 09 Sep, 2020 5 commits
-
Lysandre Debut authored
Batch encode plus and overflowing tokens fails when there are no overflowing tokens for a sequence (#6677)
* Patch and test
* Fix tests
-
Henry Dashwood authored
-
Julien Chaumond authored
-
Stas Bekman authored
Currently beam search returns inconsistent outputs: if hypotheses have different lengths we get eos, if they are the same we don't. This PR makes the output consistent. Also, why not replace:
```
if sent_lengths[i] < max_length:
    decoded[i, sent_lengths[i]] = eos_token_id
```
with:
```
decoded[i, sent_lengths[i]] = eos_token_id
```
Shouldn't eos always be there? If the data gets truncated, the caller needs to use a larger `max_length`. Please correct me if my logic is flawed.
-
Stas Bekman authored
* introduce TRANSFORMERS_VERBOSITY env var + test + test helpers * cleanup * remove helper function
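A usage sketch for the new environment variable; the programmatic call shown is the library's standard logging helper, and the accepted values are assumed to be the usual debug/info/warning/error/critical levels.
```python
# The env var must be set before transformers is imported.
import os
os.environ["TRANSFORMERS_VERBOSITY"] = "error"

from transformers import logging

# Programmatic equivalent of the env var above.
logging.set_verbosity_error()
```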
-
- 08 Sep, 2020 6 commits
-
Patrick von Platen authored
* fix longformer * allow position ids to not be initialized
-
Lysandre Debut authored
* Should check if `torch` is available
* fixed samples_count error, distributed_concat arguments
* style
* Import torch at beginning of file
Co-authored-by: TevenLeScao <teven.lescao@gmail.com>
-
Teven authored
* neFLOs calculation, logging, and reloading (#1)
* testing distributed consecutive batches
* fixed AttributeError from DataParallel
* removed verbosity
* rotate with use_mtime=True
* removed print
* fixed interaction with gradient accumulation
* indent formatting
* distributed neflo counting
* fixed typo
* fixed typo
* mean distributed losses
* exporting log history
* moved a few functions
* floating_point_ops clarification for transformers with parameter-reuse
* code quality
* double import
* made flo estimation more task-agnostic
* only logging flos if computed
* code quality
* unused import
* Update src/transformers/trainer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Sylvain review
* Update src/transformers/modeling_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* black
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
* Initial model
* Fix upsampling
* Add special cls token id and test
* Formatting
* Test and first FunnelTokenizerFast
* Common tests
* Fix the check_repo script and document Funnel
* Doc fixes
* Add all models
* Write doc
* Fix test
* Fix copyright
* Forgot some layers can be repeated
* Apply suggestions from code review
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update src/transformers/modeling_funnel.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Address review comments
* Update src/transformers/modeling_funnel.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Address review comments
* Update src/transformers/modeling_funnel.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
* Slow integration test
* Make small integration test
* Formatting
* Add checkpoint and separate classification head
* Formatting
* Expand list, fix link and add in pretrained models
* Styling
* Add the model in all summaries
* Typo fixes
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
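A hedged usage sketch for the new model; `funnel-transformer/small` is assumed here as one of the checkpoints from the PR's pretrained-model list.
```python
import torch
from transformers import FunnelTokenizer, FunnelModel

tokenizer = FunnelTokenizer.from_pretrained("funnel-transformer/small")
model = FunnelModel.from_pretrained("funnel-transformer/small")

inputs = tokenizer("Hello world", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
print(outputs[0].shape)  # last hidden state: (batch, seq_len, hidden_size)
```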
-
Stuart Mesham authored
* fixed trainer tr_loss memory leak
* detached returned training loss from computation graph in the Trainer class' training_step() method
* Revert "fixed trainer tr_loss memory leak"
This reverts commit 47226e4e
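A self-contained toy illustrating the leak and the fix described above (not the actual Trainer code): accumulating a loss tensor that is still attached to its computation graph keeps every step's graph alive, so memory grows with the number of steps; `.detach()` releases it.
```python
import torch

model = torch.nn.Linear(4, 1)
tr_loss = torch.tensor(0.0)
for _ in range(3):
    loss = model(torch.randn(8, 4)).pow(2).mean()
    loss.backward()
    # The fix: detach before accumulating, so each step's graph can be freed.
    tr_loss += loss.detach()
```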
-
Stas Bekman authored
Apologies for the tiny PRs; just sending these as I find them.
-
- 07 Sep, 2020 9 commits
-
Jangwon Park authored
-
Lysandre Debut authored
-
Stas Bekman authored
* [gen utils] missing else case
1. The `else` branch is missing - I hit that case while porting a model. Probably needs to assert there?
2. Also, the comment on top seems to be outdated (only vocab_size is being set there).
* typo
-
tznurmin authored
-
Stas Bekman authored
* [docstring] missing arg: add the missing `tie_word_embeddings` entry
* cleanup
* Update src/transformers/configuration_reformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Stas Bekman authored
There is no variable `decoder_input_ids`, but there is `input_ids` for the decoder :)
-
Lysandre Debut authored
-
Sylvain Gugger authored
* Add warning for gradient accumulation * Formatting
-
Boris Dayma authored
* feat: allow padding_text for any generative model
* docs(pipelines.py): correct typo
* Update src/transformers/pipelines.py
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
* feat: rename padding_text to prefix
* fix: cannot tokenize empty text
* fix: pass prefix arg to pipeline
* test: add prefix to text-generation pipeline
* style: fix style
* style: clean code and make variable names more explicit
* set arg docstring to optional
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
- 04 Sep, 2020 4 commits
-
Stas Bekman authored
* remove the implied defaults to :obj:`None` * fix bug in the original * replace with :obj:`True`, :obj:`False`
-
Stas Bekman authored
-
Stas Bekman authored
* correct bool types: fix docstring s/int/bool/ * fix description * fix num_labels to match reality
-
Yih-Dar authored
* Remove hard-coded uses of float32 to fix mixed precision use in TF Distilbert
* fix style
* fix gelu dtype issue in TF Distilbert
* fix numeric overflow while using half precision
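An illustrative sketch of the dtype-aware pattern such fixes adopt (not the DistilBERT code itself; the function and names are placeholders): derive the compute dtype from the tensors rather than hard-coding float32, and avoid constants like -1e30 that overflow in float16.
```python
import tensorflow as tf

def mask_attention_scores(scores, mask):
    # Cast the mask to the scores' dtype (float16 under mixed precision).
    mask = tf.cast(mask, dtype=scores.dtype)
    # Use the dtype's own minimum instead of a hard-coded float32 constant,
    # which would overflow in half precision.
    neg_inf = scores.dtype.min
    return scores * mask + neg_inf * (1.0 - mask)
```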
-
- 03 Sep, 2020 2 commits
-
krfricke authored
* move wandb/comet logger init to train() to allow parallel logging * Setup wandb/comet loggers on first call to log()
-
Antonio V Mendoza authored
Adding the LXMERT pretraining model (MultiModal languageXvision) to HuggingFace's suite of models (#5793)
* added template files for LXMERT and completed configuration_lxmert.py
* added modeling, tokenization, testing, and finishing touches for lxmert [yet to be tested]
* added model card for lxmert
* cleaning up lxmert code
* Update src/transformers/modeling_lxmert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/modeling_tf_lxmert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/modeling_tf_lxmert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/modeling_lxmert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* tested torch lxmert, changed documentation, updated outputs, and other small fixes
* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/convert_pytorch_checkpoint_to_tf2.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* renaming, other small issues, did not change TF code in this commit
* added lxmert question answering model in pytorch
* added capability to edit number of qa labels for lxmert
* made answer optional for lxmert question answering
* add option to return hidden_states for lxmert
* changed default qa labels for lxmert
* changed config archive path
* squashing 3 commits: merged UI + testing improvements + more UI and testing
* changed some variable names for lxmert
* TF LXMERT
* Various fixes to LXMERT
* Final touches to LXMERT
* AutoTokenizer order
* Add LXMERT to index.rst and README.md
* Merge commit test fixes + Style update
* TensorFlow 2.3.0 sequential model changes variable names; remove inherited test
* Update src/transformers/modeling_tf_pytorch_utils.py
* Update docs/source/model_doc/lxmert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update docs/source/model_doc/lxmert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_tf_lxmert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* added suggestions
* Fixes
* Final fixes for TF model
* Fix docs
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
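A hedged usage sketch; `unc-nlp/lxmert-base-uncased` is assumed as the released checkpoint, and the visual inputs below are random stand-ins for Faster R-CNN region features, which LXMERT expects alongside text.
```python
import torch
from transformers import LxmertTokenizer, LxmertModel

tokenizer = LxmertTokenizer.from_pretrained("unc-nlp/lxmert-base-uncased")
model = LxmertModel.from_pretrained("unc-nlp/lxmert-base-uncased")

inputs = tokenizer("Which animal is in the picture?", return_tensors="pt")
visual_feats = torch.randn(1, 36, 2048)  # dummy region features (36 boxes)
visual_pos = torch.rand(1, 36, 4)        # dummy normalized box coordinates
outputs = model(**inputs, visual_feats=visual_feats, visual_pos=visual_pos)
```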
-
- 02 Sep, 2020 6 commits
-
Sylvain Gugger authored
* Fix output_attention -> output_attentions * Formatting * One unsaved file
-
Yohei Tamura authored
-
Suraj Patil authored
* add Text2TextGenerationPipeline
* remove max length warning
* remove comments
* remove input_length
* fix typo
* add tests
* use TFAutoModelForSeq2SeqLM
* doc
* typo
* add the doc below TextGenerationPipeline
* doc nit
* style
* delete comment
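A usage sketch for the new pipeline; the `"text2text-generation"` task name comes from this PR, while the model choice is illustrative (any seq2seq LM should work).
```python
from transformers import pipeline

text2text = pipeline("text2text-generation", model="t5-small")
print(text2text("translate English to German: How old are you?"))
# e.g. [{'generated_text': 'Wie alt sind Sie?'}]
```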
-
Prajjwal Bhargava authored
-
Patrick von Platen authored
-
Parthe Pandit authored
Fix typo in the BertForPreTraining example: outptus -> outputs
-