- 11 Sep, 2020 3 commits
-
Sylvain Gugger authored
* More readable dict
* More nlp -> datasets
* Revert "More nlp -> datasets"
  This reverts commit 3cd1883d226c63c4a686fc1fed35f2cd586ebe45.
* Automate the lists in auto-xxx docs
* More readable dict
* Revert "More nlp -> datasets"
  This reverts commit 3cd1883d226c63c4a686fc1fed35f2cd586ebe45.
* Automate the lists in auto-xxx docs
* nlp -> datasets
* Fix new key
-
Sylvain Gugger authored
-
Patrick von Platen authored
-
- 10 Sep, 2020 24 commits
-
Stas Bekman authored
* these tests require non-multigpu env
* cleanup
* clarify
-
Sam Shleifer authored
-
Sylvain Gugger authored
* nlp -> datasets
* More nlp -> datasets
* Woopsie
* More nlp -> datasets
* One last
-
Sam Shleifer authored
-
Stas Bekman authored
-
Julien Chaumond authored
-
Patrick von Platen authored
* correct docs for bert generation
* upload
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Ashwin Geet Dsa authored
* fix to ensure that tensors returned after tokenization are Long
* fix to ensure that tensors returned after tokenization are Long

Co-authored-by: Ashwin Geet Dsa <adsa@grvingt-6.nancy.grid5000.fr>
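For context, a generic PyTorch illustration (not the patched code) of why the returned ids must be Long: embedding lookups require integer index tensors.
```python
import torch
import torch.nn as nn

emb = nn.Embedding(num_embeddings=10, embedding_dim=4)

ids = torch.tensor([[1, 2, 3]], dtype=torch.long)  # tokenizer output must be Long
out = emb(ids)  # works: shape (1, 3, 4)

bad_ids = torch.tensor([[1.0, 2.0, 3.0]])
# emb(bad_ids) would raise a RuntimeError about the index dtype
```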
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Sylvain Gugger authored
* Add TF Funnel Transformer
* Proper dummy input
* Formatting
* Update src/transformers/modeling_tf_funnel.py
  Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Address review comments
* One review comment forgotten

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Patrick von Platen authored
* add conversion script
* improve conversion script
* make style
* add tryout files
* fix
* update
* add causal bert
* better names
* add tokenizer file as well
* finish causal_bert
* fix small bugs
* improve generate
* change naming
* renaming
* renaming
* renaming
* remove leftover files
* clean files
* add fix tokenizer
* finalize
* correct slow test
* update docs
* small fixes
* fix link
* adapt check repo
* apply sams and sylvains recommendations
* fix import
* implement Lysandres recommendations
* fix logger warn
-
Sylvain Gugger authored
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Lysandre Debut authored
-
Yu Liu authored
* add dataset for albert pretrain
* datacollator for albert pretrain
* naming, comprehension, file reading change
* data cleaning is not needed after this modification
* delete prints
* fix a bug
* file structure change
* add tests for albert datacollator
* remove random seed
* add back len and get item function
* sample file for testing and test code added
* format change for black
* more format change
* Style
* var assignment issue resolve
* add back wrongly deleted DataCollatorWithPadding in init file
* Style

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
-
Johann C. Rocholl authored
1. Swapped missing_keys and unexpected_keys.
2. A copy-and-paste error caused these warnings to say "from TF 2.0" when it's actually "from PyTorch".
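For context, these are the two key sets PyTorch reports when a checkpoint is loaded non-strictly; a minimal sketch with a made-up toy model and state dict, not the transformers loading code:
```python
import torch
import torch.nn as nn

model = nn.Linear(4, 2)
# Checkpoint with no "bias" entry and one stray key:
state_dict = {"weight": torch.zeros(2, 4), "extra": torch.zeros(1)}

result = model.load_state_dict(state_dict, strict=False)
print(result.missing_keys)     # ['bias']  - expected by the model, absent from the checkpoint
print(result.unexpected_keys)  # ['extra'] - present in the checkpoint, unused by the model
```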
-
Stas Bekman authored
-
- 09 Sep, 2020 6 commits
-
Patrick von Platen authored
-
Lysandre Debut authored
Batch encode plus with overflowing tokens fails when a sequence has no overflowing tokens (#6677)
* Patch and test
* Fix tests
-
Henry Dashwood authored
-
Julien Chaumond authored
-
Stas Bekman authored
Currently beam search returns inconsistent outputs - if hypos have different lengths we get eos; if they are the same, we don't. This PR makes the output consistent.

Also, why not replace:
```
if sent_lengths[i] < max_length:
    decoded[i, sent_lengths[i]] = eos_token_id
```
with:
```
decoded[i, sent_lengths[i]] = eos_token_id
```
Shouldn't eos always be there? If the data gets truncated, the caller needs to use a larger `max_length`. Please correct me if my logic is flawed.
-
Stas Bekman authored
* introduce TRANSFORMERS_VERBOSITY env var + test + test helpers
* cleanup
* remove helper function
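A short usage sketch of the new variable; the Python calls assume the `transformers.utils.logging` helpers that ship with the library, and `my_script.py` is a placeholder:
```python
# Shell: set the library's log level before the process starts, e.g.
#   TRANSFORMERS_VERBOSITY=error python my_script.py

# Python: the equivalent programmatic setters
from transformers.utils import logging

logging.set_verbosity_error()   # only report errors
print(logging.get_verbosity())  # 40, the standard logging ERROR level
```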
-
- 08 Sep, 2020 7 commits
-
Sam Shleifer authored
-
Patrick von Platen authored
* fix longformer
* allow position ids to not be initialized
-
Lysandre Debut authored
* Should check if `torch` is available
* fixed samples_count error, distributed_concat arguments
* style
* Import torch at beginning of file

Co-authored-by: TevenLeScao <teven.lescao@gmail.com>
-
Teven authored
* neFLOs calculation, logging, and reloading (#1)
* testing distributed consecutive batches
* fixed AttributeError from DataParallel
* removed verbosity
* rotate with use_mtime=True
* removed print
* fixed interaction with gradient accumulation
* indent formatting
* distributed neflo counting
* fixed typo
* fixed typo
* mean distributed losses
* exporting log history
* moved a few functions
* floating_point_ops clarification for transformers with parameter-reuse
* code quality
* double import
* made flo estimation more task-agnostic
* only logging flos if computed
* code quality
* unused import
* Update src/transformers/trainer.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update src/transformers/modeling_utils.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Sylvain review
* Update src/transformers/modeling_utils.py
  Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* black

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Sylvain Gugger authored
* Initial model
* Fix upsampling
* Add special cls token id and test
* Formatting
* Test and first FunnelTokenizerFast
* Common tests
* Fix the check_repo script and document Funnel
* Doc fixes
* Add all models
* Write doc
* Fix test
* Initial model
* Fix upsampling
* Add special cls token id and test
* Formatting
* Test and first FunnelTokenizerFast
* Common tests
* Fix the check_repo script and document Funnel
* Doc fixes
* Add all models
* Write doc
* Fix test
* Fix copyright
* Forgot some layers can be repeated
* Apply suggestions from code review
  Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update src/transformers/modeling_funnel.py
  Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Address review comments
* Update src/transformers/modeling_funnel.py
  Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Address review comments
* Update src/transformers/modeling_funnel.py
  Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
* Slow integration test
* Make small integration test
* Formatting
* Add checkpoint and separate classification head
* Formatting
* Expand list, fix link and add in pretrained models
* Styling
* Add the model in all summaries
* Typo fixes

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
Stuart Mesham authored
* fixed trainer tr_loss memory leak
* detached returned training loss from computation graph in the Trainer class' training_step() method
* Revert "fixed trainer tr_loss memory leak"
  This reverts commit 47226e4e
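The underlying issue, as a minimal sketch with illustrative names (not the actual `Trainer` code): a loss tensor still attached to the autograd graph keeps that graph alive, so accumulating it across steps grows memory; detaching first keeps only the value.
```python
import torch
import torch.nn as nn

model = nn.Linear(8, 1)
batch = torch.randn(4, 8)

def training_step(model, batch):
    loss = model(batch).mean()  # illustrative forward pass producing a scalar loss
    loss.backward()
    # Returning `loss` as-is keeps its computation graph reachable from the
    # accumulator below, so memory grows every step; detach to keep only the value.
    return loss.detach()

tr_loss = torch.tensor(0.0)
tr_loss += training_step(model, batch)  # safe: the graph can be freed each step
```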
-
Manuel Romero authored
-