Commits · e19b978151419fe0756ba852b145fccfc96dbeb4 · chenpangpang / transformers

22 May, 2020 15 commits
- Add Type Hints to modeling_utils.py Closes #3911 (#3948) · e19b9781
  Bijay Gurung authored May 23, 2020
```
* Add Type Hints to modeling_utils.py Closes #3911

Add Type Hints to methods in `modeling_utils.py`

Note: The coverage isn't 100%. Mostly skipped internal methods.

* Reformat according to `black` and `isort`

* Use typing.Iterable instead of Sequence

* Parameterize Iterable by its generic type

* Use typing.Optional when None is the default value

* Adhere to style guideline

* Update src/transformers/modeling_utils.py

* Update src/transformers/modeling_utils.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  e19b9781
- Warn the user about max_len being on the path to be deprecated. (#4528) · 996f393a
  Funtowicz Morgan authored May 22, 2020
```
* Warn the user about max_len being on the path to be deprecated.

* Ensure better backward compatibility when max_len is provided to a tokenizer.

* Make sure to override the parameter and not the actual instance value.

* Format & quality
```
  996f393a
- Better github link for Reformer Colab Notebook · 0f6969b7
  Patrick von Platen authored May 22, 2020
  
  0f6969b7
- [Summarization Pipeline]: Fix default tokenizer (#4506) · ab44630d
  Sam Shleifer authored May 22, 2020
```
* Fix pipelines defaults bug

* one liner

* style
```
  ab44630d
- Re-apply #4446 + add packaging dependency · 2c1ebb8b
  Julien Chaumond authored May 22, 2020
```
As discussed w/ @lysandrejik

packaging is maintained by PyPA (the Python Packaging Authority), and should be lightweight and stable
```
  2c1ebb8b
- Style · e6aeb0d3
  Lysandre authored May 22, 2020
  
  e6aeb0d3
- link to paper was broken (#4526) · 95a26fcf
  Alexander Measure authored May 22, 2020
```
changed from https://https://arxiv.org/abs/2001.04451.pdf to https://arxiv.org/abs/2001.04451.pdf
```
  95a26fcf
- Added huseinzol05/t5-small-bahasa-cased README.md (#4522) · 89d795f1
  HUSEIN ZOLKEPLI authored May 23, 2020
  
  89d795f1
- Fix convert_token_type_ids_from_sequences for fast tokenizers (#4503) · 35df9114
  Anthony MOI authored May 22, 2020
  
  35df9114
- [model_cards] bart-large-cnn · f7677e16
  Julien Chaumond authored May 22, 2020
```
cc @sshleifer
```
  f7677e16
- Add Reformer colab to community noteboos · 12e6afe9
  Patrick von Platen authored May 22, 2020
  
  12e6afe9
- Re-pin versions · ef22ba48
  Lysandre authored May 22, 2020
  
  ef22ba48
- Revert #4446 Since it introduces a new dependency · 10d72390
  Lysandre authored May 22, 2020
  
  10d72390
- Release: v2.10.0 · e0db6bbd
  Lysandre authored May 22, 2020
  
  e0db6bbd
- added functionality for electra classification head (#4257) · bd6e3018
  Frankie Liuzzi authored May 22, 2020
```
* added functionality for electra classification head

* unneeded dropout

* Test ELECTRA for sequence classification

* Style
Co-authored-by: Frankie <frankie@frase.io>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
```
  bd6e3018
21 May, 2020 4 commits

Unused Union should not be imported · a0865277
Lysandre authored May 21, 2020

a0865277

TPU hangs when saving optimizer/scheduler (#4467) · 9d2ce253

Lysandre Debut authored May 21, 2020

* TPU hangs when saving optimizer/scheduler

* Style

* ParallelLoader is not a DataLoader

* Style

* Addressing @julien-c's comments

9d2ce253

Adds predict stage for glue tasks, and generate result files which can be... · 49296533

Zhangyx authored May 21, 2020


Adds predict stage for glue tasks, and generate result files which can be submitted to gluebenchmark.com (#4463)

* Adds predict stage for glue tasks, and generate result files which could be submitted to gluebenchmark.com website.

* Use Split enum + always output the label name
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

49296533

[examples] fix no grad in second pruning in run_bertology (#4479) · 271bedb4

Tobias Lee authored May 21, 2020

* fix no grad in second pruning and typo

* fix prune heads attention mismatch problem

* fix

* fix

* fix

* run make style

* run make style

271bedb4

20 May, 2020 13 commits
- [ci] Close #4481 · 865d4d59
  Julien Chaumond authored May 20, 2020
  
  865d4d59
- Update test_trainer_distributed.py · a3af8e86
  Julien Chaumond authored May 20, 2020
  
  a3af8e86
- 🚨 Remove warning of deprecation (#4477) · eacea530
  Cola authored May 21, 2020
```
Remove warning of deprecated overload of addcdiv_

Fix #4451
```
  eacea530
- Better None gradients handling in TF Trainer (#4469) · fa2fbed3
  Julien Plu authored May 20, 2020
```
* Better None gradients handling

* Apply Style

* Apply Style
```
  fa2fbed3
- Correct TF formatting to exclude LayerNorms from weight decay (#4448) · e708bb75
  Oliver Åstrand authored May 20, 2020
```
* Exclude LayerNorms from weight decay

* Include both formats of layer norm
```
  e708bb75
- pass on tokenizer to pipeline (#4489) · 49c06132
  Rens authored May 20, 2020
  
  49c06132
- Add Fine-tune DialoGPT on new datasets notebook (#4473) · cacb654c
  Nathan Cooper authored May 20, 2020
  
  cacb654c
- Adjust german bert model card, add new model card (#4488) · 30a09f38
  Timo Moeller authored May 20, 2020
  
  30a09f38
- Fix slow gpu tests lysandre (#4487) · 14cb5b35
  Lysandre Debut authored May 20, 2020
```
* There is one missing key in BERT

* Correct device for CamemBERT model

* RoBERTa tokenization adding prefix space

* Style
```
  14cb5b35
- Create README.md (#4482) · 6dc52c78
  Manuel Romero authored May 20, 2020
  
  6dc52c78
- Model card for RuPERTa-base fine-tuned for NER (#4466) · ed5456da
  Manuel Romero authored May 20, 2020
  
  ed5456da
- Model card for Tereveni-AI/gpt2-124M-uk-fiction (#4470) · c76450e2
  Oleksandr Bushkovskyi authored May 20, 2020
```
Create model card for "Tereveni-AI/gpt2-124M-uk-fiction" model
```
  c76450e2
- add BERT trained from review corpus. (#4405) · 9907dc52
  Hu Xu authored May 20, 2020
```
* add model_cards for BERT trained on reviews.

* add link to repository.

* refine README.md for each review model
```
  9907dc52
19 May, 2020 8 commits

[MarianTokenizer] implement save_vocabulary and other common methods (#4389) · efbc1c5a
Sam Shleifer authored May 19, 2020

efbc1c5a
[gpu slow tests] fix mbart-large-enro gpu tests (#4472) · 956c4c4e
Sam Shleifer authored May 19, 2020

956c4c4e
[Longformer] Docs and clean API (#4464) · 48c3a70b
Patrick von Platen authored May 19, 2020
```
* add longformer docs

* improve docs
```
48c3a70b
[Tests, GPU, SLOW] fix a bunch of GPU hardcoded tests in Pytorch (#4468) · aa925a52
Patrick von Platen authored May 19, 2020
```
* fix gpu slow tests in pytorch

* change model to device syntax
```
aa925a52

add T5 fine-tuning notebook [Community notebooks] (#4462) · 5856999a

Suraj Patil authored May 19, 2020



* add T5 fine-tuning notebook [Community notebooks]

* Update README.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5856999a

[cleanup] test_tokenization_common.py (#4390) · 07dd7c2f
Sam Shleifer authored May 19, 2020

07dd7c2f

Longformer (#4352) · 8f1d0471

Iz Beltagy authored May 19, 2020

* first commit

* bug fixes

* better examples

* undo padding

* remove wrong VOCAB_FILES_NAMES

* License

* make style

* make isort happy

* unit tests

* integration test

* make `black` happy by undoing `isort` changes!!

* lint

* no need for the padding value

* batch_size not bsz

* remove unused type casting

* seqlen not seq_len

* staticmethod

* `bert` selfattention instead of `n2`

* uint8 instead of bool + lints

* pad inputs_embeds using embeddings not a constant

* black

* unit test with padding

* fix unit tests

* remove redundant unit test

* upload model weights

* resolve todo

* simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_

* increase unittest coverage

8f1d0471

Refactored the README.md file (#4427) · 31eedff5
Girishkumar authored May 19, 2020

31eedff5