Commits · 3e3e552125e86824239e445dd3c659df0aea4db9 · chenpangpang / transformers

25 May, 2020 9 commits

[Reformer] fix reformer num buckets (#4564) · 3e3e5521
Patrick von Platen authored May 25, 2020
```
* fix reformer num buckets

* fix

* adapt docs

* set num buckets in config
```
3e3e5521
fixing tokenization of extra_id symbols in T5Tokenizer. Related to issue 4021 (#4353) · 3dea40b8
Elman Mansimov authored May 25, 2020

3dea40b8
LongformerTokenizerFast (#4547) · 51397336
Suraj Patil authored May 26, 2020

51397336
Updated the link to the paper (#4570) · c9c385c5
Oliver Guhr authored May 25, 2020
```
I looks like the conference has changed the link to the paper.
```
c9c385c5
Add nn.Module as superclass (#4533) · adab7f83
Sho Arora authored May 25, 2020

adab7f83
Create model card (#4578) · 8f7c1c76
Manuel Romero authored May 25, 2020

8f7c1c76
Update README.md (#4556) · 4c6b2180
Ali Safaya authored May 25, 2020

4c6b2180
add DistilBERT to supported models (#4558) · 50d1ce41
Antonis Maronikolakis authored May 25, 2020

50d1ce41

Longformer for question answering (#4500) · 03d8527d

Suraj Patil authored May 25, 2020

* added LongformerForQuestionAnswering

* add LongformerForQuestionAnswering

* fix import for LongformerForMaskedLM

* add LongformerForQuestionAnswering

* hardcoded sep_token_id

* compute attention_mask if not provided

* combine global_attention_mask with attention_mask when provided

* update example in  docstring

* add assert error messages, better attention combine

* add test for longformerForQuestionAnswering

* typo

* cast gloabl_attention_mask to long

* make style

* Update src/transformers/configuration_longformer.py

* Update src/transformers/configuration_longformer.py

* fix the code quality

* Merge branch 'longformer-for-question-answering' of https://github.com/patil-suraj/transformers

 into longformer-for-question-answering
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

03d8527d

23 May, 2020 1 commit
- DOC: Fix typos in modeling_auto (#4534) · a34a9896
  Bharat Raghunathan authored May 23, 2020
  
  a34a9896
22 May, 2020 15 commits
- Add Type Hints to modeling_utils.py Closes #3911 (#3948) · e19b9781
  Bijay Gurung authored May 23, 2020
```
* Add Type Hints to modeling_utils.py Closes #3911

Add Type Hints to methods in `modeling_utils.py`

Note: The coverage isn't 100%. Mostly skipped internal methods.

* Reformat according to `black` and `isort`

* Use typing.Iterable instead of Sequence

* Parameterize Iterable by its generic type

* Use typing.Optional when None is the default value

* Adhere to style guideline

* Update src/transformers/modeling_utils.py

* Update src/transformers/modeling_utils.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
```
  e19b9781
- Warn the user about max_len being on the path to be deprecated. (#4528) · 996f393a
  Funtowicz Morgan authored May 22, 2020
```
* Warn the user about max_len being on the path to be deprecated.

* Ensure better backward compatibility when max_len is provided to a tokenizer.

* Make sure to override the parameter and not the actual instance value.

* Format & quality
```
  996f393a
- Better github link for Reformer Colab Notebook · 0f6969b7
  Patrick von Platen authored May 22, 2020
  
  0f6969b7
- [Summarization Pipeline]: Fix default tokenizer (#4506) · ab44630d
  Sam Shleifer authored May 22, 2020
```
* Fix pipelines defaults bug

* one liner

* style
```
  ab44630d
- Re-apply #4446 + add packaging dependency · 2c1ebb8b
  Julien Chaumond authored May 22, 2020
```
As discussed w/ @lysandrejik

packaging is maintained by PyPA (the Python Packaging Authority), and should be lightweight and stable
```
  2c1ebb8b
- Style · e6aeb0d3
  Lysandre authored May 22, 2020
  
  e6aeb0d3
- link to paper was broken (#4526) · 95a26fcf
  Alexander Measure authored May 22, 2020
```
changed from https://https://arxiv.org/abs/2001.04451.pdf to https://arxiv.org/abs/2001.04451.pdf
```
  95a26fcf
- Added huseinzol05/t5-small-bahasa-cased README.md (#4522) · 89d795f1
  HUSEIN ZOLKEPLI authored May 23, 2020
  
  89d795f1
- Fix convert_token_type_ids_from_sequences for fast tokenizers (#4503) · 35df9114
  Anthony MOI authored May 22, 2020
  
  35df9114
- [model_cards] bart-large-cnn · f7677e16
  Julien Chaumond authored May 22, 2020
```
cc @sshleifer
```
  f7677e16
- Add Reformer colab to community noteboos · 12e6afe9
  Patrick von Platen authored May 22, 2020
  
  12e6afe9
- Re-pin versions · ef22ba48
  Lysandre authored May 22, 2020
  
  ef22ba48
- Revert #4446 Since it introduces a new dependency · 10d72390
  Lysandre authored May 22, 2020
  
  10d72390
- Release: v2.10.0 · e0db6bbd
  Lysandre authored May 22, 2020
  
  e0db6bbd
- added functionality for electra classification head (#4257) · bd6e3018
  Frankie Liuzzi authored May 22, 2020
```
* added functionality for electra classification head

* unneeded dropout

* Test ELECTRA for sequence classification

* Style
Co-authored-by: Frankie <frankie@frase.io>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
```
  bd6e3018
21 May, 2020 4 commits

Unused Union should not be imported · a0865277
Lysandre authored May 21, 2020

a0865277

TPU hangs when saving optimizer/scheduler (#4467) · 9d2ce253

Lysandre Debut authored May 21, 2020

* TPU hangs when saving optimizer/scheduler

* Style

* ParallelLoader is not a DataLoader

* Style

* Addressing @julien-c's comments

9d2ce253

Adds predict stage for glue tasks, and generate result files which can be... · 49296533

Zhangyx authored May 21, 2020


Adds predict stage for glue tasks, and generate result files which can be submitted to gluebenchmark.com (#4463)

* Adds predict stage for glue tasks, and generate result files which could be submitted to gluebenchmark.com website.

* Use Split enum + always output the label name
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

49296533

[examples] fix no grad in second pruning in run_bertology (#4479) · 271bedb4

Tobias Lee authored May 21, 2020

* fix no grad in second pruning and typo

* fix prune heads attention mismatch problem

* fix

* fix

* fix

* run make style

* run make style

271bedb4

20 May, 2020 11 commits
- [ci] Close #4481 · 865d4d59
  Julien Chaumond authored May 20, 2020
  
  865d4d59
- Update test_trainer_distributed.py · a3af8e86
  Julien Chaumond authored May 20, 2020
  
  a3af8e86
- 🚨 Remove warning of deprecation (#4477) · eacea530
  Cola authored May 21, 2020
```
Remove warning of deprecated overload of addcdiv_

Fix #4451
```
  eacea530
- Better None gradients handling in TF Trainer (#4469) · fa2fbed3
  Julien Plu authored May 20, 2020
```
* Better None gradients handling

* Apply Style

* Apply Style
```
  fa2fbed3
- Correct TF formatting to exclude LayerNorms from weight decay (#4448) · e708bb75
  Oliver Åstrand authored May 20, 2020
```
* Exclude LayerNorms from weight decay

* Include both formats of layer norm
```
  e708bb75
- pass on tokenizer to pipeline (#4489) · 49c06132
  Rens authored May 20, 2020
  
  49c06132
- Add Fine-tune DialoGPT on new datasets notebook (#4473) · cacb654c
  Nathan Cooper authored May 20, 2020
  
  cacb654c
- Adjust german bert model card, add new model card (#4488) · 30a09f38
  Timo Moeller authored May 20, 2020
  
  30a09f38
- Fix slow gpu tests lysandre (#4487) · 14cb5b35
  Lysandre Debut authored May 20, 2020
```
* There is one missing key in BERT

* Correct device for CamemBERT model

* RoBERTa tokenization adding prefix space

* Style
```
  14cb5b35
- Create README.md (#4482) · 6dc52c78
  Manuel Romero authored May 20, 2020
  
  6dc52c78
- Model card for RuPERTa-base fine-tuned for NER (#4466) · ed5456da
  Manuel Romero authored May 20, 2020
  
  ed5456da