Commits · 2c1ebb8b507c19c75af5084b6c73e0b003c9eda6 · chenpangpang / transformers

22 May, 2020 11 commits
- Re-apply #4446 + add packaging dependency · 2c1ebb8b
  Julien Chaumond authored May 22, 2020
```
As discussed w/ @lysandrejik

packaging is maintained by PyPA (the Python Packaging Authority), and should be lightweight and stable
```
- Style · e6aeb0d3
  Lysandre authored May 22, 2020
- link to paper was broken (#4526) · 95a26fcf
  Alexander Measure authored May 22, 2020
```
changed from https://https://arxiv.org/abs/2001.04451.pdf to https://arxiv.org/abs/2001.04451.pdf
```
- Added huseinzol05/t5-small-bahasa-cased README.md (#4522) · 89d795f1
  HUSEIN ZOLKEPLI authored May 23, 2020
- Fix convert_token_type_ids_from_sequences for fast tokenizers (#4503) · 35df9114
  Anthony MOI authored May 22, 2020
- [model_cards] bart-large-cnn · f7677e16
  Julien Chaumond authored May 22, 2020
```
cc @sshleifer
```
- Add Reformer colab to community noteboos · 12e6afe9
  Patrick von Platen authored May 22, 2020
- Re-pin versions · ef22ba48
  Lysandre authored May 22, 2020
- Revert #4446 Since it introduces a new dependency · 10d72390
  Lysandre authored May 22, 2020
- Release: v2.10.0 · e0db6bbd
  Lysandre authored May 22, 2020
- added functionality for electra classification head (#4257) · bd6e3018
  Frankie Liuzzi authored May 22, 2020
```
* added functionality for electra classification head

* unneeded dropout

* Test ELECTRA for sequence classification

* Style
Co-authored-by: Frankie <frankie@frase.io>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
```
21 May, 2020 4 commits

Unused Union should not be imported · a0865277
Lysandre authored May 21, 2020

TPU hangs when saving optimizer/scheduler (#4467) · 9d2ce253

Lysandre Debut authored May 21, 2020

* TPU hangs when saving optimizer/scheduler

* Style

* ParallelLoader is not a DataLoader

* Style

* Addressing @julien-c's comments

Adds predict stage for glue tasks, and generate result files which can be submitted to gluebenchmark.com (#4463) · 49296533

Zhangyx authored May 21, 2020

* Adds predict stage for glue tasks, and generate result files which could be submitted to gluebenchmark.com website.

* Use Split enum + always output the label name
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

[examples] fix no grad in second pruning in run_bertology (#4479) · 271bedb4

Tobias Lee authored May 21, 2020

* fix no grad in second pruning and typo

* fix prune heads attention mismatch problem

* fix

* fix

* fix

* run make style

* run make style

20 May, 2020 13 commits
- [ci] Close #4481 · 865d4d59
  Julien Chaumond authored May 20, 2020
- Update test_trainer_distributed.py · a3af8e86
  Julien Chaumond authored May 20, 2020
- 🚨 Remove warning of deprecation (#4477) · eacea530
  Cola authored May 21, 2020
```
Remove warning of deprecated overload of addcdiv_

Fix #4451
```
- Better None gradients handling in TF Trainer (#4469) · fa2fbed3
  Julien Plu authored May 20, 2020
```
* Better None gradients handling

* Apply Style

* Apply Style
```
- Correct TF formatting to exclude LayerNorms from weight decay (#4448) · e708bb75
  Oliver Åstrand authored May 20, 2020
```
* Exclude LayerNorms from weight decay

* Include both formats of layer norm
```
- pass on tokenizer to pipeline (#4489) · 49c06132
  Rens authored May 20, 2020
- Add Fine-tune DialoGPT on new datasets notebook (#4473) · cacb654c
  Nathan Cooper authored May 20, 2020
- Adjust german bert model card, add new model card (#4488) · 30a09f38
  Timo Moeller authored May 20, 2020
- Fix slow gpu tests lysandre (#4487) · 14cb5b35
  Lysandre Debut authored May 20, 2020
```
* There is one missing key in BERT

* Correct device for CamemBERT model

* RoBERTa tokenization adding prefix space

* Style
```
- Create README.md (#4482) · 6dc52c78
  Manuel Romero authored May 20, 2020
- Model card for RuPERTa-base fine-tuned for NER (#4466) · ed5456da
  Manuel Romero authored May 20, 2020
- Model card for Tereveni-AI/gpt2-124M-uk-fiction (#4470) · c76450e2
  Oleksandr Bushkovskyi authored May 20, 2020
```
Create model card for "Tereveni-AI/gpt2-124M-uk-fiction" model
```
- add BERT trained from review corpus. (#4405) · 9907dc52
  Hu Xu authored May 20, 2020
```
* add model_cards for BERT trained on reviews.

* add link to repository.

* refine README.md for each review model
```
19 May, 2020 12 commits

[MarianTokenizer] implement save_vocabulary and other common methods (#4389) · efbc1c5a
Sam Shleifer authored May 19, 2020

[gpu slow tests] fix mbart-large-enro gpu tests (#4472) · 956c4c4e
Sam Shleifer authored May 19, 2020

[Longformer] Docs and clean API (#4464) · 48c3a70b
Patrick von Platen authored May 19, 2020
```
* add longformer docs

* improve docs
```
[Tests, GPU, SLOW] fix a bunch of GPU hardcoded tests in Pytorch (#4468) · aa925a52
Patrick von Platen authored May 19, 2020
```
* fix gpu slow tests in pytorch

* change model to device syntax
```

add T5 fine-tuning notebook [Community notebooks] (#4462) · 5856999a

Suraj Patil authored May 19, 2020

* add T5 fine-tuning notebook [Community notebooks]

* Update README.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

[cleanup] test_tokenization_common.py (#4390) · 07dd7c2f
Sam Shleifer authored May 19, 2020

Longformer (#4352) · 8f1d0471

Iz Beltagy authored May 19, 2020

* first commit

* bug fixes

* better examples

* undo padding

* remove wrong VOCAB_FILES_NAMES

* License

* make style

* make isort happy

* unit tests

* integration test

* make `black` happy by undoing `isort` changes!!

* lint

* no need for the padding value

* batch_size not bsz

* remove unused type casting

* seqlen not seq_len

* staticmethod

* `bert` selfattention instead of `n2`

* uint8 instead of bool + lints

* pad inputs_embeds using embeddings not a constant

* black

* unit test with padding

* fix unit tests

* remove redundant unit test

* upload model weights

* resolve todo

* simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_

* increase unittest coverage

Refactored the README.md file (#4427) · 31eedff5
Girishkumar authored May 19, 2020

Map optimizer to correct device after loading from checkpoint. (#4403) · 384f0eb2

Shaoyen authored May 18, 2020

* Map optimizer to correct device after loading from checkpoint.

* Make style test pass
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

[Trainer] move model to device before setting optimizer (#4450) · bf14ef75
Julien Chaumond authored May 18, 2020

Distributed eval: SequentialDistributedSampler + gather all results (#4243) · 5e7fe8b5

Julien Chaumond authored May 18, 2020

* Distributed eval: SequentialDistributedSampler + gather all results

* For consistency only write to disk from world_master

Close https://github.com/huggingface/transformers/issues/4272

* Working distributed eval

* Hook into scripts

* Fix #3721 again

* TPU.mesh_reduce: stay in tensor space

Thanks @jysohn23

* Just a small comment

* whitespace

* torch.hub: pip install packaging

* Add test scenarii

Fix nn.DataParallel compatibility in PyTorch 1.5 (#4300) · 4c068936

Julien Chaumond authored May 18, 2020

* Test case for #3936

* multigpu tests pass on pytorch 1.4.0

* Fixup

* multigpu tests pass on pytorch 1.5.0

* Update src/transformers/modeling_utils.py

* Update src/transformers/modeling_utils.py

* rename multigpu to require_multigpu

* mode doc