- 26 May, 2020 3 commits
-
-
Patrick von Platen authored
* add new longformer for question answering model * add new config as well * fix links * fix links part 2
-
ZhuBaohe authored
* fix * fix1
-
ZhuBaohe authored
-
- 25 May, 2020 6 commits
-
-
Sam Shleifer authored
-
Patrick von Platen authored
* fix reformer num buckets * fix * adapt docs * set num buckets in config
-
Elman Mansimov authored
-
Suraj Patil authored
-
Sho Arora authored
-
Suraj Patil authored
* added LongformerForQuestionAnswering
* add LongformerForQuestionAnswering
* fix import for LongformerForMaskedLM
* add LongformerForQuestionAnswering
* hardcoded sep_token_id
* compute attention_mask if not provided
* combine global_attention_mask with attention_mask when provided
* update example in docstring
* add assert error messages, better attention combine
* add test for LongformerForQuestionAnswering
* typo
* cast global_attention_mask to long
* make style
* Update src/transformers/configuration_longformer.py
* Update src/transformers/configuration_longformer.py
* fix the code quality
* Merge branch 'longformer-for-question-answering' of https://github.com/patil-suraj/transformers into longformer-for-question-answering
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
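The mask handling this entry describes — give question tokens (up to the hardcoded separator) global attention, then fold that into the regular attention mask — can be sketched in plain Python. Names and the separator id are illustrative, not the library's actual code:

```python
SEP_TOKEN_ID = 2  # assumption: RoBERTa-style </s> id, hardcoded as in the commit

def question_global_attention_mask(input_ids, sep_token_id=SEP_TOKEN_ID):
    """1 on question tokens (up to and including the first separator), 0 elsewhere."""
    mask = []
    for seq in input_ids:
        first_sep = seq.index(sep_token_id)
        mask.append([1 if i <= first_sep else 0 for i in range(len(seq))])
    return mask

def merge_masks(attention_mask, global_attention_mask):
    """Combine both masks into one: 0 = padding, 1 = local, 2 = global attention."""
    return [[a * (g + 1) for a, g in zip(a_row, g_row)]
            for a_row, g_row in zip(attention_mask, global_attention_mask)]
```

The multiplicative combine keeps padding positions at 0 while promoting question tokens from local (1) to global (2) attention.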
-
- 23 May, 2020 1 commit
-
-
Bharat Raghunathan authored
-
- 22 May, 2020 9 commits
-
-
Bijay Gurung authored
* Add type hints to modeling_utils.py (closes #3911). Note: the coverage isn't 100%; internal methods were mostly skipped.
* Reformat according to `black` and `isort`
* Use typing.Iterable instead of Sequence
* Parameterize Iterable by its generic type
* Use typing.Optional when None is the default value
* Adhere to style guideline
* Update src/transformers/modeling_utils.py
* Update src/transformers/modeling_utils.py
Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Funtowicz Morgan authored
* Warn the user that max_len is on the path to deprecation. * Ensure better backward compatibility when max_len is provided to a tokenizer. * Make sure to override the parameter and not the actual instance value. * Format & quality
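The backward-compatibility handling this entry describes can be sketched as follows. The class, warning text, and precedence rule here are illustrative assumptions, not the library's actual code:

```python
import warnings

class TokenizerShim:
    """Illustrative shim: accept the legacy `max_len` kwarg with a warning,
    but let an explicit `model_max_length` override it (i.e. override the
    parameter, not the instance value)."""

    def __init__(self, model_max_length=None, **kwargs):
        max_len = kwargs.pop("max_len", None)
        if max_len is not None:
            warnings.warn(
                "`max_len` is deprecated; use `model_max_length` instead.",
                FutureWarning,
            )
            if model_max_length is None:
                model_max_length = max_len
        self.model_max_length = model_max_length
```

Old callers passing `max_len=512` keep working (with a `FutureWarning`), while the new argument always wins when both are given.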
-
Sam Shleifer authored
* Fix pipelines defaults bug * one liner * style
-
Julien Chaumond authored
As discussed w/ @lysandrejik, `packaging` is maintained by PyPA (the Python Packaging Authority) and should be lightweight and stable
-
Lysandre authored
-
Anthony MOI authored
-
Lysandre authored
-
Lysandre authored
-
Frankie Liuzzi authored
* added functionality for electra classification head * unneeded dropout * Test ELECTRA for sequence classification * Style Co-authored-by:
Frankie <frankie@frase.io> Co-authored-by:
Lysandre <lysandre.debut@reseau.eseo.fr>
-
- 21 May, 2020 3 commits
-
-
Lysandre authored
-
Lysandre Debut authored
* TPU hangs when saving optimizer/scheduler * Style * ParallelLoader is not a DataLoader * Style * Addressing @julien-c's comments
-
Zhangyx authored
Adds a predict stage for GLUE tasks, and generates result files which can be submitted to the gluebenchmark.com website (#4463) * Use Split enum + always output the label name Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
- 20 May, 2020 4 commits
-
-
Cola authored
Remove warning about the deprecated overload of `addcdiv_`. Fixes #4451
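For context: the deprecated overload took the scalar positionally (`p.addcdiv_(value, t1, t2)`), and the fix is to use the keyword form `p.addcdiv_(t1, t2, value=value)` instead. A plain-Python model of what the operation computes:

```python
def addcdiv_(p, tensor1, tensor2, value=1.0):
    """In-place p[i] += value * tensor1[i] / tensor2[i], mirroring the semantics
    of torch's Tensor.addcdiv_ on flat lists (illustrative only)."""
    for i in range(len(p)):
        p[i] += value * tensor1[i] / tensor2[i]
    return p
```

In an Adam-style update this is the `p -= lr * exp_avg / denom` step, with `value=-step_size`.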
-
Julien Plu authored
* Better None gradients handling * Apply Style * Apply Style
-
Oliver Åstrand authored
* Exclude LayerNorms from weight decay * Include both formats of layer norm
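The exclusion described above is the usual two-group optimizer setup: parameters whose names match a "no decay" pattern (biases, plus both LayerNorm naming styles) go into a weight-decay-free group. A sketch over (name, parameter) pairs, with illustrative names:

```python
NO_DECAY = ("bias", "LayerNorm.weight", "layer_norm.weight")

def grouped_parameters(named_params, weight_decay=0.01):
    """Split parameters into a decayed and an undecayed optimizer group."""
    decay, no_decay = [], []
    for name, param in named_params:
        (no_decay if any(nd in name for nd in NO_DECAY) else decay).append(param)
    return [
        {"params": decay, "weight_decay": weight_decay},
        {"params": no_decay, "weight_decay": 0.0},
    ]
```

The returned list can be passed directly to a PyTorch optimizer, which applies per-group `weight_decay`.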
-
Rens authored
-
- 19 May, 2020 10 commits
-
-
Sam Shleifer authored
-
Patrick von Platen authored
* add longformer docs * improve docs
-
Patrick von Platen authored
* fix gpu slow tests in pytorch * change model to device syntax
-
Sam Shleifer authored
-
Iz Beltagy authored
* first commit
* bug fixes
* better examples
* undo padding
* remove wrong VOCAB_FILES_NAMES
* License
* make style
* make isort happy
* unit tests
* integration test
* make `black` happy by undoing `isort` changes!!
* lint
* no need for the padding value
* batch_size not bsz
* remove unused type casting
* seqlen not seq_len
* staticmethod
* `bert` selfattention instead of `n2`
* uint8 instead of bool + lints
* pad inputs_embeds using embeddings not a constant
* black
* unit test with padding
* fix unit tests
* remove redundant unit test
* upload model weights
* resolve todo
* simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_
* increase unittest coverage
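One detail mentioned in this entry — padding inputs so the sequence length is a multiple of the sliding attention window — comes down to a small length computation. A sketch (function name and pad id are illustrative):

```python
def pad_to_window_multiple(ids, window, pad_id=1):
    """Right-pad so len(ids) is a multiple of the sliding attention window,
    which Longformer-style chunked attention requires."""
    pad = (window - len(ids) % window) % window
    return ids + [pad_id] * pad
```

The outer `% window` keeps already-aligned sequences unpadded.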
-
Shaoyen authored
* Map optimizer to correct device after loading from checkpoint. * Make style test pass Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Julien Chaumond authored
-
Julien Chaumond authored
* Distributed eval: SequentialDistributedSampler + gather all results
* For consistency, only write to disk from world_master (closes https://github.com/huggingface/transformers/issues/4272)
* Working distributed eval
* Hook into scripts
* Fix #3721 again
* TPU.mesh_reduce: stay in tensor space (thanks @jysohn23)
* Just a small comment
* whitespace
* torch.hub: pip install packaging
* Add test scenarios
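The sampler this entry introduces hands each rank a contiguous, equal-size shard of the dataset so per-rank results can be gathered back in order. A sketch of the index math under that assumption (padding details may differ from the library's sampler):

```python
def sequential_shard(num_samples, world_size, rank):
    """Contiguous per-rank index shard; the tail wraps around so every rank
    yields the same count, letting gathered results be concatenated in
    dataset order and truncated back to num_samples."""
    per_rank = -(-num_samples // world_size)  # ceil division
    total = per_rank * world_size
    indices = list(range(num_samples))
    indices += indices[: total - num_samples]  # pad by repeating early indices
    return indices[rank * per_rank:(rank + 1) * per_rank]
```

Equal shard sizes matter because collective gather ops expect every rank to contribute the same number of elements.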
-
Julien Chaumond authored
* Test case for #3936 * multigpu tests pass on pytorch 1.4.0 * Fixup * multigpu tests pass on pytorch 1.5.0 * Update src/transformers/modeling_utils.py * Update src/transformers/modeling_utils.py * rename multigpu to require_multigpu * more doc
-
Rakesh Chada authored
* makes fetching last learning rate in trainer backward compatible * split comment to multiple lines * fixes black styling issue * uses version to create a more explicit logic
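The compatibility issue here is that `get_last_lr()` only exists on newer PyTorch schedulers, while older ones expose `get_lr()`. The commit gates on the torch version; this sketch uses duck typing instead (the dummy scheduler classes are purely illustrative):

```python
def last_learning_rate(scheduler):
    """Prefer `get_last_lr()` (torch >= 1.4); fall back to `get_lr()` on
    older schedulers that lack it."""
    if hasattr(scheduler, "get_last_lr"):
        return scheduler.get_last_lr()[0]
    return scheduler.get_lr()[0]

class NewScheduler:  # stands in for a torch >= 1.4 scheduler
    def get_last_lr(self):
        return [3e-5]

class OldScheduler:  # stands in for an older scheduler
    def get_lr(self):
        return [5e-5]
```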
-
- 18 May, 2020 4 commits
-
-
Funtowicz Morgan authored
* Adding optimizations block from ONNXRuntime. * Turn off external data format by default for PyTorch export. * Correct the way use_external_format is passed through the cmdline args.
-
Patrick von Platen authored
-
Patrick von Platen authored
-
Patrick von Platen authored
* fix fp16 in t5 * make style * refactor invert_attention_mask fn * fix typo
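The fp16 breakage this entry fixes is the classic one: a fixed fp32-scale fill value like -1e9 in the additive attention bias does not fit in float16 (whose minimum is -65504), so the mask must use a dtype-dependent value. A plain-Python sketch of the idea; the constants and names are illustrative, and the exact value the library uses may differ:

```python
# Dtype-aware "minus infinity" stand-ins for additive attention masks.
DTYPE_MIN = {"float32": -3.4028235e38, "float16": -65504.0}

def extended_attention_mask(attention_mask, dtype="float32"):
    """Additive bias: 0 where attending is allowed, a large (but
    representable) negative value where it is masked out."""
    neg = DTYPE_MIN[dtype]
    return [(1 - m) * neg for m in attention_mask]
```

Adding this bias before softmax drives masked positions to ~0 probability without overflowing in half precision.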
-