Commits · d4c2cb402d6674211726fd5f4803d1090664e438 · chenpangpang / transformers

02 Jun, 2020 1 commit

Kill model archive maps (#4636) · d4c2cb40

Julien Chaumond authored Jun 02, 2020

* Kill model archive maps

* Fixup

* Also kill model_archive_map for MaskedBertPreTrainedModel

* Unhook config_archive_map

* Tokenizers: align with model id changes

* make style && make quality

* Fix CI

d4c2cb40

01 Jun, 2020 1 commit

Fix onnx export input names order (#4641) · ec62b7d9

Rens authored Jun 01, 2020

* pass on tokenizer to pipeline

* order input names when convert to onnx

* update style

* remove unused imports

* make ordered inputs list needs to be mutable

* add test custom bert model

* remove unused imports

ec62b7d9
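
The input-ordering fix above can be pictured with a small sketch. This is not the code from the PR: `order_inputs_by_signature` and `toy_forward` are hypothetical names. The sketch assumes that exported input names should follow the parameter order of the model's forward function, and that ordering stops at the first parameter the tokenizer did not produce.

```python
import inspect


def order_inputs_by_signature(forward, encoded):
    """Return (names, values) ordered like the parameters of `forward`."""
    names, values = [], []
    for param in inspect.signature(forward).parameters:
        if param == "self":
            continue
        if param not in encoded:
            # Assumption: positional export cannot skip a slot, so stop at the first gap.
            break
        names.append(param)
        values.append(encoded[param])
    return names, tuple(values)


def toy_forward(input_ids, attention_mask=None, token_type_ids=None, position_ids=None):
    """Stand-in for a model forward method; only its signature matters here."""


# A tokenizer may return its keys in a different order than the model expects.
encoded = {"token_type_ids": [0, 0], "input_ids": [101, 102], "attention_mask": [1, 1]}
names, values = order_inputs_by_signature(toy_forward, encoded)
print(names)
```

Because the order comes from the signature, the result does not depend on the key order of the tokenizer output.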

29 May, 2020 3 commits

[EncoderDecoder] Fix initialization and save/load bug (#4680) · 0866669e

Patrick von Platen authored May 30, 2020

* fix bug

* add more tests

0866669e

[Longformer] Better handling of global attention mask vs local attention mask (#4672) · 56ee2560

Patrick von Platen authored May 29, 2020

* better api

* improve automatic setting of global attention mask

* fix longformer bug

* fix global attention mask in test

* fix global attn mask flatten

* fix slow tests

* update docstring

* update docs and make more robust

* improve attention mask

56ee2560

[Longformer] Multiple choice for longformer (#4645) · 9c172564

Patrick von Platen authored May 29, 2020

* add multiple choice for longformer

* add models to docs

* adapt docstring

* add test to longformer

* add longformer for mc in init and modeling auto

* fix tests

9c172564

28 May, 2020 2 commits
- Fix add_special_tokens on fast tokenizers (#4531) · 5e737018
  Anthony MOI authored May 28, 2020
  
  5e737018
- LongformerForTokenClassification (#4638) · e444648a
  Suraj Patil authored May 28, 2020
  
  e444648a
27 May, 2020 3 commits

[Benchmark] Memory benchmark utils (#4198) · 96f57c9c

Patrick von Platen authored May 27, 2020



* improve memory benchmarking

* correct typo

* fix current memory

* check torch memory allocated

* better pytorch function

* add total cached gpu memory

* add total gpu required

* improve torch gpu usage

* update memory usage

* finalize memory tracing

* save intermediate benchmark class

* fix conflict

* improve benchmark

* improve benchmark

* finalize

* make style

* improve benchmarking

* correct typo

* make train function more flexible

* fix csv save

* better repr of bytes

* better print

* fix __repr__ bug

* finish plot script

* rename plot file

* delete csv and small improvements

* fix in plot

* fix in plot

* correct usage of timeit

* remove redundant line

* remove redundant line

* fix bug

* add hf parser tests

* add versioning and platform info

* make style

* add gpu information

* ensure backward compatibility

* finish adding all tests

* Update src/transformers/benchmark/benchmark_args.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/benchmark/benchmark_args_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* delete csv files

* fix isort ordering

* add out of memory handling

* add better train memory handling
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

96f57c9c

LongformerForSequenceClassification (#4580) · ec4cdfdd

Suraj Patil authored May 28, 2020



* LongformerForSequenceClassification

* better naming x=>hidden_states, fix typo in doc

* Update src/transformers/modeling_longformer.py

* Update src/transformers/modeling_longformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

ec4cdfdd

[testing] LanguageModelGenerationTests require_tf or require_torch (#4616) · 07797c4d
Sam Shleifer authored May 27, 2020

07797c4d

25 May, 2020 2 commits

[ci] fix 3 remaining slow GPU failures (#4584) · b86e42e0
Sam Shleifer authored May 25, 2020

b86e42e0

Longformer for question answering (#4500) · 03d8527d

Suraj Patil authored May 25, 2020

* added LongformerForQuestionAnswering

* add LongformerForQuestionAnswering

* fix import for LongformerForMaskedLM

* add LongformerForQuestionAnswering

* hardcoded sep_token_id

* compute attention_mask if not provided

* combine global_attention_mask with attention_mask when provided

* update example in docstring

* add assert error messages, better attention combine

* add test for longformerForQuestionAnswering

* typo

* cast global_attention_mask to long

* make style

* Update src/transformers/configuration_longformer.py

* Update src/transformers/configuration_longformer.py

* fix the code quality

* Merge branch 'longformer-for-question-answering' of https://github.com/patil-suraj/transformers into longformer-for-question-answering
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

03d8527d
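
The step "combine global_attention_mask with attention_mask when provided" can be illustrated with plain lists. This is a minimal sketch, not the commit's implementation: `merge_attention_masks` is a hypothetical helper, and the three-valued encoding (0 = padding, 1 = local attention, 2 = global attention) is an assumption that the log itself does not state.

```python
def merge_attention_masks(attention_mask, global_attention_mask=None):
    """Fold a 0/1 global mask into a 0/1 padding mask, giving values in {0, 1, 2}."""
    if global_attention_mask is None:
        return list(attention_mask)
    if len(attention_mask) != len(global_attention_mask):
        raise ValueError("masks must have the same length")
    # (g + 1) maps local -> 1 and global -> 2; multiplying by the padding
    # mask keeps padded positions at 0 whatever the global mask says.
    return [a * (g + 1) for a, g in zip(attention_mask, global_attention_mask)]


attention_mask = [1, 1, 1, 1, 0]         # last position is padding
global_attention_mask = [1, 0, 0, 0, 0]  # first token attends globally
merged = merge_attention_masks(attention_mask, global_attention_mask)
print(merged)
```

Multiplying instead of adding means a padded token can never become a global token by accident.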

22 May, 2020 2 commits

Fix convert_token_type_ids_from_sequences for fast tokenizers (#4503) · 35df9114
Anthony MOI authored May 22, 2020

35df9114

added functionality for electra classification head (#4257) · bd6e3018

Frankie Liuzzi authored May 22, 2020



* added functionality for electra classification head

* unneeded dropout

* Test ELECTRA for sequence classification

* Style
Co-authored-by: Frankie <frankie@frase.io>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

bd6e3018

21 May, 2020 1 commit

Adds predict stage for glue tasks, and generate result files which can be... · 49296533

Zhangyx authored May 21, 2020


Adds predict stage for glue tasks, and generate result files which can be submitted to gluebenchmark.com (#4463)

* Adds predict stage for glue tasks, and generate result files which could be submitted to gluebenchmark.com website.

* Use Split enum + always output the label name
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

49296533

20 May, 2020 3 commits
- [ci] Close #4481 · 865d4d59
  Julien Chaumond authored May 20, 2020
  
  865d4d59
- Update test_trainer_distributed.py · a3af8e86
  Julien Chaumond authored May 20, 2020
  
  a3af8e86
- Fix slow gpu tests lysandre (#4487) · 14cb5b35
  Lysandre Debut authored May 20, 2020
  
  * There is one missing key in BERT
  * Correct device for CamemBERT model
  * RoBERTa tokenization adding prefix space
  * Style
  
  14cb5b35

19 May, 2020 7 commits

[MarianTokenizer] implement save_vocabulary and other common methods (#4389) · efbc1c5a

Sam Shleifer authored May 19, 2020

efbc1c5a

[gpu slow tests] fix mbart-large-enro gpu tests (#4472) · 956c4c4e

Sam Shleifer authored May 19, 2020

956c4c4e

[Tests, GPU, SLOW] fix a bunch of GPU hardcoded tests in Pytorch (#4468) · aa925a52

Patrick von Platen authored May 19, 2020

* fix gpu slow tests in pytorch

* change model to device syntax

aa925a52

[cleanup] test_tokenization_common.py (#4390) · 07dd7c2f

Sam Shleifer authored May 19, 2020

07dd7c2f

Longformer (#4352) · 8f1d0471

Iz Beltagy authored May 19, 2020

* first commit

* bug fixes

* better examples

* undo padding

* remove wrong VOCAB_FILES_NAMES

* License

* make style

* make isort happy

* unit tests

* integration test

* make `black` happy by undoing `isort` changes!!

* lint

* no need for the padding value

* batch_size not bsz

* remove unused type casting

* seqlen not seq_len

* staticmethod

* `bert` selfattention instead of `n2`

* uint8 instead of bool + lints

* pad inputs_embeds using embeddings not a constant

* black

* unit test with padding

* fix unit tests

* remove redundant unit test

* upload model weights

* resolve todo

* simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_

* increase unittest coverage

8f1d0471

Distributed eval: SequentialDistributedSampler + gather all results (#4243) · 5e7fe8b5

Julien Chaumond authored May 18, 2020

* Distributed eval: SequentialDistributedSampler + gather all results

* For consistency only write to disk from world_master

Close https://github.com/huggingface/transformers/issues/4272

* Working distributed eval

* Hook into scripts

* Fix #3721 again

* TPU.mesh_reduce: stay in tensor space

Thanks @jysohn23

* Just a small comment

* whitespace

* torch.hub: pip install packaging

* Add test scenarii

5e7fe8b5
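
The idea named in this commit, a sequential sampler per process followed by a gather of all results, can be sketched without any distributed runtime. The code below is an illustration under stated assumptions, not the Trainer's code: `sequential_shard` and `gather_in_order` are hypothetical helpers, each rank is assumed to receive one contiguous, equally sized slice, and the last slice is assumed to be padded by wrapping around to the start of the dataset.

```python
import math


def sequential_shard(dataset_len, num_replicas, rank):
    """Indices evaluated by `rank`: a contiguous slice, padded by wrap-around."""
    num_samples = math.ceil(dataset_len / num_replicas)
    start = rank * num_samples
    return [i % dataset_len for i in range(start, start + num_samples)]


def gather_in_order(per_rank_results, dataset_len):
    """Concatenate results rank by rank and drop the padded tail."""
    flat = [item for shard in per_rank_results for item in shard]
    return flat[:dataset_len]


# 10 examples on 4 processes: every rank gets 3 indices, the last one wraps.
shards = [sequential_shard(10, 4, rank) for rank in range(4)]
predictions = gather_in_order(shards, 10)
print(shards)
print(predictions)
```

Because the slices are contiguous and ordered by rank, truncating the concatenation to the dataset length restores the original example order, which a shuffling sampler would not guarantee.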

Fix nn.DataParallel compatibility in PyTorch 1.5 (#4300) · 4c068936

Julien Chaumond authored May 18, 2020

* Test case for #3936

* multigpu tests pass on pytorch 1.4.0

* Fixup

* multigpu tests pass on pytorch 1.5.0

* Update src/transformers/modeling_utils.py

* Update src/transformers/modeling_utils.py

* rename multigpu to require_multigpu

* mode doc

4c068936

18 May, 2020 3 commits
- [test_pipelines] Mark tests > 10s @slow, small speedups (#4421) · a699525d
  Sam Shleifer authored May 18, 2020
  
  a699525d
- [T5 fp16] Fix fp16 in T5 (#4436) · 026a5d08
  Patrick von Platen authored May 18, 2020
  
  * fix fp16 in t5
  * make style
  * refactor invert_attention_mask fn
  * fix typo
  
  026a5d08
- Tag onnx export tests as slow (#4432) · 31c799a0
  Funtowicz Morgan authored May 18, 2020
  
  31c799a0
17 May, 2020 1 commit

Allow the creation of "entity groups" for NerPipeline #3548 (#3957) · 18d233d5

Lorenzo Ampil authored May 17, 2020

* Add index to be returned by NerPipeline to allow for the creation of

* Add entity groups

* Convert entity list to dict

* Add entity to entity_group_disagg after updating entity groups

* Change 'group' parameter to 'grouped_entities'

* Add unit tests for grouped NER pipeline case

* Correct variable name typo for NER_FINETUNED_MODELS

* Sync grouped tests to recent test updates

18d233d5
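
The log says the pipeline returns a token index so that "entity groups" can be built. A minimal sketch of such grouping follows. The token dictionaries, the `group_entities` helper, and the rule "merge tokens that share a label and have consecutive indices" are all assumptions made for illustration, not the pipeline's actual logic or output format.

```python
def group_entities(tokens):
    """Merge runs of adjacent tokens that carry the same entity label."""
    groups = []
    previous = None
    for token in tokens:
        adjacent = (
            previous is not None
            and token["entity"] == previous["entity"]
            and token["index"] == previous["index"] + 1
        )
        if adjacent:
            # Same label and next position: extend the current group.
            groups[-1]["word"] += " " + token["word"]
        else:
            groups.append({"entity_group": token["entity"], "word": token["word"]})
        previous = token
    return groups


# Hypothetical per-token output; non-entity tokens are assumed to be filtered out already.
tokens = [
    {"word": "New", "entity": "I-LOC", "index": 1},
    {"word": "York", "entity": "I-LOC", "index": 2},
    {"word": "Alice", "entity": "I-PER", "index": 5},
]
grouped = group_entities(tokens)
print(grouped)
```

The index is what separates "two adjacent tokens of one entity" from "two separate mentions with the same label".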

14 May, 2020 3 commits

Conversion script to export transformers models to ONNX IR. (#4253) · db0076a9

Funtowicz Morgan authored May 14, 2020

* Added generic ONNX conversion script for PyTorch model.

* WIP initial TF support.

* TensorFlow/Keras ONNX export working.

* Print framework version info

* Add possibility to check the model is correctly loading on ONNX runtime.

* Remove quantization option.

* Specify ONNX opset version when exporting.

* Formatting.

* Remove unused imports.

* Make functions more generally reusable from other part of the code.

* isort happy.

* flake happy

* Export only feature-extraction for now

* Correctly check inputs order / filter before export.

* Removed task variable

* Fix invalid args call in load_graph_from_args.

* Fix invalid args call in convert.

* Fix invalid args call in infer_shapes.

* Raise exception and catch in caller function instead of exit.

* Add 04-onnx-export.ipynb notebook

* More WIP on the notebook

* Remove unused imports

* Simplify & remove unused constants.

* Export with constant_folding in PyTorch

* Let's try to put function args in the right order this time ...

* Disable external_data_format temporary

* ONNX notebook draft ready.

* Updated notebooks charts + wording

* Correct error while exporting last chart in notebook.

* Addressing @LysandreJik comment.

* Set ONNX opset to 11 as default value.

* Set opset param mandatory

* Added ONNX export unittests

* Quality.

* flake8 happy

* Add keras2onnx dependency on extras["tf"]

* Pin keras2onnx on github master to v1.6.5

* Second attempt.

* Third attempt.

* Use the right repo URL this time ...

* Do the same for onnxconverter-common

* Updated keras2onnx and onnxconverter-common to 1.7.0 to support TF2.2

* Correct commit hash.

* Addressing PR review: Optimization are enabled by default.

* Addressing PR review: small changes in the notebook

* setup.py comment about keras2onnx versioning.

db0076a9

[tests] make pipelines tests faster with smaller models (#4238) · 7822cd38

Sam Shleifer authored May 14, 2020

covers torch and tf. Also fixes a failing @slow test

7822cd38

Fix: unpin flake8 and fix cs errors (#4367) · 448c4672

Julien Chaumond authored May 14, 2020

* Fix: unpin flake8 and fix cs errors

* Ok we still need to quote those

448c4672

13 May, 2020 2 commits

[Marian Fixes] prevent predicting pad_token_id before softmax, support... · 9a687ebb

Sam Shleifer authored May 13, 2020

[Marian Fixes] prevent predicting pad_token_id before softmax, support language codes, name multilingual models (#4290)

9a687ebb

(v2) Improvements to the wandb integration (#4324) · 24175910

Julien Chaumond authored May 12, 2020



* Improvements to the wandb integration

* small reorg + no global necessary

* feat(trainer): log epoch and final metrics

* Simplify logging a bit

* Fixup

* Fix crash when just running eval
Co-authored-by: Chris Van Pelt <vanpelt@gmail.com>
Co-authored-by: Boris Dayma <boris.dayma@gmail.com>

24175910

12 May, 2020 1 commit
- Fix BART tests on GPU (#4298) · 4bf50422
  Julien Chaumond authored May 12, 2020
  
  4bf50422
10 May, 2020 1 commit

[Marian] documentation and AutoModel support (#4152) · 3487be75

Sam Shleifer authored May 10, 2020

- MarianSentencepieceTokenizer -> MarianTokenizer
- Start using unk token.
- add docs page
- add better generation params to MarianConfig
- more conversion utilities

3487be75

08 May, 2020 1 commit
- [Pipeline, Generation] tf generation pipeline bug (#4217) · cf08830c
  Patrick von Platen authored May 08, 2020
  
  * fix PR
  * move tests to correct place
  
  cf08830c

07 May, 2020 3 commits

Add AlbertForPreTraining and TFAlbertForPreTraining models. (#4057) · 8bf73126

Jared T Nielsen authored May 07, 2020



* Add AlbertForPreTraining and TFAlbertForPreTraining models.

* PyTorch conversion

* TensorFlow conversion

* style
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

8bf73126

BIG Reorganize examples (#4213) · 0ae96ff8

Julien Chaumond authored May 07, 2020

* Created using Colaboratory

* [examples] reorganize files

* remove run_tpu_glue.py as superseded by TPU support in Trainer

* Bugfix: int, not tuple

* move files around

0ae96ff8

Rewritten batch support in pipelines. (#4154) · 0a6cbea0

Funtowicz Morgan authored May 07, 2020



* Rewritten batch support in pipelines.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Fix imports sorting 🔧

Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Set pad_to_max_length=True by default on Pipeline.

* Set pad_to_max_length=False for generation pipelines.

Most generation models don't have a padding token.

* Address @joeddav review comment: Uniformized *args.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Address @joeddav review comment: Uniformized *args (second).
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

0a6cbea0