Commits · 5c0cfc2cf0941d2db368767fd232d8712449c7f8 · chenpangpang / transformers

05 Jun, 2020 1 commit
- Fix argument label (#4792) · 4dd5cf22
  Sylvain Gugger authored Jun 05, 2020
```
* Fix argument label

* Fix test
```
  4dd5cf22
04 Jun, 2020 2 commits

Tensorflow improvements (#4530) · f9414f75

Julien Plu authored Jun 05, 2020



* Better None gradients handling

* Apply Style

* Apply Style

* Create a loss class per task to compute its respective loss

* Add loss classes to the ALBERT TF models

* Add loss classes to the BERT TF models

* Add question answering and multiple choice to TF Camembert

* Remove prints

* Add multiple choice model to TF DistilBERT + loss computation

* Add question answering model to TF Electra + loss computation

* Add token classification, question answering and multiple choice models to TF Flaubert

* Add multiple choice model to TF Roberta + loss computation

* Add multiple choice model to TF XLM + loss computation

* Add multiple choice and question answering models to TF XLM-Roberta

* Add multiple choice model to TF XLNet + loss computation

* Remove unused parameters

* Add task loss classes

* Reorder TF imports + add new model classes

* Add new model classes

* Bugfix in TF T5 model

* Bugfix for TF T5 tests

* Bugfix in TF T5 model

* Fix TF T5 model tests

* Fix T5 tests + some renaming

* Fix inheritance issue in the AutoX tests

* Add tests for TF Flaubert and TF XLM Roberta

* Add tests for TF Flaubert and TF XLM Roberta

* Remove unused piece of code in the TF trainer

* bugfix and remove unused code

* Bugfix for TF 2.2

* Apply Style

* Divide TFSequenceClassificationAndMultipleChoiceLoss into their two respective name

* Apply style

* Mirror the PT Trainer in the TF one: fp16, optimizers and tb_writer as class parameter and better dataset handling

* Fix TF optimizations tests and apply style

* Remove useless parameter

* Bugfix and apply style

* Fix TF Trainer prediction

* Now the TF models return the loss such as their PyTorch couterparts

* Apply Style

* Ignore some tests output

* Take into account the SQuAD cls_index, p_mask and is_impossible parameters for the QuestionAnswering task models.

* Fix names for SQuAD data

* Apply Style

* Fix conflicts with 2.11 release

* Fix conflicts with 2.11

* Fix wrongname

* Add better documentation on the new create_optimizer function

* Fix isort

* logging_dir: use same default as PyTorch
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

f9414f75

Introduce a new tensor type for return_tensors on tokenizer for NumPy (#4585) · 5bf9afbf

Funtowicz Morgan authored Jun 04, 2020

* Refactor tensor creation in tokenizers.

* Make sure to convert string to TensorType

* Refactor convert_to_tensors_

* Introduce numpy tensor creation

* Format

* Add unittest for TensorType creation from str

* sorting imports

* Added unittests for numpy tensor conversion.

* Do not use in-place version for squeeze as numpy doesn't provide such feature.

* Added extra parameter prepend_batch_axis: bool on prepare_for_model.

* Ensure test_np_encode_plus_sent_to_model is not executed if encoder/decoder model.

* style.

* numpy tests require_torch for now while flax not merged.

* Hopefully will make flake8 happy.

* One more time 🎶

5bf9afbf

03 Jun, 2020 1 commit

Unify label args (#4722) · 1b5820a5

Sylvain Gugger authored Jun 03, 2020

* Deprecate masked_lm_labels argument

* Apply to all models

* Better error message

1b5820a5

02 Jun, 2020 4 commits
- [Reformer] Improved memory if input is shorter than chunk length (#4720) · 9ca48573
  Patrick von Platen authored Jun 02, 2020
```
* improve handling of short inputs for reformer

* correct typo in assert statement

* fix other tests
```
  9ca48573
- TFRobertaModelIntegrationTest requires tf (#4726) · 70f74234
  Sam Shleifer authored Jun 02, 2020
  
  70f74234
- Fix CI after killing archive maps (#4724) · b42586ea
  Julien Chaumond authored Jun 02, 2020
```
* 🐛 Fix model ids for BART and Flaubert
```
  b42586ea
- Kill model archive maps (#4636) · d4c2cb40
  Julien Chaumond authored Jun 02, 2020
```
* Kill model archive maps

* Fixup

* Also kill model_archive_map for MaskedBertPreTrainedModel

* Unhook config_archive_map

* Tokenizers: align with model id changes

* make style && make quality

* Fix CI
```
  d4c2cb40
01 Jun, 2020 1 commit

Fix onnx export input names order (#4641) · ec62b7d9

Rens authored Jun 01, 2020

* pass on tokenizer to pipeline

* order input names when convert to onnx

* update style

* remove unused imports

* make ordered inputs list needs to be mutable

* add test custom bert model

* remove unused imports

ec62b7d9

29 May, 2020 3 commits

[EncoderDecoder] Fix initialization and save/load bug (#4680) · 0866669e
Patrick von Platen authored May 30, 2020
```
* fix bug

* add more tests
```
0866669e

[Longformer] Better handling of global attention mask vs local attention mask (#4672) · 56ee2560

Patrick von Platen authored May 29, 2020

* better api

* improve automatic setting of global attention mask

* fix longformer bug

* fix global attention mask in test

* fix global attn mask flatten

* fix slow tests

* update docstring

* update docs and make more robust

* improve attention mask

56ee2560

[Longformer] Multiple choice for longformer (#4645) · 9c172564

Patrick von Platen authored May 29, 2020

* add multiple choice for longformer

* add models to docs

* adapt docstring

* add test to longformer

* add longformer for mc in init and modeling auto

* fix tests

9c172564

28 May, 2020 2 commits
- Fix add_special_tokens on fast tokenizers (#4531) · 5e737018
  Anthony MOI authored May 28, 2020
  
  5e737018
- LongformerForTokenClassification (#4638) · e444648a
  Suraj Patil authored May 28, 2020
  
  e444648a
27 May, 2020 3 commits

[Benchmark] Memory benchmark utils (#4198) · 96f57c9c

Patrick von Platen authored May 27, 2020



* improve memory benchmarking

* correct typo

* fix current memory

* check torch memory allocated

* better pytorch function

* add total cached gpu memory

* add total gpu required

* improve torch gpu usage

* update memory usage

* finalize memory tracing

* save intermediate benchmark class

* fix conflict

* improve benchmark

* improve benchmark

* finalize

* make style

* improve benchmarking

* correct typo

* make train function more flexible

* fix csv save

* better repr of bytes

* better print

* fix __repr__ bug

* finish plot script

* rename plot file

* delete csv and small improvements

* fix in plot

* fix in plot

* correct usage of timeit

* remove redundant line

* remove redundant line

* fix bug

* add hf parser tests

* add versioning and platform info

* make style

* add gpu information

* ensure backward compatibility

* finish adding all tests

* Update src/transformers/benchmark/benchmark_args.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/benchmark/benchmark_args_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* delete csv files

* fix isort ordering

* add out of memory handling

* add better train memory handling
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

96f57c9c

LongformerForSequenceClassification (#4580) · ec4cdfdd

Suraj Patil authored May 28, 2020



* LongformerForSequenceClassification

* better naming x=>hidden_states, fix typo in doc

* Update src/transformers/modeling_longformer.py

* Update src/transformers/modeling_longformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

ec4cdfdd

[testing] LanguageModelGenerationTests require_tf or require_torch (#4616) · 07797c4d
Sam Shleifer authored May 27, 2020

07797c4d

25 May, 2020 2 commits

[ci] fix 3 remaining slow GPU failures (#4584) · b86e42e0
Sam Shleifer authored May 25, 2020

b86e42e0

Longformer for question answering (#4500) · 03d8527d

Suraj Patil authored May 25, 2020

* added LongformerForQuestionAnswering

* add LongformerForQuestionAnswering

* fix import for LongformerForMaskedLM

* add LongformerForQuestionAnswering

* hardcoded sep_token_id

* compute attention_mask if not provided

* combine global_attention_mask with attention_mask when provided

* update example in  docstring

* add assert error messages, better attention combine

* add test for longformerForQuestionAnswering

* typo

* cast gloabl_attention_mask to long

* make style

* Update src/transformers/configuration_longformer.py

* Update src/transformers/configuration_longformer.py

* fix the code quality

* Merge branch 'longformer-for-question-answering' of https://github.com/patil-suraj/transformers

 into longformer-for-question-answering
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

03d8527d

22 May, 2020 2 commits

Fix convert_token_type_ids_from_sequences for fast tokenizers (#4503) · 35df9114
Anthony MOI authored May 22, 2020

35df9114

added functionality for electra classification head (#4257) · bd6e3018

Frankie Liuzzi authored May 22, 2020



* added functionality for electra classification head

* unneeded dropout

* Test ELECTRA for sequence classification

* Style
Co-authored-by: Frankie <frankie@frase.io>
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

bd6e3018

21 May, 2020 1 commit

Adds predict stage for glue tasks, and generate result files which can be... · 49296533

Zhangyx authored May 21, 2020


Adds predict stage for glue tasks, and generate result files which can be submitted to gluebenchmark.com (#4463)

* Adds predict stage for glue tasks, and generate result files which could be submitted to gluebenchmark.com website.

* Use Split enum + always output the label name
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

49296533

20 May, 2020 3 commits
- [ci] Close #4481 · 865d4d59
  Julien Chaumond authored May 20, 2020
  
  865d4d59
- Update test_trainer_distributed.py · a3af8e86
  Julien Chaumond authored May 20, 2020
  
  a3af8e86
- Fix slow gpu tests lysandre (#4487) · 14cb5b35
  Lysandre Debut authored May 20, 2020
```
* There is one missing key in BERT

* Correct device for CamemBERT model

* RoBERTa tokenization adding prefix space

* Style
```
  14cb5b35
19 May, 2020 7 commits

[MarianTokenizer] implement save_vocabulary and other common methods (#4389) · efbc1c5a
Sam Shleifer authored May 19, 2020

efbc1c5a
[gpu slow tests] fix mbart-large-enro gpu tests (#4472) · 956c4c4e
Sam Shleifer authored May 19, 2020

956c4c4e
[Tests, GPU, SLOW] fix a bunch of GPU hardcoded tests in Pytorch (#4468) · aa925a52
Patrick von Platen authored May 19, 2020
```
* fix gpu slow tests in pytorch

* change model to device syntax
```
aa925a52
[cleanup] test_tokenization_common.py (#4390) · 07dd7c2f
Sam Shleifer authored May 19, 2020

07dd7c2f

Longformer (#4352) · 8f1d0471

Iz Beltagy authored May 19, 2020

* first commit

* bug fixes

* better examples

* undo padding

* remove wrong VOCAB_FILES_NAMES

* License

* make style

* make isort happy

* unit tests

* integration test

* make `black` happy by undoing `isort` changes!!

* lint

* no need for the padding value

* batch_size not bsz

* remove unused type casting

* seqlen not seq_len

* staticmethod

* `bert` selfattention instead of `n2`

* uint8 instead of bool + lints

* pad inputs_embeds using embeddings not a constant

* black

* unit test with padding

* fix unit tests

* remove redundant unit test

* upload model weights

* resolve todo

* simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_

* increase unittest coverage

8f1d0471

Distributed eval: SequentialDistributedSampler + gather all results (#4243) · 5e7fe8b5

Julien Chaumond authored May 18, 2020

* Distributed eval: SequentialDistributedSampler + gather all results

* For consistency only write to disk from world_master

Close https://github.com/huggingface/transformers/issues/4272

* Working distributed eval

* Hook into scripts

* Fix #3721 again

* TPU.mesh_reduce: stay in tensor space

Thanks @jysohn23

* Just a small comment

* whitespace

* torch.hub: pip install packaging

* Add test scenarii

5e7fe8b5

Fix nn.DataParallel compatibility in PyTorch 1.5 (#4300) · 4c068936

Julien Chaumond authored May 18, 2020

* Test case for #3936

* multigpu tests pass on pytorch 1.4.0

* Fixup

* multigpu tests pass on pytorch 1.5.0

* Update src/transformers/modeling_utils.py

* Update src/transformers/modeling_utils.py

* rename multigpu to require_multigpu

* mode doc

4c068936

18 May, 2020 3 commits
- [test_pipelines] Mark tests > 10s @slow, small speedups (#4421) · a699525d
  Sam Shleifer authored May 18, 2020
  
  a699525d
- [T5 fp16] Fix fp16 in T5 (#4436) · 026a5d08
  Patrick von Platen authored May 18, 2020
```
* fix fp16 in t5

* make style

* refactor invert_attention_mask fn

* fix typo
```
  026a5d08
- Tag onnx export tests as slow (#4432) · 31c799a0
  Funtowicz Morgan authored May 18, 2020
  
  31c799a0
17 May, 2020 1 commit

Allow the creation of "entity groups" for NerPipeline #3548 (#3957) · 18d233d5

Lorenzo Ampil authored May 17, 2020

* Add index to be returned by NerPipeline to allow for the creation of

* Add entity groups

* Convert entity list to dict

* Add entity to entity_group_disagg atfter updating entity gorups

* Change 'group' parameter to 'grouped_entities'

* Add unit tests for grouped NER pipeline case

* Correct variable name typo for NER_FINETUNED_MODELS

* Sync grouped tests to recent test updates

18d233d5

14 May, 2020 3 commits

Conversion script to export transformers models to ONNX IR. (#4253) · db0076a9

Funtowicz Morgan authored May 14, 2020

* Added generic ONNX conversion script for PyTorch model.

* WIP initial TF support.

* TensorFlow/Keras ONNX export working.

* Print framework version info

* Add possibility to check the model is correctly loading on ONNX runtime.

* Remove quantization option.

* Specify ONNX opset version when exporting.

* Formatting.

* Remove unused imports.

* Make functions more generally reusable from other part of the code.

* isort happy.

* flake happy

* Export only feature-extraction for now

* Correctly check inputs order / filter before export.

* Removed task variable

* Fix invalid args call in load_graph_from_args.

* Fix invalid args call in convert.

* Fix invalid args call in infer_shapes.

* Raise exception and catch in caller function instead of exit.

* Add 04-onnx-export.ipynb notebook

* More WIP on the notebook

* Remove unused imports

* Simplify & remove unused constants.

* Export with constant_folding in PyTorch

* Let's try to put function args in the right order this time ...

* Disable external_data_format temporary

* ONNX notebook draft ready.

* Updated notebooks charts + wording

* Correct error while exporting last chart in notebook.

* Adressing @LysandreJik comment.

* Set ONNX opset to 11 as default value.

* Set opset param mandatory

* Added ONNX export unittests

* Quality.

* flake8 happy

* Add keras2onnx dependency on extras["tf"]

* Pin keras2onnx on github master to v1.6.5

* Second attempt.

* Third attempt.

* Use the right repo URL this time ...

* Do the same for onnxconverter-common

* Added keras2onnx and onnxconveter-common to 1.7.0 to supports TF2.2

* Correct commit hash.

* Addressing PR review: Optimization are enabled by default.

* Addressing PR review: small changes in the notebook

* setup.py comment about keras2onnx versioning.

db0076a9

[tests] make pipelines tests faster with smaller models (#4238) · 7822cd38
Sam Shleifer authored May 14, 2020
```
covers torch and tf. Also fixes a failing @slow test
```
7822cd38
Fix: unpin flake8 and fix cs errors (#4367) · 448c4672
Julien Chaumond authored May 14, 2020
```
* Fix: unpin flake8 and fix cs errors

* Ok we still need to quote those
```
448c4672

13 May, 2020 1 commit
- [Marian Fixes] prevent predicting pad_token_id before softmax, support... · 9a687ebb
  Sam Shleifer authored May 13, 2020
```
[Marian Fixes] prevent predicting pad_token_id before softmax, support language codes, name multilingual models (#4290)
```
  9a687ebb