Commits · 8f1d0471489352ec01556ae61f8e8246002bbc58 · chenpangpang / transformers

19 May, 2020 6 commits

Iz Beltagy authored May 19, 2020

* first commit

* bug fixes

* better examples

* undo padding

* remove wrong VOCAB_FILES_NAMES

* License

* make style

* make isort happy

* unit tests

* integration test

* make `black` happy by undoing `isort` changes!!

* lint

* no need for the padding value

* batch_size not bsz

* remove unused type casting

* seqlen not seq_len

* staticmethod

* `bert` selfattention instead of `n2`

* uint8 instead of bool + lints

* pad inputs_embeds using embeddings not a constant

* black

* unit test with padding

* fix unit tests

* remove redundant unit test

* upload model weights

* resolve todo

* simpler _mask_invalid_locations without lru_cache + backward compatible masked_fill_

* increase unittest coverage

8f1d0471

Map optimizer to correct device after loading from checkpoint. (#4403) · 384f0eb2

Shaoyen authored May 18, 2020



* Map optimizer to correct device after loading from checkpoint.

* Make style test pass
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

384f0eb2

[Trainer] move model to device before setting optimizer (#4450) · bf14ef75
Julien Chaumond authored May 18, 2020

bf14ef75

Distributed eval: SequentialDistributedSampler + gather all results (#4243) · 5e7fe8b5

Julien Chaumond authored May 18, 2020

* Distributed eval: SequentialDistributedSampler + gather all results

* For consistency only write to disk from world_master

Close https://github.com/huggingface/transformers/issues/4272

* Working distributed eval

* Hook into scripts

* Fix #3721 again

* TPU.mesh_reduce: stay in tensor space

Thanks @jysohn23

* Just a small comment

* whitespace

* torch.hub: pip install packaging

* Add test scenarii

5e7fe8b5

Fix nn.DataParallel compatibility in PyTorch 1.5 (#4300) · 4c068936

Julien Chaumond authored May 18, 2020

* Test case for #3936

* multigpu tests pass on pytorch 1.4.0

* Fixup

* multigpu tests pass on pytorch 1.5.0

* Update src/transformers/modeling_utils.py

* Update src/transformers/modeling_utils.py

* rename multigpu to require_multigpu

* mode doc

4c068936

Make get_last_lr in trainer backward compatible (#4446) · 9de4afa8

Rakesh Chada authored May 18, 2020

* makes fetching last learning late in trainer backward compatible

* split comment to multiple lines

* fixes black styling issue

* uses version to create a more explicit logic

9de4afa8

18 May, 2020 6 commits
- Adding optimizations block from ONNXRuntime. (#4431) · ca4a3f4d
  Funtowicz Morgan authored May 18, 2020
```
* Adding optimizations block from ONNXRuntime.

* Turn off external data format by default for PyTorch export.

* Correct the way use_external_format is passed through the cmdline args.
```
  ca4a3f4d
- better naming in tf t5 (#4401) · d39bf0ac
  Patrick von Platen authored May 18, 2020
  
  d39bf0ac
- improve docstring (#4422) · 590adb13
  Patrick von Platen authored May 18, 2020
  
  590adb13
- [T5 fp16] Fix fp16 in T5 (#4436) · 026a5d08
  Patrick von Platen authored May 18, 2020
```
* fix fp16 in t5

* make style

* refactor invert_attention_mask fn

* fix typo
```
  026a5d08
- fix (#4419) · a27c7959
  Patrick von Platen authored May 18, 2020
  
  a27c7959
- [MbartTokenizer] save to sentencepiece.bpe.model (#4335) · 8581a670
  Mehrad Moradshahi authored May 18, 2020
  
  8581a670
17 May, 2020 1 commit

Allow the creation of "entity groups" for NerPipeline #3548 (#3957) · 18d233d5

Lorenzo Ampil authored May 17, 2020

* Add index to be returned by NerPipeline to allow for the creation of

* Add entity groups

* Convert entity list to dict

* Add entity to entity_group_disagg atfter updating entity gorups

* Change 'group' parameter to 'grouped_entities'

* Add unit tests for grouped NER pipeline case

* Correct variable name typo for NER_FINETUNED_MODELS

* Sync grouped tests to recent test updates

18d233d5

15 May, 2020 6 commits
- Fix addcmul_ · 3e0f0621
  Julien Chaumond authored May 15, 2020
  
  3e0f0621
- Fix: one more try · fc2a4c88
  Julien Chaumond authored May 15, 2020
  
  fc2a4c88
- Same fix for `addcmul_` · 55bda525
  Julien Chaumond authored May 15, 2020
  
  55bda525
- Fix UserWarning: This overload of add_ is deprecated in pytorch==1.5.0 · ad02c961
  Julien Chaumond authored May 15, 2020
  
  ad02c961
- [skip ci] remove local rank · 15550ce0
  Julien Chaumond authored May 15, 2020
  
  15550ce0
- Allow for None gradients in GradientAccumulator. (#4372) · 34706ba0
  Jared T Nielsen authored May 15, 2020
  
  34706ba0
14 May, 2020 8 commits

p_mask in SQuAD pre-processing (#4049) · 7defc667
Lysandre Debut authored May 14, 2020
```
* Better p_mask building

* Adressing @mfuntowicz comments
```
7defc667

Conversion script to export transformers models to ONNX IR. (#4253) · db0076a9

Funtowicz Morgan authored May 14, 2020

* Added generic ONNX conversion script for PyTorch model.

* WIP initial TF support.

* TensorFlow/Keras ONNX export working.

* Print framework version info

* Add possibility to check the model is correctly loading on ONNX runtime.

* Remove quantization option.

* Specify ONNX opset version when exporting.

* Formatting.

* Remove unused imports.

* Make functions more generally reusable from other part of the code.

* isort happy.

* flake happy

* Export only feature-extraction for now

* Correctly check inputs order / filter before export.

* Removed task variable

* Fix invalid args call in load_graph_from_args.

* Fix invalid args call in convert.

* Fix invalid args call in infer_shapes.

* Raise exception and catch in caller function instead of exit.

* Add 04-onnx-export.ipynb notebook

* More WIP on the notebook

* Remove unused imports

* Simplify & remove unused constants.

* Export with constant_folding in PyTorch

* Let's try to put function args in the right order this time ...

* Disable external_data_format temporary

* ONNX notebook draft ready.

* Updated notebooks charts + wording

* Correct error while exporting last chart in notebook.

* Adressing @LysandreJik comment.

* Set ONNX opset to 11 as default value.

* Set opset param mandatory

* Added ONNX export unittests

* Quality.

* flake8 happy

* Add keras2onnx dependency on extras["tf"]

* Pin keras2onnx on github master to v1.6.5

* Second attempt.

* Third attempt.

* Use the right repo URL this time ...

* Do the same for onnxconverter-common

* Added keras2onnx and onnxconveter-common to 1.7.0 to supports TF2.2

* Correct commit hash.

* Addressing PR review: Optimization are enabled by default.

* Addressing PR review: small changes in the notebook

* setup.py comment about keras2onnx versioning.

db0076a9

Fix trainer evaluation (#4363) · 2d054801

Suraj Patil authored May 15, 2020

* fix loss calculation in evaluation

* fix evaluation on TPU when prediction_loss_only is True

2d054801

Tokenizer.batch_decode convenience method (#4159) · 9535bf19
Sam Shleifer authored May 14, 2020

9535bf19
[tests] make pipelines tests faster with smaller models (#4238) · 7822cd38
Sam Shleifer authored May 14, 2020
```
covers torch and tf. Also fixes a failing @slow test
```
7822cd38
Fix: unpin flake8 and fix cs errors (#4367) · 448c4672
Julien Chaumond authored May 14, 2020
```
* Fix: unpin flake8 and fix cs errors

* Ok we still need to quote those
```
448c4672
Use Filelock to ensure distributed barriers · c547f15a
Julien Chaumond authored May 14, 2020
```
see context in https://github.com/huggingface/transformers/pull/4223
```
c547f15a
TPU needs a rendezvous (#4339) · ef46ccb0
Lysandre Debut authored May 14, 2020

ef46ccb0

13 May, 2020 5 commits

Release: v2.9.1 · 7cb203fa
Lysandre authored May 13, 2020

7cb203fa
[Marian Fixes] prevent predicting pad_token_id before softmax, support... · 9a687ebb
Sam Shleifer authored May 13, 2020
```
[Marian Fixes] prevent predicting pad_token_id before softmax, support language codes, name multilingual models (#4290)
```
9a687ebb

Question Answering for TF trainer (#4320) · ca136186

Julien Plu authored May 13, 2020

* Add QA trainer example for TF

* Make data_dir optional

* Fix parameter logic

* Fix feature convert

* Update the READMEs to add the question-answering task

* Apply style

* Change 'sequence-classification' to 'text-classification' and prefix with 'eval' all the metric names

* Apply style

* Apply style

ca136186

Fix for #3865. PretrainedTokenizer mapped " do not" into " don't" when... · 1e51bb71

Denis authored May 13, 2020

Fix for #3865. PretrainedTokenizer mapped " do not" into " don't" when .decode(...) is called. Removed the " do not" --> " don't" mapping from clean_up_tokenization(...). (#4024)

1e51bb71

(v2) Improvements to the wandb integration (#4324) · 24175910

Julien Chaumond authored May 12, 2020



* Improvements to the wandb integration

* small reorg + no global necessary

* feat(trainer): log epoch and final metrics

* Simplify logging a bit

* Fixup

* Fix crash when just running eval
Co-authored-by: Chris Van Pelt <vanpelt@gmail.com>
Co-authored-by: Boris Dayma <boris.dayma@gmail.com>

24175910

12 May, 2020 4 commits

Allow BatchEncoding to be initialized empty. (#4316) · 7d7fe499

Funtowicz Morgan authored May 12, 2020

* Allow BatchEncoding to be initialized empty.

This is required by recent changes introduced in TF 2.2.

* Attempt to unpin Tensorflow to 2.2 with the previous commit.

7d7fe499

Fix BART tests on GPU (#4298) · 4bf50422
Julien Chaumond authored May 12, 2020

4bf50422

Add MultipleChoice to TFTrainer [WIP] (#4270) · e4512aab

Viktor Alm authored May 12, 2020



* catch gpu len 1 set to gpu0

* Add mpc to trainer

* Add MPC for TF

* fix TF automodel for MPC and add Albert

* Apply style

* Fix import

* Note to self: double check

* Make shape None, None for datasetgenerator output shapes

* Add from_pt bool which doesnt seem to work

* Original checkpoint dir

* Fix docstrings for automodel

* Update readme and apply style

* Colab should probably not be from users

* Colabs should probably not be from users

* Add colab

* Update README.md

* Update README.md

* Cleanup __intit__

* Cleanup flake8 trailing comma

* Update src/transformers/training_args_tf.py

* Update src/transformers/modeling_tf_auto.py
Co-authored-by: Viktor Alm <viktoralm@pop-os.localdomain>
Co-authored-by: Julien Chaumond <chaumond@gmail.com>

e4512aab

Remove hard-coded pad token id in distilbert and albert (#3965) · 31e67dd1
Jangwon Park authored May 12, 2020

31e67dd1

11 May, 2020 4 commits

Simplify cache vars and allow for TRANSFORMERS_CACHE env (#4226) · 61d22f9c

Bram Vanroy authored May 11, 2020

* simplify cache vars and allow for TRANSFORMERS_CACHE env

As it currently stands, "TRANSFORMERS_CACHE" is not an accepted variable. It seems that the these variables were not updated when moving from version pytorch_transformers to transformers. In addition, the fallback procedure could be improved. and simplified. Pathlib seems redundant here.

* Update file_utils.py

61d22f9c

Fix special token doc (#4292) · cd40cb88
Lysandre Debut authored May 11, 2020

cd40cb88
Allow gpt2 to be exported to valid ONNX (#4244) · 82601f4c
Tianlei Wu authored May 11, 2020
```
* allow gpt2 to be exported to valid ONNX model

* cast size from int to float explictly
```
82601f4c
CamemBERT does not make use of Token Type IDs (#4289) · 051dcb2a
Lysandre Debut authored May 11, 2020

051dcb2a