"vscode:/vscode.git/clone" did not exist on "eb8b1bda79ffef2ff219bb85f7445f5ad95e2c74"
- 03 Aug, 2020 10 commits
-
-
Victor SANH authored
-
Kevin Canwen Xu authored
* Remove outdated BERT tips
* Update modeling_outputs.py
* Update bert.rst
* Update bert.rst
-
Sylvain Gugger authored
* Init work on pipelines doc
* Work in progress
* Work in progress
* Doc pipelines
* Rm unwanted default
* Apply suggestions from code review (Lysandre's comments)

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Sam Shleifer authored
-
Maurice Gonzenbach authored
-
Suraj Patil authored
-
Jay Mody authored
* Adds train_batch_size, eval_batch_size, and n_gpu to to_sanitized_dict() output
* Update wandb config logging to use to_sanitized_dict
* removed n_gpu from sanitized dict
* fix quality check errors
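For context, a minimal sketch of how the sanitized arguments can feed a wandb config. Names follow the commit; the exact call site inside Trainer may differ.

```python
# A hedged sketch: log sanitized training arguments to Weights & Biases.
# Assumes transformers and wandb are installed; project name is hypothetical.
import wandb
from transformers import TrainingArguments

args = TrainingArguments(output_dir="out", per_device_train_batch_size=8)

# to_sanitized_dict() returns a JSON-serializable view of the arguments,
# now including train_batch_size and eval_batch_size per this commit.
wandb.init(project="my-project", config=args.to_sanitized_dict())
```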
-
Julien Plu authored
* Fix TF Serving when output_hidden_states and output_attentions are True
* Add tests for saved model creation + bug fix for multiple choice models
* Remove unused import
* Fix the input for several layers
* Fix test
* Fix conflict printing
* Apply style
* Fix XLM and Flaubert for TensorFlow
* Apply style
* Fix TF check version
* Apply style
* Trigger CI
-
Teven authored
* Fixed empty asserts
* black-reformatted stragglers in templates
* More code quality checks
* Update src/transformers/convert_marian_to_pytorch.py
* Update src/transformers/convert_marian_to_pytorch.py
* removed unused line as per @sshleifer

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
Martin Müller authored
* Add a script to convert TF2.x checkpoints to PyTorch. The script converts the newer TF2.x checkpoints (as published on the official GitHub: https://github.com/tensorflow/models/tree/master/official/nlp/bert) to PyTorch.
* Rename the file to stay consistent with the naming convention
-
- 02 Aug, 2020 1 commit
-
-
Philip May authored
- remove duplicate doc-entry for tokenize_chinese_chars
- add doc for strip_accents and wordpieces_prefix
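As a hedged illustration of the newly documented options, assuming BertTokenizer accepts them as constructor keyword arguments (per the docs this commit touches); the checkpoint name is just an example:

```python
# A minimal sketch of the strip_accents / wordpieces_prefix options.
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained(
    "bert-base-german-cased",
    strip_accents=False,  # keep accents, e.g. for German umlauts
    # wordpieces_prefix defaults to "##", the subword continuation marker
)
print(tokenizer.tokenize("Müller"))
```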
-
- 01 Aug, 2020 2 commits
-
-
Stas Bekman authored
Co-authored-by: Sam Shleifer <sshleifer@gmail.com>
-
Faiaz Rahman authored
-
- 31 Jul, 2020 10 commits
-
-
Joe Davison authored
* Add bart-large-mnli-yahoo-answers model card
* Add examples
* Add widget example
* Rm bart tag

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Sylvain Gugger authored
* Use return_dict=True in all tests
* Formatting
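A minimal sketch of the pattern the tests now use; the checkpoint is just an example:

```python
# With return_dict=True, outputs are ModelOutput objects with named fields
# instead of positional tuples.
import torch
from transformers import BertForSequenceClassification, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", return_dict=True
)

inputs = tokenizer("Hello world", return_tensors="pt")
outputs = model(**inputs, labels=torch.tensor([1]))
print(outputs.loss, outputs.logits.shape)  # named access instead of indices
```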
-
Sylvain Gugger authored
* Harmonize both Trainers API
* Fix test
* main_prcess -> process_zero
-
Mehrdad Farahani authored
* Add model card README for m3hrdadfi/albert-fa-base-v2
* Update model_cards/m3hrdadfi/albert-fa-base-v2/README.md

Co-authored-by: Julien Chaumond <chaumond@gmail.com>
-
Suraj Patil authored
* add parse_dict to parse arguments from dict
* add unit test for parse_dict
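A minimal sketch of the new method, assuming the usual dataclass-based argument definition; the dataclass here is hypothetical:

```python
from dataclasses import dataclass
from transformers import HfArgumentParser

@dataclass
class ModelArguments:
    model_name: str = "bert-base-uncased"
    dropout: float = 0.1

parser = HfArgumentParser(ModelArguments)
# parse_dict fills the dataclasses from a plain dict instead of sys.argv
(model_args,) = parser.parse_dict({"model_name": "gpt2", "dropout": 0.2})
print(model_args.model_name)  # gpt2
```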
-
Paul O'Leary McCann authored
* Replace mecab-python3 with fugashi

  This replaces mecab-python3 with fugashi for Japanese tokenization. I am the maintainer of both projects. Both projects are MeCab wrappers, so the underlying C++ code is the same. fugashi is the newer wrapper and doesn't use SWIG, so for basic use of the MeCab API it's easier to use. This code ensures the use of a version of ipadic installed via pip, which should make versioning and tracking down issues easier. fugashi has wheels for Windows, OSX, and Linux, which will help with the issues around installing old versions of mecab-python3 on Windows. And because fugashi doesn't use SWIG, it doesn't require a C++ runtime to be installed on Windows, unlike mecab-python3.

  In making this change I removed some code dealing with the `cursor`, `token_start`, and `token_end` variables. These variables didn't seem to be used for anything, and it is unclear to me why they were there. I ran the tests and they passed, though I couldn't figure out how to run the slow tests (`--runslow` gave an error) and didn't try testing with TensorFlow.

* Style fix
* Remove unused variable (forgot to delete this)
* Adapt doc with install instructions
* Fix typo

Co-authored-by: sgugger <sylvain.gugger@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
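A hedged sketch of basic fugashi usage on its own, assuming a dictionary package (e.g. ipadic or unidic-lite, per the commit's install notes) is installed alongside it:

```python
import fugashi

tagger = fugashi.Tagger()  # picks up the pip-installed dictionary
words = [word.surface for word in tagger("日本語のトークン化は簡単です")]
print(words)
```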
-
Stas Bekman authored
* Enable easy checkout switch: allow having multiple repository checkouts without needing to remember to rerun 'pip install -e .[dev]' when switching between checkouts and running tests
* make isort happy
* examples need one too
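The mechanism is presumably a small sys.path shim; a hedged sketch, assuming it lives in a top-level conftest.py (the message does not confirm the file or path):

```python
# Prepend this checkout's src/ to sys.path so tests import the local code
# even when a different checkout was pip-installed.
import sys
from os.path import abspath, dirname, join

git_repo_path = abspath(join(dirname(__file__), "src"))
sys.path.insert(1, git_repo_path)
```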
-
kolk authored
-
Prajjwal Bhargava authored
* Fix typo; add PyTorch native CUDA AMP support
* Revert commit on modeling_utils
* Conform to HF black formatting rule
* Change bool value of _use_apex
* Add scaler support for gradient clipping
* Fix in-place operation of clip_grad_norm
* Remove `not` in version comparison
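A minimal sketch of the technique (native AMP with gradient clipping), assuming PyTorch >= 1.6; this mirrors the pattern, not Trainer's exact code:

```python
import torch

model = torch.nn.Linear(10, 2).cuda()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
scaler = torch.cuda.amp.GradScaler()

inputs = torch.randn(4, 10, device="cuda")
labels = torch.randint(0, 2, (4,), device="cuda")

with torch.cuda.amp.autocast():  # run forward pass in mixed precision
    loss = torch.nn.functional.cross_entropy(model(inputs), labels)

scaler.scale(loss).backward()
scaler.unscale_(optimizer)  # unscale gradients before clipping
torch.nn.utils.clip_grad_norm_(model.parameters(), 1.0)
scaler.step(optimizer)
scaler.update()
optimizer.zero_grad()
```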
-
Funtowicz Morgan authored
* Add onnxruntime transformers optimization support
* Added Optimization section in ONNX/ONNXRuntime documentation
* Improve note reference
* Fixing imports order
* Add warning about different level of optimization between torch and tf export
* Address @LysandreJik wording suggestion
* Address @LysandreJik wording suggestion
* Always optimize model before quantization for maximum performance
* Address comments on the documentation
* Improve TensorFlow optimization message as suggested by @yufenglee
* Removed --optimize parameter
* Warn the user about the current quantization limitation when the model is larger than 2GB
* Trigger CI for last check
* Small change in print for the optimization section

Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
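A hedged sketch of letting onnxruntime optimize an exported graph at load time; the file paths are hypothetical, and the commit's own helper may wire this up differently:

```python
import onnxruntime as ort

options = ort.SessionOptions()
options.graph_optimization_level = ort.GraphOptimizationLevel.ORT_ENABLE_ALL
# Optionally persist the optimized graph next to the original model:
options.optimized_model_filepath = "model-optimized.onnx"

session = ort.InferenceSession("model.onnx", options)
```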
-
- 30 Jul, 2020 10 commits
-
-
Stas Bekman authored
-
Stas Bekman authored
* 2 small typos
* more typos
* correct path
-
Sylvain Gugger authored
* Start doc tokenizers
* Tokenizer documentation
* Formatting after rebase
* Formatting after merge
* Update docs/source/main_classes/tokenizer.rst
* Address comment
* Update src/transformers/tokenization_utils_base.py
* Address Thom's comments

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
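A minimal sketch of the tokenizer call API these docs describe; the checkpoint name is just an example:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
encoded = tokenizer(
    ["Hello world", "A longer second sentence"],
    padding=True,      # pad to the longest sequence in the batch
    truncation=True,   # truncate to the model's maximum length
    return_tensors="pt",
)
print(encoded["input_ids"].shape)
```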
-
guillaume-be authored
* Initial commit for pipeline implementation: addition of input processing and history concatenation
* Conversation pipeline tested and working for single & multiple conversation inputs
* Added docstrings for dialogue pipeline
* Addition of dialogue pipeline integration tests
* Delete test_t5.py
* Fixed max code length
* Updated styling
* Fixed test broken by formatting tools
* Removed unused import
* Added unit test for DialoguePipeline
* Fixed TensorFlow compatibility
* Fixed multi-framework support using framework flag
* Fixed docstring; added `min_length_for_response` as an initialization parameter; renamed `*args` to `conversations`, `conversations` being a `Conversation` or a `List[Conversation]`; updated truncation to truncate entire segments of conversations instead of cutting in the middle of a user/bot input
* Renamed pipeline from dialogue to conversational; removed the hardcoded default value of 1000 and use config.max_length instead; added `append_response` and `set_history` methods to the Conversation class to avoid direct field mutation; fixed a bug in the history truncation method
* Updated ConversationalPipeline to accept only active conversations (otherwise a ValueError is raised)
* Simplified input tensor conversion
* Updated attention_mask value for TensorFlow compatibility
* Updated last dialogue reference to conversational & fixed integration tests
* Fixed conflict with master
* Updates following review comments
* Updated formatting
* Added Conversation and ConversationalPipeline to the library __init__, added docstrings for Conversation, added both to the docs
* Update src/transformers/pipelines.py: updated docstring following review

Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
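A minimal sketch of the new pipeline; class and task names come from the commit itself, and the default model is whatever the pipeline selects:

```python
from transformers import Conversation, pipeline

conversational = pipeline("conversational")

conversation = Conversation("Going to the movies tonight - any suggestions?")
conversation = conversational(conversation)
print(conversation.generated_responses[-1])

# Per the commit, passing an inactive conversation raises a ValueError.
```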
-
Lysandre Debut authored
* Fix GPU test
* Remove legacy constructor
-
Sylvain Gugger authored
* Switch from return_tuple to return_dict
* Fix test
* [WIP] Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleChoice} models and tests (#5614)
  * Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleChoice} models and tests
  * AutoModels Tiny tweaks
  * Style
  * Final changes before merge
  * Re-order for simpler review
  * Final fixes
  * Addressing @sgugger's comments
  * Test MultipleChoice
* Rework TF trainer (#6038)
  * Fully rework training/prediction loops
  * fix method name
  * Fix variable name
  * Fix property name
  * Fix scope
  * Fix method name
  * Fix tuple index
  * Fix tuple index
  * Fix indentation
  * Fix variable name
  * fix eval before log
  * Add drop remainder for test dataset
  * Fix step number + fix logging datetime
  * fix eval loss value
  * use global step instead of step + fix logging at step 0
  * Fix logging datetime
  * Fix global_step usage
  * Fix breaking loop + logging datetime
  * Fix step in prediction loop
  * Fix step breaking
  * Fix train/test loops
  * Force TF at least 2.2 for the trainer
  * Use assert_cardinality to facilitate the dataset size computation
  * Log steps per epoch
  * Make tfds compliant with TPU
  * Make tfds compliant with TPU
  * Use TF dataset enumerate instead of the Python one
  * revert previous commit
  * Fix data_dir
  * Apply style
  * rebase on master
  * Address Sylvain's comments
  * Address Sylvain's and Lysandre comments
  * Trigger CI
  * Remove unused import
* Switch from return_tuple to return_dict
* Fix test
* Add recent model

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Julien Plu <plu.julien@gmail.com>
-
Sylvain Gugger authored
* Clean up TFTrainer
* Add import
* Fix conflicts
-
Oren Amsalem authored
-
Oren Amsalem authored
a = tokenizer.encode("we got a <extra_id_99>", return_tensors='pt', add_special_tokens=True)
print(a)
> tensor([[ 62, 530, 3, 9, 32000]])
a = tokenizer.encode("we got a <extra_id_100>", return_tensors='pt', add_special_tokens=True)
print(a)
> tensor([[ 62, 530, 3, 9, 3, 2, 25666, 834, 23, 26, 834, 2915, 3155]])
-
Stas Bekman authored
-
- 29 Jul, 2020 7 commits
-
-
Julien Plu authored
* Fully rework training/prediction loops
* fix method name
* Fix variable name
* Fix property name
* Fix scope
* Fix method name
* Fix tuple index
* Fix tuple index
* Fix indentation
* Fix variable name
* fix eval before log
* Add drop remainder for test dataset
* Fix step number + fix logging datetime
* fix eval loss value
* use global step instead of step + fix logging at step 0
* Fix logging datetime
* Fix global_step usage
* Fix breaking loop + logging datetime
* Fix step in prediction loop
* Fix step breaking
* Fix train/test loops
* Force TF at least 2.2 for the trainer
* Use assert_cardinality to facilitate the dataset size computation
* Log steps per epoch
* Make tfds compliant with TPU
* Make tfds compliant with TPU
* Use TF dataset enumerate instead of the Python one
* revert previous commit
* Fix data_dir
* Apply style
* rebase on master
* Address Sylvain's comments
* Address Sylvain's and Lysandre comments
* Trigger CI
* Remove unused import
-
Lysandre Debut authored
* Test TF Flaubert + Add {XLM, Flaubert}{TokenClassification, MultipleChoice} models and tests
* AutoModels Tiny tweaks
* Style
* Final changes before merge
* Re-order for simpler review
* Final fixes
* Addressing @sgugger's comments
* Test MultipleChoice
-
Sylvain Gugger authored
* Use google style to document properties
* Update src/transformers/configuration_utils.py

Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
-
Julien Plu authored
-
Lysandre Debut authored
-
Timo Moeller authored
* Add xlm-r QA model card
* Add tags
-
Funtowicz Morgan authored
* Added capability to quantize a model while exporting through ONNX (we do not support multiple extensions)
* Reformat files
* More quality
* Ensure test_generate_identified_name compares the same object types
* Added documentation everywhere on the ONNX exporter
* Use pathlib.Path instead of plain-old strings
* Use f-strings everywhere
* Use the correct parameters for black formatting
* Use Python 3 super() style
* Use packaging.version to ensure the installed onnxruntime version matches requirements
* Fixing imports sorting order
* Added missing raise(s)
* Added quantization documentation
* Fix some spelling
* Fix bad list header format

Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
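A hedged sketch of dynamic weight quantization on an exported ONNX model; the quantize_dynamic helper is onnxruntime's public API and may differ from the helper this commit adds to the export script, and the file paths are hypothetical:

```python
from onnxruntime.quantization import QuantType, quantize_dynamic

quantize_dynamic(
    "model.onnx",            # hypothetical input path
    "model-quantized.onnx",  # hypothetical output path
    weight_type=QuantType.QInt8,  # quantize weights to 8-bit integers
)
```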
-