Commits · 6c002853a68906a5b1c2dd2ebb416770f1fc322b · chenpangpang / transformers

29 Jul, 2020 1 commit

Added capability to quantize a model while exporting through ONNX. (#6089) · 6c002853

Funtowicz Morgan authored Jul 29, 2020



* Added capability to quantize a model while exporting through ONNX.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

We do not support multiple extensions
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Reformat files
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* More quality
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Ensure test_generate_identified_name compares the same object types
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Added documentation everywhere on ONNX exporter
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Use pathlib.Path instead of plain-old string
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Use f-string everywhere
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Use the correct parameters for black formatting
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Use Python 3 super() style.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Use packaging.version to ensure the installed onnxruntime version matches requirements
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Fixing imports sorting order.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Missing raise(s)
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Added quantization documentation
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Fix some spelling.
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

* Fix bad list header format
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>

6c002853
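The notes above on `pathlib.Path`, single-extension support, and comparing the same object types in `test_generate_identified_name` can be illustrated with a minimal sketch. The helper name `add_identifier` and the `-quantized` suffix are hypothetical, chosen for illustration only; this is not the exporter's actual code.

```python
from pathlib import Path


def add_identifier(path: Path, identifier: str) -> Path:
    """Return `path` with `identifier` inserted before the file extension.

    Hypothetical helper: assumes the file has a single extension
    (e.g. ``model.onnx``), in line with the note that multiple
    extensions are not supported.
    """
    return path.with_name(f"{path.stem}{identifier}{path.suffix}")


# Compare Path with Path, not Path with str.
quantized = add_identifier(Path("onnx/model.onnx"), "-quantized")
assert quantized == Path("onnx/model-quantized.onnx")
```

Returning a `Path` keeps the comparison in a test type-safe: a `Path` never compares equal to a plain string, even when both spell the same location.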

28 Jul, 2020 3 commits
- [Fix] position_ids tests again (#6100) · c49cd927
  Sam Shleifer authored Jul 28, 2020
  
  c49cd927
- Fix #6096: MBartTokenizer's mask token (#6098) · 5abe5038
  Sam Shleifer authored Jul 28, 2020
  
  5abe5038
- MBART: support summarization tasks where max_src_len > max_tgt_len (#6003) · 3c7fbf35
  Sam Shleifer authored Jul 28, 2020
```
* MBART: support summarization tasks

* fix test

* Style

* add tokenizer test
```
  3c7fbf35
27 Jul, 2020 1 commit

Zero shot classification pipeline (#5760) · 3deffc1d

Joe Davison authored Jul 27, 2020

* add initial zero-shot pipeline

* change default args

* update default template

* add label string splitting

* add str labels support, remove nli from name

* style

* add input validation and working tf defaults

* tests

* quality check

* add docstring to __call__

* add slow tests

* Change truncation to only_first

also lower precision on tests for readability

* style

3deffc1d
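The label string splitting, default template, and input validation mentioned above might look roughly like the following sketch. The function names and the exact template text are assumptions made for illustration; the real pipeline then scores each (sequence, hypothesis) pair with an NLI model, which is omitted here.

```python
from typing import List, Union


def parse_labels(labels: Union[str, List[str]]) -> List[str]:
    """Accept either a list of labels or one comma-separated string."""
    if isinstance(labels, str):
        labels = [label.strip() for label in labels.split(",")]
    return labels


def build_pairs(
    sequence: str,
    labels: Union[str, List[str]],
    hypothesis_template: str = "This example is {}.",  # assumed default
) -> List[List[str]]:
    """Pair the input sequence with one hypothesis per candidate label."""
    labels = parse_labels(labels)
    if not sequence or not labels:
        raise ValueError("Provide a non-empty sequence and at least one label.")
    return [[sequence, hypothesis_template.format(label)] for label in labels]


pairs = build_pairs("Who are you voting for?", "politics, sports")
```

Each pair puts the input text first and the generated hypothesis second, so truncating only the first element shortens the text while keeping the label hypothesis intact.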

23 Jul, 2020 2 commits
- Avoid unnecessary warnings when loading pretrained model (#5922) · f5b5c5bd
  Sylvain Gugger authored Jul 23, 2020
```
* Avoid unnecessary warnings when loading pretrained model

* Fix test

* Add other keys to ignore

* keys_to_ignore_at_load -> authorized_missing_keys
```
  f5b5c5bd
- MbartTokenizer: do not hardcode vocab size (#5998) · 9827d666
  Sam Shleifer authored Jul 23, 2020
  
  9827d666
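For the #5922 entry above, the idea behind `authorized_missing_keys` can be sketched as a filter over the list of missing weights before a warning is emitted. The function name and the example patterns are hypothetical; only the general approach (ignoring expected missing keys) comes from the commit message.

```python
import re
from typing import Iterable, List


def filter_missing_keys(
    missing_keys: Iterable[str], authorized_patterns: Iterable[str]
) -> List[str]:
    """Drop keys that match any authorized pattern, keep the rest for the warning."""
    patterns = [re.compile(pattern) for pattern in authorized_patterns]
    return [
        key
        for key in missing_keys
        if not any(pattern.search(key) for pattern in patterns)
    ]


# Only the key that is not explicitly authorized remains to be reported.
remaining = filter_missing_keys(
    ["embeddings.position_ids", "classifier.weight"], [r"position_ids"]
)
assert remaining == ["classifier.weight"]
```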
20 Jul, 2020 2 commits

DataParallel fixes (#5733) · 35cb101e

Stas Bekman authored Jul 20, 2020

* DataParallel fixes:

1. switched to a more precise check
-        if self.args.n_gpu > 1:
+        if isinstance(model, nn.DataParallel):

2. fix tests - require the same fixup under DataParallel as the training module

* another fix

35cb101e

Trainer support for iterabledataset (#5834) · 290b6e18

Pradhy729 authored Jul 20, 2020

* Don't pass sampler for iterable dataset

* Added check for test and eval dataloaders.

* Formatting

* Don't pass sampler for iterable dataset

* Added check for test and eval dataloaders.

* Formatting

* Cleaner if nesting.

* Added test for trainer and iterable dataset

* Formatting for test

* Fixed import when torch is available only.

* Added require torch decorator to helper class

* Moved dataset class inside unittest

* Removed nested if and changed model in test

* Checking torch availability for IterableDataset

290b6e18

18 Jul, 2020 3 commits

Xlnet outputs (#5883) · 4b506a37

Teven authored Jul 18, 2020

Slightly breaking change: changes the behavior of `use_cache` in XLNet. If `use_cache` is True and `mem_len` is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time, `use_cache` is overridden and always True.

4b506a37
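The `mem_len` behavior described in this message can be pictured with a small sketch that uses plain lists in place of hidden-state tensors. The function name `update_mems` and the list representation are assumptions for illustration; this is not the model's actual caching code.

```python
from typing import List, Optional


def update_mems(
    curr_out: List[int], prev_mem: Optional[List[int]], mem_len: Optional[int]
) -> List[int]:
    """Return the memory to carry over to the next generation step.

    Assumption: with mem_len of 0 or None everything is kept (GPT-2-like
    `past`); otherwise only the last `mem_len` positions are kept.
    """
    combined = list(curr_out) if prev_mem is None else prev_mem + curr_out
    if not mem_len:
        return combined
    return combined[-mem_len:]


# No mem_len: the full history is returned for use as `past`.
assert update_mems([3, 4], [1, 2], None) == [1, 2, 3, 4]
# mem_len=3: only the three most recent positions are kept.
assert update_mems([3, 4], [1, 2], 3) == [2, 3, 4]
```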

Revert "Xlnet outputs (#5881)" (#5882) · a5580924
Teven authored Jul 18, 2020
```
This reverts commit 13be4872.
```
a5580924

Xlnet outputs (#5881) · 13be4872

Teven authored Jul 18, 2020

Slightly breaking change: changes the behavior of `use_cache` in XLNet. If `use_cache` is True and `mem_len` is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time, `use_cache` is overridden and always True.

13be4872

17 Jul, 2020 3 commits

Revert "XLNet `use_cache` refactor (#5770)" (#5854) · 615be03f
Teven authored Jul 17, 2020
```
This reverts commit 0b2da0e5.
```
615be03f

XLNet `use_cache` refactor (#5770) · 0b2da0e5

Teven authored Jul 17, 2020

Slightly breaking change: changes the behavior of `use_cache` in XLNet. If `use_cache` is True and `mem_len` is 0 or None (which is the case in the base model config), the model behaves like GPT-2 and returns mems to be used as past in generation. At training time, `use_cache` is overridden and always True.

0b2da0e5

[Reformer] - Cache hidden states and buckets to speed up inference (#5578) · 9d37c56b

Patrick von Platen authored Jul 17, 2020

* fix merge rebase

* add intermediate reformer code

* save intermediate caching results

* save intermediate

* save intermediate results

* save intermediate

* upload next step

* fix generate tests

* make tests work

* add named tuple output

* Apply suggestions from code review

* fix use_cache for False case

* fix tensor to gpu

* fix tensor to gpu

* refactor

* refactor and make style

9d37c56b

16 Jul, 2020 1 commit
- fix benchmark for longformer (#5808) · 89a78be5
  Patrick von Platen authored Jul 16, 2020
  
  89a78be5
15 Jul, 2020 3 commits
- [fix] check code quality (#5772) · 1a647abf
  Sam Shleifer authored Jul 15, 2020
  
  1a647abf
- [fix] T5 ONNX test: model.to(torch_device) (#5769) · d533c7e9
  Funtowicz Morgan authored Jul 15, 2020
```
Signed-off-by: Morgan Funtowicz <morgan@huggingface.co>
```
  d533c7e9
- [cleanup] T5 test, warnings (#5761) · d0486c8b
  Sam Shleifer authored Jul 15, 2020
  
  d0486c8b
14 Jul, 2020 2 commits

[fix] mbart_en_ro_generate test now identical to fairseq (#5731) · 838950ee
Sam Shleifer authored Jul 14, 2020

838950ee

[Reformer classification head] Implement the reformer model classification... · f867000f

as-stevens authored Jul 14, 2020


[Reformer classification head] Implement the reformer model classification head for text classification (#5198)

* Reformer model head classification implementation for text classification

* Reformat the reformer model classification code

* PR review comments, and test case implementation for reformer for classification head changes

* CI/CD reformer for classification head test import error fix

* CI/CD test case implementation  added ReformerForSequenceClassification to all_model_classes

* Code formatting- fixed

* Normal test cases added for reformer classification head

* Fix test cases implementation for the reformer classification head

* removed token_type_id parameter from the reformer classification head

* fixed the test case for reformer classification head

* merge conflict with master fixed

* merge conflict, changed reformer classification to accept the choice_label parameter added in latest code

* refactored the reformer classification head test code

* reformer classification head, common transform test cases fixed

* final set of the review comment, rearranging the reformer classes and docstring add to classification forward method

* fixed the compilation error and test case for reformer classification head

* Apply suggestions from code review

Remove unnecessary dup
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

f867000f

13 Jul, 2020 2 commits
- FlaubertForTokenClassification (#5644) · 45addfe9
  Stas Bekman authored Jul 13, 2020
```
* implement FlaubertForTokenClassification as a subclass of XLMForTokenClassification

* fix mapping order

* add the doc

* add common tests
```
  45addfe9
- rename the function to match the rest of the test convention (#5692) · 443b0cad
  Stas Bekman authored Jul 13, 2020
  
  443b0cad
10 Jul, 2020 1 commit

Change model outputs types to self-document outputs (#5438) · edfd82f5

Sylvain Gugger authored Jul 10, 2020

* [WIP] Proposal for model outputs

* All Bert models

* Make CI green maybe?

* Fix ONNX test

* Isolate ModelOutput from pt and tf

* Formatting

* Add Electra models

* Auto-generate docstrings from outputs

* Add TF outputs

* Add some BERT models

* Revert TF side

* Remove last traces of TF changes

* Fail with a clear error message

* Add Albert and work through Bart

* Add CTRL and DistilBert

* Formatting

* Progress on Bart

* Renames and finish Bart

* Formatting

* Fix last test

* Add DPR

* Finish Electra and add FlauBERT

* Add GPT2

* Add Longformer

* Add MMBT

* Add MobileBert

* Add GPT

* Formatting

* Add Reformer

* Add Roberta

* Add T5

* Add Transformer XL

* Fix test

* Add XLM + fix XLMForTokenClassification

* Style + XLMRoberta

* Add XLNet

* Formatting

* Add doc of return_tuple arg

edfd82f5
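The notion of self-documenting outputs could be pictured roughly as a dataclass whose fields are accessed by name, with a tuple view retained for older call sites. The class and field names below are illustrative assumptions and do not reproduce the library's `ModelOutput` implementation.

```python
from dataclasses import dataclass, fields
from typing import Any, Optional, Tuple


@dataclass
class SketchClassifierOutput:
    """Hypothetical output container; field names are illustrative only."""

    loss: Optional[Any] = None
    logits: Optional[Any] = None
    hidden_states: Optional[Any] = None

    def to_tuple(self) -> Tuple[Any, ...]:
        """Return the fields that are set, in declaration order."""
        values = (getattr(self, field.name) for field in fields(self))
        return tuple(value for value in values if value is not None)


out = SketchClassifierOutput(logits=[0.1, 0.9])
# Named access documents what the value is; the tuple view skips unset fields.
assert out.logits == [0.1, 0.9]
assert out.to_tuple() == ([0.1, 0.9],)
```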

08 Jul, 2020 2 commits

Fix Inconsistent NER Grouping (Pipeline) (#4987) · 0cc4eae0

Lorenzo Ampil authored Jul 09, 2020



* Add B I handling to grouping

* Add fix to include separate entity as last token

* move last_idx definition outside loop

* Use first entity in entity group as reference for entity type

* Add test cases

* Take out extra class accidentally added

* Return tf ner grouped test to original

* Take out redundant last entity

* Get last_idx safely
Co-authored-by: ColleterVi <36503688+ColleterVi@users.noreply.github.com>

* Fix first entity comment

* Create separate functions for group_sub_entities and group_entities (splitting call method to testable functions)

* Take out unnecessary last_idx

* Remove additional forward pass test

* Move token classification basic tests to separate class

* Move token classification basic tests back to monocolumninputtestcase

* Move base ner tests to nerpipelinetests

* Take out unused kwargs

* Add back mandatory_keys argument

* Add unitary tests for group_entities in _test_ner_pipeline

* Fix last entity handling

* Fix grouping function used

* Add typing to group_sub_entities and group_entities
Co-authored-by: ColleterVi <36503688+ColleterVi@users.noreply.github.com>

0cc4eae0
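The B/I handling and the "first entity as reference for entity type" rule listed above can be sketched in plain Python. The dictionary keys (`word`, `entity`, `index`) and the output shape are assumptions for illustration, and the input is assumed to contain only `B-`/`I-` tagged tokens; the pipeline's real `group_entities` may differ.

```python
from typing import Dict, List


def entity_type(tag: str) -> str:
    """Strip the B-/I- prefix from a tag such as 'B-LOC'."""
    return tag.partition("-")[2]


def group_entities(tokens: List[Dict]) -> List[Dict]:
    """Group adjacent tokens of the same entity type into one span."""
    groups: List[List[Dict]] = []
    for token in tokens:
        starts_new_group = (
            not groups
            or token["entity"].startswith("B-")
            or entity_type(groups[-1][0]["entity"]) != entity_type(token["entity"])
            or token["index"] != groups[-1][-1]["index"] + 1
        )
        if starts_new_group:
            groups.append([token])
        else:
            groups[-1].append(token)
    # The first token of each group decides the group's entity type.
    return [
        {
            "entity_group": entity_type(group[0]["entity"]),
            "word": " ".join(token["word"] for token in group),
        }
        for group in groups
    ]


tokens = [
    {"word": "New", "entity": "B-LOC", "index": 1},
    {"word": "York", "entity": "I-LOC", "index": 2},
    {"word": "Paris", "entity": "B-LOC", "index": 3},
]
grouped = group_entities(tokens)
```

Because the loop appends every token to some group, the last entity is emitted without any special handling after the loop.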

[Benchmark] Add benchmarks for TF Training (#5594) · f82a2a5e

Patrick von Platen authored Jul 08, 2020

* tf_train

* adapt timing for tpu

* fix timing

* fix timing

* fix timing

* fix timing

* update notebook

* add tests

f82a2a5e

07 Jul, 2020 8 commits

Add mbart-large-cc25, support translation finetuning (#5129) · 353b8f1e

Sam Shleifer authored Jul 07, 2020

improve unittests for finetuning, especially w.r.t testing frozen parameters
fix freeze_embeds for T5
add streamlit setup.cfg

353b8f1e

[Almost all TF models] TF clean up: add missing CLM / MLM loss; fix T5 naming... · 4dc65591

Patrick von Platen authored Jul 07, 2020


[Almost all TF models] TF clean up: add missing CLM / MLM loss; fix T5 naming and keras compile (#5395)

* add first version of clm tf

* make style

* add more tests for bert

* update tf clm loss

* fix tests

* correct tf ner script

* add mlm loss

* delete bogus file

* clean tf auto model + add tests

* finish adding clm loss everywhere

* fix training in distilbert

* fix flake8

* save intermediate

* fix tf t5 naming

* remove prints

* finish up

* up

* fix tf gpt2

* fix new test utils import

* fix flake8

* keep backward compatibility

* Update src/transformers/modeling_tf_albert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_electra.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_roberta.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_mobilebert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_auto.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_bert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_tf_distilbert.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply sylvains suggestions
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

4dc65591

Fix tests imports dpr (#5576) · 4fedc125
Quentin Lhoest authored Jul 07, 2020
```
* fix test imports

* fix max_length

* style

* fix tests
```
4fedc125
[Bart] enable test_torchscript, update test_tie_weights (#5457) · d4886173
Sam Shleifer authored Jul 07, 2020
```
* Passing all but one torchscript test

* Style

* move comment

* remove unneeded assert
```
d4886173

Add DPR model (#5279) · fbd87921

Quentin Lhoest authored Jul 07, 2020



* beginning of dpr modeling

* wip

* implement forward

* remove biencoder + better init weights

* export dpr model to embed model for nlp lib

* add new api

* remove old code

* make style

* fix dumb typo

* don't load bert weights

* docs

* docs

* style

* move the `k` parameter

* fix init_weights

* add pretrained configs

* minor

* update config names

* style

* better config

* style

* clean code based on PR comments

* change Dpr to DPR

* fix config

* switch encoder config to a dict

* style

* inheritance -> composition

* add messages in assert statements

* add dpr reader tokenizer

* one tokenizer per model

* fix base_model_prefix

* fix imports

* typo

* add convert script

* docs

* change tokenizers conf names

* style

* change tokenizers conf names

* minor

* minor

* fix wrong names

* minor

* remove unused convert functions

* rename convert script

* use return_tensors in tokenizers

* remove n_questions dim

* move generate logic to tokenizer

* style

* add docs

* docs

* quality

* docs

* add tests

* style

* add tokenization tests

* DPR full tests

* Stay true to the attention mask building

* update docs

* missing param in bert input docs

* docs

* style
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>

fbd87921

Make T5 compatible with ONNX (#5518) · 69122657

Abel authored Jul 07, 2020



* Default decoder inputs to encoder ones for T5 if neither are specified.

* Fixing typo, now all tests are passing.

* Changing einsum to operations supported by onnx

* Adding a test to ensure T5 can be exported to onnx op>9

* Modified test for onnx export to make it faster

* Styling changes.

* Styling changes.

* Changing notation for matrix multiplication
Co-authored-by: Abel Riboulot <tkai@protomail.com>

69122657

[Reformer] Adapt Reformer MaskedLM Attn mask (#5560) · 989ae326
Patrick von Platen authored Jul 07, 2020
```
* fix attention mask

* fix slow test

* refactor attn masks

* fix fp16 generate test
```
989ae326

Added data collator for permutation (XLNet) language modeling and related calls (#5522) · 3dcb748e

Shashank Gupta authored Jul 07, 2020

* Added data collator for XLNet language modeling and related calls

Added DataCollatorForXLNetLanguageModeling in data/data_collator.py
to generate necessary inputs for language modeling training with
XLNetLMHeadModel. Also added related arguments, logic and calls in
examples/language-modeling/run_language_modeling.py.

Resolves: #4739, #2008 (partially)

* Changed name to `DataCollatorForPermutationLanguageModeling`

Changed the name of `DataCollatorForXLNetLanguageModeling` to the more general `DataCollatorForPermutationLanguageModeling`.
Removed the `--mlm` flag requirement for the new collator and defined a separate `--plm_probability` flag for its use.
CTRL uses a CLM loss just like GPT and GPT-2, so it should work out of the box with this script (provided `past` is handled
similarly to `mems` for XLNet).
Changed calls and imports appropriately.

* Added detailed comments, changed variable names

Added more detailed comments to `DataCollatorForPermutationLanguageModeling` in `data/data_collator.py` to explain how it works. Also cleaned up variable names and made them more informative.

* Added tests for new data collator

Added tests in `tests/test_trainer.py` for DataCollatorForPermutationLanguageModeling based on those in DataCollatorForLanguageModeling. A specific test has been added to check for odd-length sequences.

* Fixed styling issues

3dcb748e

06 Jul, 2020 1 commit

Various tokenizers fixes (#5558) · 5787e4c1

Anthony MOI authored Jul 06, 2020

* BertTokenizerFast - Do not specify strip_accents by default

* Bump tokenizers to new version

* Add test for AddedToken serialization

5787e4c1

03 Jul, 2020 2 commits

[cleanup] TF T5 tests only init t5-base once. (#5410) · 58cca47c
Sam Shleifer authored Jul 03, 2020

58cca47c

Exposing prepare_for_model for both slow & fast tokenizers (#5479) · 17ade127

Lysandre Debut authored Jul 03, 2020



* Exposing prepare_for_model for both slow & fast tokenizers

* Update method signature

* The traditional style commit

* Hide the warnings behind the verbose flag

* update default truncation strategy and prepare_for_model

* fix tests and prepare_for_models methods
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>

17ade127

02 Jul, 2020 1 commit

Changed expected_output_ids in TransfoXL generation test (#5462) · 6726416e

Teven authored Jul 02, 2020

* Changed expected_output_ids in TransfoXL generation test to match #4826 generation PR.

* making black happy

* making isort happy

6726416e

01 Jul, 2020 2 commits
- [Reformer] Add Masked LM Reformer (#5426) · d16e36c7
  Patrick von Platen authored Jul 01, 2020
```
* fix conflicts

* fix

* happy rebasing
```
  d16e36c7
- Fix tensor label type inference in default collator (#5250) · 35befd9c
  Joe Davison authored Jul 01, 2020
```
* allow tensor label inputs to default collator

* replace try/except with type check
```
  35befd9c