Commits · ef47d4f848ec4f18f555aff5f2c89e5c429206b7 · chenpangpang / transformers

23 Dec, 2021 9 commits

[AutoTokenizer] Fix incorrect from pretrained (#14900) · ef47d4f8
Patrick von Platen authored Dec 23, 2021

ef47d4f8

Yih-Dar authored Dec 23, 2021



* Start the work for TFCLIPModel

* Convert to TF code (TODO: loss + doc)

* Clean up

* Fix pooled_output for TFCLIPTextTransformer - using tf.gather_nd

* assert -> raise error

* Expose TFCLIPModel

* Deal with dummy_inputs

* Add tests

* Fix all tests. TODO: manual check weight loading + add more comments

* Fix pt tf equivalence test

* fixes

* update TFCLIPVisionEmbeddings's Conv2D

* Fix loss + overwrite test_pt_tf_model_equivalence from common

* Add a comment about the change about MainLayer in test_keras_save_load

* Set return_loss=True in TFCLIPModelTester + make tests pass

* overwrite test_pt_tf_model_equivalence from tf common

* fix base_model_prefix

* Fix examples

* remove unused

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply review suggestions

* change self.pre_layrnorm to self.pre_layernorm

* apply more review suggestions

* return attention probs before dropout (to align with PT)

* fix weight init

* fix

* build doc

* fix missing doc

* fix for test
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8f2cc1c3

Set `run_name` in MLflowCallback (#14894) · 2d30443c
Yang Dong authored Dec 23, 2021
```
* Set run_name in MLflowCallback

* Update the docs for `run_name` argument
```
2d30443c
add custom stopping criteria to human eval script (#14897) · 1d651868
Leandro von Werra authored Dec 23, 2021

1d651868

Add ONNX support for MarianMT models (#14586) · 6b655cc6

lewtun authored Dec 23, 2021

* First commit to add MarianMT to ONNX

* Now MarianModel.forward() automatically generates decoder_input_ids, like BartModel.forward()

* Adjusted MarianOnnxConfig.inputs and outputs to work with seq2seq-lm feature

* Style fix

* Added support for other features for already supported models

* Partial support for causal and seq2seq models

* Partial support for causal and seq2seq models

* Add default task for MarianMT ONNX

* Remove automatic creation of decoder_input_ids

* Extend inputs and outputs for MarianMT ONNX config

* Add MarianMT to ONNX unit tests

* Refactor

* OnnxSeq2SeqConfigWithPast to support seq2seq models

* Parameterized the onnx tests

* Restored run_mlm.py

* Restored run_mlm.py

* [WIP] BART update

* BART and MBART

* Add past_key_values and fix dummy decoder inputs

Using a sequence length of 1 in generate_dummy_outputs() produces large discrepancies, presumably due to some hidden optimisations.

* Refactor MarianOnnxConfig to remove custom past_key_values logic

* Fix quality

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Refactor Marian export to account for base changes

* Fix copies

* Implemented suggestions

* Extend support for causal LM

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Implemented suggestions

* Fixed __init__ to resolve conflict with master

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5

.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Implemented suggestions

* Fixed __init__ to resolve conflict with master

* Remove commented import

* Remove ONNX model

* Remove redundant class method

* Tidy up imports

* Fix quality

* Refactor dummy input function

* Add copied from statements to Marian config functions

* Remove false copied from comments

* Fix copy from comment
Co-authored-by: Massimiliano Bruni <massimiliano.bruni@hcl.com>
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

6b655cc6

Add 'with torch.no_grad()' to integration test forward pass (#14808) · 6a7b9da2
Henrik Holm authored Dec 23, 2021

6a7b9da2
Fix AttributeError from PreTrainedTokenizerFast.decoder (#14691) · d8c09c65
Alex Hedges authored Dec 23, 2021

d8c09c65

Fix doc examples: ... takes no keyword arguments (#14701) · 42105795

Yih-Dar authored Dec 23, 2021



* Fix doc examples: ... takes no keyword arguments

* fix copies
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

42105795

Fix installation instructions for BART ONNX example (#14885) · 355dc0ce
lewtun authored Dec 23, 2021

355dc0ce

22 Dec, 2021 14 commits

Convert rst files (#14888) · 207594be

Sylvain Gugger authored Dec 22, 2021

* Convert all tutorials and guides

* Convert all remaining rst to mdx

* Track and fix bad links

207594be

Keras metric callback (#14867) · b0c7d2ec

Matt authored Dec 22, 2021



* Working on splitting out labels

* First working version

* Fixed concatenation of outputs and labels

* val_dataset -> eval_dataset

* Only pass input arrays in tokenizer.model_input_names

* Only pass input arrays in tokenizer.model_input_names

* Only remove unexpected keys when predict_with_generate is True

* Adding proper docstring

* Adding example to docstring

* Add a proper ROUGE metric example

* Add a proper ROUGE metric example

* Add version checking

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Remove requirement for tokenizer with predict_with_generate
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b0c7d2ec

Docs for v4.16.0dev0 · fa39ff9f
Patrick von Platen authored Dec 22, 2021

fa39ff9f
Release: v4.15.0 · 05fa1a7a
Patrick von Platen authored Dec 22, 2021

05fa1a7a
Properly indent return block (#14887) · 87a033d9
Sylvain Gugger authored Dec 22, 2021

87a033d9

Onnx enable tasks for supported models (part 2) (#14700) · 13504dcb

Michael Benayoun authored Dec 22, 2021

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Implemented suggestions

* Fixed __init__ to resolve conflict with master

* Remove commented import

13504dcb

Fix pytorch image classification example (#14883) · 1045a36c
Mario Šaško authored Dec 22, 2021
```
* Update example

* Remove skip in tests
```
1045a36c
Fix Perceiver docs (#14879) · 7df4b90c
NielsRogge authored Dec 22, 2021

7df4b90c
Fix typo in error message · e37bc579
Sylvain Gugger authored Dec 22, 2021

e37bc579
IterableDatasetShard should use per device batch size instead of real batch size (#14714) · 17efc806
charon____ authored Dec 22, 2021

17efc806

Updated deberta attention (#14625) · 2a56edb3

guillaume-be authored Dec 22, 2021



* Removed unused p2p attention handling

* Updated DeBERTa configuration

* Updated TF DeBERTa attention

* Rolled back accidental comment deletion
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

2a56edb3

Feature/fix slow test in mluke (#14749) · 824fd44f

Ryokan RI authored Dec 22, 2021

* make MLukeTokenizerTest fast

* make LukeTokenizerTest fast

* add entry to _toctree.yaml

824fd44f

update the arguments `add_prefix_space` and `trim_offsets` in... · c94c1b89

SaulLu authored Dec 22, 2021

update the arguments `add_prefix_space` and `trim_offsets` in `backend_tokenizer.post_processor` of `RobertaTokenizerFast` (#14752)

* add tests

* change post-processor, pre-tokenizer and decoder (can't update decoder)

* update test (remove decoder which doesn't depend on trim and add_prefix)

* just update the post_processor

* fix change

* `trim_offsets` has no influence on `pre_tokenizer`

* remove a test that need some input from the `tokenizers` lib maintainers

* format

* add new test offsets roberta

* polish comments

c94c1b89

Convert model files from rst to mdx (#14865) · ec3567fe

Lysandre Debut authored Dec 22, 2021



* First pass

* Apply suggestions from code review

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

ec3567fe

21 Dec, 2021 17 commits

Fix doc mistakes (#14874) · d0422de5
Sylvain Gugger authored Dec 21, 2021
```
* Remove double returns

* Last fixes

* Quality

* Last fix for Lxmert
```
d0422de5
Fix `FlaxMarianMTModel` return block. (#14873) · e846a56c
Sylvain Gugger authored Dec 21, 2021
```
* Fixes in marian doc

* Another time

* Add return block in FlaxMarianMTModel
```
e846a56c
Fixes in marian doc (#14872) · a6b7b47a
Sylvain Gugger authored Dec 21, 2021
```
* Fixes in marian doc

* Another time
```
a6b7b47a
Fix FLAX_MULTIPLE_CHOICE_SAMPLE typo (#14871) · eec9c8bb
Mishig Davaadorj authored Dec 21, 2021

eec9c8bb
Skip failing test · e51c7b58
Sylvain Gugger authored Dec 21, 2021

e51c7b58

Mass conversion of documentation from rst to Markdown (#14866) · 27b3031d

Sylvain Gugger authored Dec 21, 2021

* Convert docstrings of all configurations and tokenizers

* Processors and fixes

* Last modeling files and fixes to models

* Pipeline modules

* Utils files

* Data submodule

* All the other files

* Style

* Missing examples

* Style again

* Fix copies

* Say bye bye to rst docstrings forever

27b3031d

[doc porting] several docs (#14858) · 18587639

Stas Bekman authored Dec 21, 2021



* [doc porting] 2 docs

* [doc porting] 2 docs

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update docs/source/main_classes/deepspeed.mdx

* cleanup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

18587639

[examples/summarization] deal with None in data records (#14816) · 033c3ed9
Stas Bekman authored Dec 21, 2021
```
* [examples/summarization] deal with None in data records

* rewrite to use a simpler (slower) variant
```
033c3ed9

Replace commit sha by commit url for update jobs (#14852) · c075fb78

Sylvain Gugger authored Dec 21, 2021



* Replace commit sha by commit url for update jobs

* Typo

* Update .github/workflows/build_documentation.yml
Co-authored-by: Julien Chaumond <julien@huggingface.co>

* Apply review comments
Co-authored-by: Julien Chaumond <julien@huggingface.co>

c075fb78

Add custom `stopping_criteria` and `logits_processor` to `generate` (#14779) · 5722d058

Leandro von Werra authored Dec 21, 2021



* add custom `stopping_criteria` and `logits_processor` to `generate`

* add tests for custom `stopping_criteria` and `logits_processor`

* fix typo in RAG

* address reviewer comments

* improve custom logits processor/stopping criteria error message

* fix types in merge function signature

* change default for custom list from `None` to empty list

* fix rag generate

* add string split suggestion
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5722d058

Fix the value error typo of AdamW's betas' valid values checking (#14780) · 00620583
Zed authored Dec 21, 2021
```
* Fix the value error typo of AdamW's betas value check

* error fixed
```
00620583
[ASR example] Improve example + add more examples (#14848) · 7ae6f070
Patrick von Platen authored Dec 21, 2021
```
* up

* load up

* up
```
7ae6f070
Only create the model card on process 0 (#14857) · 97ec17f7
Sylvain Gugger authored Dec 21, 2021

97ec17f7
[Bart] better error message (#14854) · b513ec8b
Patrick von Platen authored Dec 21, 2021

b513ec8b

Convert docstrings of modeling files (#14850) · 7af80f66

Sylvain Gugger authored Dec 21, 2021

* Convert file_utils docstrings to Markdown

* Test on BERT

* Return block indent

* Temporarily disable doc styler

* Remove from quality checks as well

* Remove doc styler mess

* Remove check from circleCI

* Fix typo

* Convert file_utils docstrings to Markdown

* Test on BERT

* Return block indent

* Temporarily disable doc styler

* Remove from quality checks as well

* Remove doc styler mess

* Remove check from circleCI

* Fix typo

* Let's go on all other model files

* Add templates too

* Styling and quality

7af80f66

Make the onnx submodule init lazy (#14855) · 2a337346
Sylvain Gugger authored Dec 21, 2021
```
* Use lazy init for onnx submodule

* Remove debug statements
```
2a337346
[logging] implement warning_advice / TRANSFORMERS_NO_ADVISORY_WARNINGS (#14669) · b6ec9569
Stas Bekman authored Dec 20, 2021
```
* [logging] implement warning_advice / TRANSFORMERS_NO_ADVISORY_WARNINGS

* reword
```
b6ec9569