Commits · 87e6e4fe5c7e65cb69e70306f22de6daf16b6e14 · chenpangpang / transformers

27 Dec, 2021 5 commits

Sylvain Gugger authored Dec 27, 2021

* New doc styler

* Fix issue with args at the start

* Code sample fixes

* Style code examples in MDX

* Fix more patterns

* Typo

* Typo

* More patterns

* Do without black for now

* Get more info in error

* Docstring style

* Re-enable check

* Quality

* Fix add_end_docstring decorator

* Fix docstring

87e6e4fe

Fix duplicate call to save_checkpoint when using deepspeed (#14946) · c1138273

Mihai Balint authored Dec 27, 2021

* Fix duplicate call to save_checkpoint when using deepspeed / stage3_gather_fp16_weights_on_model_save

* Revert "Fix duplicate call to save_checkpoint when using deepspeed / stage3_gather_fp16_weights_on_model_save"

This reverts commit 6a3dec0397723a8417351dc38fdebf14ab17756c.

* Delete correct duplicate invocation of deepspeed save_checkpoint

c1138273

fix to issue #14833 in data_collator - consider no labels (#14930) · 03885a3f
Ayal Klein authored Dec 27, 2021

03885a3f

Add `ElectraForCausalLM` -> Enable Electra encoder-decoder model (#14729) · 501307b5

Daniel Stancl authored Dec 27, 2021

* Add ElectraForCausalLM and cover some basic tests & need to fix a few tests

* Fix bugs

* make style

* make fix-copies

* Update doc

* Change docstring to markdown format

* Remove redundant update_keys_to_ignore

501307b5

ChunkPipeline (batch_size enabled on `zero-cls` and `qa` pipelines. (#14225) · b058490c

Nicolas Patry authored Dec 27, 2021



* Pipeline chunks.

* Batching for Chunking pipelines ?

* Batching for `question-answering` and `zero-shot-cls`.

* Fixing for FNet.

* Making ASR a chunk pipeline.

* Chunking ASR API.

* doc style.

* Fixing ASR test.

* Fixing QA eror (p_mask, padding is 1, not 0).

* Enable both vad and simple chunking.

* Max length for vad.

* remove inference mode, crashing on s2t.

* Revert ChunkPipeline for ASRpipeline.

Too many knobs for simple integration within the pipeline, better stick
to external convenience functions instead, more control to be had,
simpler pipeline and also easier to replace with other things later.

* Drop necessity for PT for these.

* Enabling generators.

* Add mic + cleanup.

* Typo.

* Typo2.

* Remove ASR work, it does not belong in this PR anymore.

* Update src/transformers/pipelines/pt_utils.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Update src/transformers/pipelines/zero_shot_classification.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

* Adding many comments.

* Doc quality.

* `hidden_states` handling.

* Adding doc.

* Bad rebase.

* Autofixing docs.

* Fixing CRITICAL bug in the new Zerocls pipeline.
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

b058490c

24 Dec, 2021 1 commit
- Fix Perceiver docs (#14917) · 705ca7f2
  Qing authored Dec 24, 2021
  
  705ca7f2
23 Dec, 2021 10 commits

[WavLM] fix wavlm docs (#14910) · 11682990
Patrick von Platen authored Dec 23, 2021

11682990

Better logic for getting tokenizer config in AutoTokenizer (#14906) · 676643c6

Sylvain Gugger authored Dec 23, 2021

* Better logic for getting tokenizer config in AutoTokenizer

* Remove needless import

* Remove debug statement

* Address review comments

676643c6

[Generate] Remove attention_mask and integrate model_main_input_name (#14856) · fe4197ab
Patrick von Platen authored Dec 23, 2021
```
* up

* save

* correct

* up

* correct more

* up

* up

* up

* up

* up

* correct

* fix tf

* fix

* remove tokenizer
```
fe4197ab

[doc] post-porting (#14890) · 86b40073

Stas Bekman authored Dec 23, 2021

found a few oddities:

1. https://huggingface.co/docs/transformers/main_classes/logging#transformers.utils.logging.enable_explicit_format
has a :: - this PR fixes it

2.  this looks borked too:
https://huggingface.co/docs/transformers/main_classes/logging#transformers.utils.logging.set_verbosity
 has a <

but I'm not sure where this one is coming from

86b40073

[AutoTokenizer] Fix incorrect from pretrained (#14900) · ef47d4f8
Patrick von Platen authored Dec 23, 2021

ef47d4f8

Add TFCLIPModel (#13967) · 8f2cc1c3

Yih-Dar authored Dec 23, 2021



* Start the work for TFCLIPModel

* Convert to TF code (TODO: loss + doc)

* Clean up

* Fix pooled_output for TFCLIPTextTransformer - using tf.gather_nd

* assert -> raise error

* Expose TFCLIPModel

* Deal with dummy_inputs

* Add tests

* Fix all tests. TODO: manual check weight loading + add more comments

* Fix pt tf equivalence test

* fixes

* update TFCLIPVisionEmbeddings's Conv2D

* Fix loss + overwrite test_pt_tf_model_equivalence from common

* Add a comment about the change about MainLayer in test_keras_save_load

* Set return_loss=True in TFCLIPModelTester + make tests pass

* overwrite test_pt_tf_model_equivalence from tf common

* fix base_model_prefix

* Fix examples

* remove unused

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* apply review suggestions

* change self.pre_layrnorm to self.pre_layernorm

* apply more review suggestions

* return attention probs before dropout (to align with PT)

* fix weight init

* fix

* build doc

* fix missing doc

* fix for test
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

8f2cc1c3

Set `run_name` in MLflowCallback (#14894) · 2d30443c
Yang Dong authored Dec 23, 2021
```
* Set run_name in MLflowCallback

* Update the docs for `run_name` argument
```
2d30443c

Add ONNX support for MarianMT models (#14586) · 6b655cc6

lewtun authored Dec 23, 2021

* First commit to add MarianMT to ONNX

* Now MarianModel.forward() automatically generates decoder_input_ids, like BartModel.forward()

* Adjusted MarianOnnxConfig.inputs and outputs to work with seq2seq-lm feature

* Style fix

* Added support for other features for already supported models

* Partial support for causal and seq2seq models

* Partial support for causal and seq2seq models

* Add default task for MarianMT ONNX

* Remove automatic creation of decoder_input_ids

* Extend inputs and outputs for MarianMT ONNX config

* Add MarianMT to ONNX unit tests

* Refactor

* OnnxSeq2SeqConfigWithPast to support seq2seq models

* Parameterized the onnx tests

* Restored run_mlm.py

* Restored run_mlm.py

* [WIP] BART update

* BART and MBART

* Add past_key_values and fix dummy decoder inputs

Using a sequence length of 1 in generate_dummy_outputs() produces large discrepancies, presumably due to some hidden optimisations.

* Refactor MarianOnnxConfig to remove custom past_key_values logic

* Fix quality

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Refactor Marian export to account for base changes

* Fix copies

* Implemented suggestions

* Extend support for causal LM

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Implemented suggestions

* Fixed __init__ to resolve conflict with master

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5

.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Implemented suggestions

* Fixed __init__ to resolve conflict with master

* Remove commented import

* Remove ONNX model

* Remove redundant class method

* Tidy up imports

* Fix quality

* Refactor dummy input function

* Add copied from statements to Marian config functions

* Remove false copied from comments

* Fix copy from comment
Co-authored-by: Massimiliano Bruni <massimiliano.bruni@hcl.com>
Co-authored-by: Michael Benayoun <mickbenayoun@gmail.com>

6b655cc6

Fix AttributeError from PreTrainedTokenizerFast.decoder (#14691) · d8c09c65
Alex Hedges authored Dec 23, 2021

d8c09c65

Fix doc examples: ... takes no keyword arguments (#14701) · 42105795

Yih-Dar authored Dec 23, 2021



* Fix doc examples: ... takes no keyword arguments

* fix copies
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

42105795

22 Dec, 2021 11 commits

Convert rst files (#14888) · 207594be

Sylvain Gugger authored Dec 22, 2021

* Convert all tutorials and guides

* Convert all remaining rst to mdx

* Track and fix bad links

207594be

Keras metric callback (#14867) · b0c7d2ec

Matt authored Dec 22, 2021



* Working on splitting out labels

* First working version

* Fixed concatenation of outputs and labels

* val_dataset -> eval_dataset

* Only pass input arrays in tokenizer.model_input_names

* Only pass input arrays in tokenizer.model_input_names

* Only remove unexpected keys when predict_with_generate is True

* Adding proper docstring

* Adding example to docstring

* Add a proper ROUGE metric example

* Add a proper ROUGE metric example

* Add version checking

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/keras_callbacks.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Remove requirement for tokenizer with predict_with_generate
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

b0c7d2ec

Docs for v4.16.0dev0 · fa39ff9f
Patrick von Platen authored Dec 22, 2021

fa39ff9f
Release: v4.15.0 · 05fa1a7a
Patrick von Platen authored Dec 22, 2021

05fa1a7a
Properly indent return block (#14887) · 87a033d9
Sylvain Gugger authored Dec 22, 2021

87a033d9

Onnx enable tasks for supported models (part 2) (#14700) · 13504dcb

Michael Benayoun authored Dec 22, 2021

* Revert "Revert "Added support for other features for already supported models (#14358)" (#14679)"

This reverts commit 0f4e39c5.

* is_torch_available test to avoid failing imports

* sorting parameterize parameters to solve ERROR gw0 gw1

* tests fix

* tests fix

* GPT2 with past fix

* Fixed stateful class attribute change that was breaking things when converting multiple models sequentially

* Removed onnx file

* Implemented suggestions

* Fixed __init__ to resolve conflict with master

* Remove commented import

13504dcb

Fix Perceiver docs (#14879) · 7df4b90c
NielsRogge authored Dec 22, 2021

7df4b90c
IterableDatasetShard should use per device batch size instead of real batch size (#14714) · 17efc806
charon____ authored Dec 22, 2021

17efc806

Updated deberta attention (#14625) · 2a56edb3

guillaume-be authored Dec 22, 2021



* Removed unused p2p attention handling

* Updated DeBERTa configuration

* Updated TF DeBERTa attention

* Rolled back accidental comment deletion
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>

2a56edb3

Feature/fix slow test in mluke (#14749) · 824fd44f

Ryokan RI authored Dec 22, 2021

* make MLukeTokenizerTest fast

* make LukeTokenizerTest fast

* add entry to _toctree.yaml

824fd44f

update the arguments `add_prefix_space` and `trim_offsets` in... · c94c1b89

SaulLu authored Dec 22, 2021

update the arguments `add_prefix_space` and `trim_offsets` in `backend_tokenizer.post_processor` of `RobertaTokenizerFast` (#14752)

* add tests

* change post-processor, pre-tokenizer and decoder (can't update decoder)

* update test (remove decoder which doesn't depend on trim and add_prefix)

* just update the post_processor

* fix change

* `trim_offsets` has no influence on `pre_tokenizer`

* remove a test that need some input from the `tokenizers` lib maintainers

* format

* add new test offsets roberta

* polish comments

c94c1b89

21 Dec, 2021 12 commits

Fix doc mistakes (#14874) · d0422de5
Sylvain Gugger authored Dec 21, 2021
```
* Remove double returns

* Last fixes

* Quality

* Last fix for Lxmert
```
d0422de5
Fix `FlaxMarianMTModel` return block. (#14873) · e846a56c
Sylvain Gugger authored Dec 21, 2021
```
* Fixes in marian doc

* Another time

* Add return block in FlaxMarianMTModel
```
e846a56c
Fixes in marian doc (#14872) · a6b7b47a
Sylvain Gugger authored Dec 21, 2021
```
* Fixes in marian doc

* Another time
```
a6b7b47a
Fix FLAX_MULTIPLE_CHOICE_SAMPLE typo (#14871) · eec9c8bb
Mishig Davaadorj authored Dec 21, 2021

eec9c8bb

Mass conversion of documentation from rst to Markdown (#14866) · 27b3031d

Sylvain Gugger authored Dec 21, 2021

* Convert docstrings of all configurations and tokenizers

* Processors and fixes

* Last modeling files and fixes to models

* Pipeline modules

* Utils files

* Data submodule

* All the other files

* Style

* Missing examples

* Style again

* Fix copies

* Say bye bye to rst docstrings forever

27b3031d

Add custom `stopping_criteria` and `logits_processor` to `generate` (#14779) · 5722d058

Leandro von Werra authored Dec 21, 2021



* add custom `stopping_criteria` and `logits_processor` to `generate`

* add tests for custom `stopping_criteria` and `logits_processor`

* fix typo in RAG

* address reviewer comments

* improve custom logits processor/stopping criteria error message

* fix types in merge function signature

* change default for custom list from `None` to empty list

* fix rag generate

* add string split suggestion
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5722d058

Fix the value error typo of AdamW's betas' valid values checking (#14780) · 00620583
Zed authored Dec 21, 2021
```
* Fix the value error typo of AdamW's betas value check

* error fixed
```
00620583
Only create the model card on process 0 (#14857) · 97ec17f7
Sylvain Gugger authored Dec 21, 2021

97ec17f7
[Bart] better error message (#14854) · b513ec8b
Patrick von Platen authored Dec 21, 2021

b513ec8b

Convert docstrings of modeling files (#14850) · 7af80f66

Sylvain Gugger authored Dec 21, 2021

* Convert file_utils docstrings to Markdown

* Test on BERT

* Return block indent

* Temporarily disable doc styler

* Remove from quality checks as well

* Remove doc styler mess

* Remove check from circleCI

* Fix typo

* Convert file_utils docstrings to Markdown

* Test on BERT

* Return block indent

* Temporarily disable doc styler

* Remove from quality checks as well

* Remove doc styler mess

* Remove check from circleCI

* Fix typo

* Let's go on all other model files

* Add templates too

* Styling and quality

7af80f66

Make the onnx submodule init lazy (#14855) · 2a337346
Sylvain Gugger authored Dec 21, 2021
```
* Use lazy init for onnx submodule

* Remove debug statements
```
2a337346
[logging] implement warning_advice / TRANSFORMERS_NO_ADVISORY_WARNINGS (#14669) · b6ec9569
Stas Bekman authored Dec 20, 2021
```
* [logging] implement warning_advice / TRANSFORMERS_NO_ADVISORY_WARNINGS

* reword
```
b6ec9569

20 Dec, 2021 1 commit

Add a main_input_name attribute to all models (#14803) · 33f36c86

Sylvain Gugger authored Dec 20, 2021



* Add a main_input_name attribute to all models

* Fix tests

* Wtf Vs Code?

* Update src/transformers/models/imagegpt/modeling_imagegpt.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Style

* Fix copies
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

33f36c86