- 14 Dec, 2021 2 commits
-
-
Nicolas Patry authored
* Adding support for multiple mask tokens. - Original implementation: https://github.com/huggingface/transformers/pull/10222 Co-authored-by: njafer <naveen.jafer@oracle.com> * In order to accommodate optionally multimodal models like Perceiver, we add information to the tasks to specify where we know for sure if we need the tokenizer/feature_extractor or not. * Adding info in the documentation about multi masks, marked as experimental. * Add a copy() to prevent overriding the same tensor over and over. * Fixup. * Adding small test for multi mask with real values. Co-authored-by: njafer <naveen.jafer@oracle.com>
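For context, a quick usage sketch of the multi-mask behavior described above (the model name is an assumption; the feature is marked experimental):

    from transformers import pipeline

    # Hedged sketch: with this change, fill-mask accepts several mask tokens
    # and returns one list of candidate fills per mask position.
    fill_mask = pipeline("fill-mask", model="distilroberta-base")
    results = fill_mask("Paris is the <mask> of <mask>.")
    for mask_results in results:
        print([r["token_str"] for r in mask_results])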
-
Nicolas Patry authored
* Adding some slow tests to check for Perceiver at least from a high level. * Re-enabling fast tests for Perceiver ImageClassification. * Perceiver might try to run some text-only pipelines without a Tokenizer (the Fast one doesn't exist) and with a FeatureExtractor. * Oops. * Adding a comment for `update_config_with_model_class`. * Remove `model_architecture` to get `tiny_config`. * Finalize rebase. * Smarter way to handle undefined FastTokenizer. * Remove old code. * Addressing some nits. * Don't instantiate `None`.
-
- 13 Dec, 2021 7 commits
-
-
NielsRogge authored
* First draft * Improve docstring + clean up tests * Remove unused code * Add check in case one doesn't provide a preprocessor
-
Yih-Dar authored
* avoid tf.tile in embeddings * remove more tf.tile in embeddings * clean Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
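The idea behind removing tf.tile in embeddings, as a minimal sketch (not the exact diff): a tensor with a leading dimension of 1 broadcasts across the batch, so explicit tiling is redundant.

    import tensorflow as tf

    batch_size, seq_len, hidden = 2, 5, 8
    position_embeds = tf.random.normal((1, seq_len, hidden))
    inputs_embeds = tf.random.normal((batch_size, seq_len, hidden))

    # Before: tile position embeddings across the batch dimension explicitly.
    old = inputs_embeds + tf.tile(position_embeds, (batch_size, 1, 1))
    # After: rely on broadcasting over the size-1 leading dimension.
    new = inputs_embeds + position_embeds

    assert tf.reduce_all(old == new)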
-
Yih-Dar authored
* Fix doc examples: cannot import name * remove copy because of some necessary minor changes (maybe add copy to the individual methods instead) * Keep copy with some modifications Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Suzen Fylke authored
-
Lysandre Debut authored
- Do not run image-classification pipeline (_CHECKPOINT_FOR_DOC uses the checkpoint for language, which cannot load a FeatureExtractor, so the current logic fails). - Add a safeguard to not run tests when `tokenizer_class` or `feature_extractor_class` **are** defined, but cannot be loaded. This happens for Perceiver with the "FastTokenizer" (which doesn't exist, so it is None) and the FeatureExtractor (which does exist but cannot be loaded because the checkpoint doesn't define one, which is reasonable for said checkpoint). - Added a `get_vocab` function to `PerceiverTokenizer` since it is used by the `fill-mask` pipeline when the `targets` argument is used to narrow a subset of possible values. Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
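A sketch of what such a `get_vocab` typically looks like on a tokenizer like this (not the exact implementation; it assumes the standard `PreTrainedTokenizer` members):

    def get_vocab(self):
        # Map every token string to its id, then include user-added tokens,
        # so fill-mask can resolve `targets` strings back to ids.
        vocab = {self._convert_id_to_token(i): i for i in range(self.vocab_size)}
        vocab.update(self.added_tokens_encoder)
        return vocab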
-
NielsRogge authored
* Migrate docs to mdx * Update TAPAS docs * Remove lines * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Apply some more suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add pt/tf switch to code examples * More improvements * Improve docstrings * More improvements Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 12 Dec, 2021 1 commit
-
-
Patrick von Platen authored
* correct changes * add comment
-
- 11 Dec, 2021 1 commit
-
-
Nicolas Patry authored
* Fixing tests for perceiver (texts) * For MaskedLM
-
- 10 Dec, 2021 4 commits
-
-
Yih-Dar authored
* Fix doc examples: unexpected keyword argument * Don't delete token_type_ids from inputs Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Nicolas Patry authored
-
Yih-Dar authored
Fix examples: 'CausalLMOutputWithCrossAttentions' object has no attribute 'last_hidden_state' (#14678) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
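The class of doc error fixed here, sketched: LM-head outputs such as `CausalLMOutputWithCrossAttentions` expose `logits`; `last_hidden_state` lives on the base model's output.

    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")
    outputs = model(**tokenizer("hello", return_tensors="pt"))
    print(outputs.logits.shape)      # fine: LM-head outputs carry logits
    # outputs.last_hidden_state      # AttributeError -- the bug the doc examples hit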
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 09 Dec, 2021 3 commits
-
-
Lysandre authored
-
Lysandre authored
-
Philipp Schmid authored
* Add the `str` hub token to the repository when provided, else fall back to the default of `True` * make style
-
- 08 Dec, 2021 14 commits
-
-
Yih-Dar authored
* Fix doc examples: name '...' is not defined * remove >>> and ... in some docstrings in visual_bert Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Move pyctcdecode dep * Fix doc and last objects * Quality * Style * Ignore this black
-
Stas Bekman authored
-
Stas Bekman authored
* [bf16 support] tweaks * corrections Co-authored-by: Manuel R. Ciosici <manuelrciosici@gmail.com>
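For orientation, the core of bf16 mixed precision in PyTorch, as a minimal standalone sketch (not the Trainer's exact code):

    import torch

    model = torch.nn.Linear(8, 8)
    x = torch.randn(2, 8)
    # Run the forward pass with activations autocast to bfloat16.
    with torch.autocast(device_type="cpu", dtype=torch.bfloat16):
        y = model(x)
    print(y.dtype)  # torch.bfloat16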
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Fixes in init * Style
-
Dhruv Nair authored
* change args to address overwriting issue * remove project name from args * remove passing args as kwargs to experiment object * remove passing args as kwargs to offline experiment * fix offline directory assignment in experiment kwargs * log checkpoint folder on training end * log entire output_dir as asset folder * log asset folder recursively * end experiment at the end of training * clean up * clean up * Default to always log training assets to Comet when using CometCallback * change logging training assets to be true when running callback setup * fix so that experiment always ends when training ends * styling and quality fixes * update docstring for COMET_LOG_ASSETS environment variable * run styling and quality checks * clean up to docstring * remove merge markers * change asset logging to false to avoid hitting max assets per experiment limit * update training asset description * fix styling
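Per the commit, asset logging ends up controlled by an environment variable; a usage sketch (the value semantics are assumed from the message):

    import os

    # Opt in to uploading training assets (checkpoint folder, output_dir) to Comet;
    # the default was changed to off to avoid the per-experiment asset limit.
    os.environ["COMET_LOG_ASSETS"] = "TRUE"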
-
Michael Benayoun authored
* Added support for other features for already supported models * Partial support for causal and seq2seq models * Partial support for causal and seq2seq models * OnnxSeq2SeqConfigWithPast to support seq2seq models * Parameterized the onnx tests * Restored run_mlm.py * Restored run_mlm.py * [WIP] BART update * BART and MBART * Added comments * Another sequence length of the past_key_values
-
Patrick von Platen authored
* [AutoProcessor] Add Wav2Vec2WithLM & small fix * revert line removal * Update src/transformers/__init__.py * add test * up * up * small fix
-
ZOHETH authored
tf.matrix_band_part -> tf.linalg.band_part
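Both names compute the same banded matrix; the old one is the TF1-era alias removed in TF 2.x. A quick sketch:

    import tensorflow as tf

    x = tf.ones((4, 4))
    # Keep everything on and below the diagonal (num_lower=-1, num_upper=0).
    lower = tf.linalg.band_part(x, -1, 0)  # formerly tf.matrix_band_part(x, -1, 0)
    print(lower.numpy())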
-
NielsRogge authored
* First draft * Style and remove mlm * Make forward pass work * More improvements * More improvements * Fix bug * More improvements * More improvements * Add PerceiverTokenizer first draft * Improve conversion script * More improvements * Make conversion script work for the encoder * Make conversion script work with local pickle files * Style & quality, fix-copies * Add dummy input to conversion script * Add absolute position embeddings to TextPreProcessor * Make forward pass of encoder work * More improvements * Move text preprocessor to separate script * More improvements * More improvements * Add post processor * Make MLM model work * Style * Add PerceiverForMaskedLM * Add PerceiverImagePreprocessor * Make style * Make PerceiverForImageClassification work * More improvements * More improvements * Use tokenizer in conversion script * Use PerceiverForMaskedLM in conversion script * Define custom PerceiverModelOutput * Improve PerceiverAttention to make it work for both MLM and image classification * More improvements * More improvements * More improvements to the conversion script * Make conversion script work for both MLM and image classification * Add PerceiverFeatureExtractor * More improvements * Style and quality * Add center cropping * Fix bug * Small fix * Add print statement * Fix bug in image preprocessor * Fix bug with conversion script * Make output position embeddings an nn.Parameter layer instead of nn.Embedding * Comment out print statements * Add position encoding classes * More improvements * Use position_encoding_kwargs * Add PerceiverForImageClassificationFourier * Make style & quality * Add PerceiverForImageClassificationConvProcessing * Style & quality * Add flow model * Move processors to modeling file * Make position encodings modular * Make basic decoder use modular position encodings * Add PerceiverForOpticalFlow to conversion script * Add AudioPreprocessor * Make it possible for the basic decoder to use Fourier position embeddings * Add PerceiverForMultimodalAutoencoding * Improve model for optical flow * Improve _build_network_inputs method * Add print statement * Fix device issue * Fix device of Fourier embeddings * Add print statements for debugging * Add another print statement * Add another print statement * Add another print statement * Add another print statement * Improve PerceiverAudioPreprocessor * Improve conversion script for multimodal model * More improvements * More improvements * Improve multimodal model * Make forward pass multimodal model work * More improvements * Improve tests * Fix some more tests * Add output dataclasses * Make more tests pass * Add print statements for debugging * Add tests for image classification * Add PerceiverClassifierOutput * More improvements * Make more tests pass for the optical flow model * Make style & quality * Small improvements * Don't support training for optical flow model for now * Fix _prepare_for_class for tests * Make more tests pass, add some docs * Add multimodal model to tests * Minor fixes * Fix tests * Improve conversion script * Make fixup * Remove pos_dim argument * Fix device issue * Potential fix for OOM * Revert previous commit * Fix test_initialization * Add print statements for debugging * Fix print statement * Add print statement * Add print statement * Add print statement * Add print statement * Add print statement * Add print statement * Remove need for output_shape * Comment out output_shape * Remove unnecessary code * Improve docs * Fix make fixup * Remove PerceiverTextProcessor from init * Improve docs * Small improvement * Apply first batch of suggestions from code review * Apply more suggestions from code review * Update docstrings * Define dicts beforehand for readability * Rename task to architecture in conversion script, include PerceiverModel in tests * Add print statements for debugging * Fix tests on GPU * Remove preprocessors, postprocessors and decoders from main init * Add integration test * Fix docs * Replace einops by torch * Update for new docs frontend * Rename PerceiverForImageClassification * Improve docs * Improve docs * Improve docs of PerceiverModel * Fix some more tests * Improve center_crop * Add PerceiverForSequenceClassification * Small improvements * Fix tests * Add integration test for optical flow model * Clean up * Add tests for tokenizer * Fix tokenizer by adding special tokens properly * Fix CI
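A minimal usage sketch for the model this commit introduces (the checkpoint name is an assumption):

    import torch
    from transformers import PerceiverForMaskedLM, PerceiverTokenizer

    tokenizer = PerceiverTokenizer.from_pretrained("deepmind/language-perceiver")
    model = PerceiverForMaskedLM.from_pretrained("deepmind/language-perceiver")
    inputs = tokenizer("Hello, Perceiver.", return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits  # one prediction per input character
    print(logits.shape)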
-
Patrick von Platen authored
* up * up * up * make it cleaner * correct * make style * add more tests * finish * small fix * make style * up * try to solve circle ci * up * fix more tests * fix more tests * apply Sylvain's suggestions * fix import * correct docs * add pyctcdecode only to speech tests * fix more tests * add tf, flax and pt tests * add pt * fix last tests * fix more tests * Apply suggestions from code review * change lines * Apply suggestions from code review Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * correct tests * correct tests * add doc string Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
-
Nicolas Patry authored
* Fixing Dataset for TQA + token-classification. * Fixing the tests. * Making sure `offset_mappings` is a valid argument.
-
- 07 Dec, 2021 5 commits
-
-
Stas Bekman authored
* [trainer] conditional ctx managers into one wrapper * workaround for contextlib.nullcontext for py<3.7 * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * one more autocast * style Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
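The py<3.7 workaround mentioned here usually takes this shape, as a sketch: `contextlib.nullcontext` only exists on Python 3.7+, so older interpreters get a tiny stand-in.

    import contextlib
    import sys

    if sys.version_info >= (3, 7):
        nullcontext = contextlib.nullcontext
    else:
        @contextlib.contextmanager
        def nullcontext(enter_result=None):
            # No-op context manager: enter, yield, exit.
            yield enter_result

    # This lets conditional context managers collapse into one wrapper, e.g.:
    # ctx = autocast() if use_amp else nullcontext()
    with nullcontext():
        pass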
-
TranSirius authored
Fix a bug in trainer_seq2seq.py: in the else branch at Line 172, generation_inputs should be a dict (#14546) * fix bug, trainer_seq2seq.py, Line 172, generation_inputs must be a dict before feeding into self.model.generate()
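The shape of the fix, sketched with hypothetical variable names (not the verbatim patch):

    # If only a tensor was passed through, wrap it under the model's input
    # name so it can be splatted into generate() as keyword arguments.
    if not isinstance(generation_inputs, dict):
        generation_inputs = {"input_ids": generation_inputs}
    generated_tokens = self.model.generate(**generation_inputs, **gen_kwargs)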
-
Nouamane Tazi authored
* quick fix SummarizationPipeline error messages Fix error messages to avoid spam errors, and errors of type: `Your max_length is set to 50, but you input_length is only 46. You might consider decreasing max_length manually, e.g. summarizer('...', max_length=50)` * correct SummarizationPipeline error messages fixes
-
Stas Bekman authored
* [deepspeed] fix load_best_model_at_end * try with pull_request_target * revert: try with pull_request_target * style * add test * cleanup
-
Ryokan RI authored
* implement MLukeTokenizer and LukeForMaskedLM * update tests * update docs * add LukeForMaskedLM to check_repo.py * update README * fix test and specify the entity pad id in tokenization_(m)luke * fix EntityPredictionHeadTransform
-
- 06 Dec, 2021 3 commits
-
-
Yih-Dar authored
* add cross_attention_hidden_size to text-2-text encoder-decoder models (PT/Flax) * for TFEncoderDecoderModel * add equivalence test for TFEncoderDecoderModel * fix * fix failed equivalence tests * remove unused import * add detailed comment * Fix check_equivalence_tf_to_pt by using encoder/decoder * cleaning * Use cross_attention_hidden_size in speech-to-text * clean fast init logging msg in encoder decoder models * increase tol from 1e-5 to 1e-3 for tf test * style * style * make sure projection layer can run * remove type conversion + add check * fix conflict (config.output_hidden_size) * Remove TF -> PT in check_pt_tf_equivalence for TFEncoderDecoderModel Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
-
Lysandre Debut authored
* Add AutoProcessor class Init and tests Add doc Fix init Update src/transformers/models/auto/processing_auto.py Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Reverts to tokenizer or feature extractor when available Adapt test * Revert "Adapt test" This reverts commit bbdde5fab02465f24b54b227390073082cb32093. * Revert "Reverts to tokenizer or feature extractor when available" This reverts commit 77659ff5d21b6cc0baf6f443017e35e056a525bb. * Don't revert everything Lysandre! Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>
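A quick usage sketch for the class added here (the checkpoint name is an assumption):

    from transformers import AutoProcessor

    # AutoProcessor resolves the right preprocessing class for a checkpoint;
    # for wav2vec2 this yields a Wav2Vec2Processor (tokenizer + feature extractor).
    processor = AutoProcessor.from_pretrained("facebook/wav2vec2-base-960h")
    print(type(processor).__name__)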
-