- 02 Mar, 2023 5 commits
-
-
Nicolas Patry authored
* [WIP] Whisper refactor to support language output. * Handling merges. * A bit more cleanup and comments. * Many improvements. Lots of details everywhere. * Cleanup old code and tests. * Handle lone timestamp tokens (just recover when something bad happens). * Adding return_language example. * No ffmpeg. * Hmm. * Some corrections. * Both fast and slow. * New black. * Update src/transformers/models/whisper/tokenization_whisper.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Update src/transformers/models/whisper/tokenization_whisper.py Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Remove print. * Undoing tests modifications. * Smaller test modifications. * Rename. * Remove maxDiff. --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
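For context, a minimal sketch of how the new `return_language` output might be requested through the ASR pipeline; the checkpoint name, audio file, and output keys are assumptions for illustration, not details taken from this commit.

```python
from transformers import pipeline

# Sketch only: assumes the ASR pipeline exposes the new `return_language` flag;
# checkpoint, audio file, and output keys are illustrative assumptions.
asr = pipeline("automatic-speech-recognition", model="openai/whisper-small")
result = asr("sample.flac", return_timestamps=True, return_language=True)
for chunk in result.get("chunks", []):
    print(chunk)  # each chunk is expected to carry the detected language
```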
-
Connor Henderson authored
* Make schedulers picklable by making lr_lambda fns global * add unused _get_constant_schedule_lr_lambda arg * remove unneeded _get_constant_schedule_lr_lamda * add test * make style * rebase, remove torch dep, put lambda back * repo-consistency and style
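Making the `lr_lambda` functions module-level (instead of local closures) is what lets the resulting `LambdaLR` schedulers survive pickling. A small sketch of the behaviour the new test presumably exercises; the model and optimizer here are placeholders:

```python
import pickle

import torch
from transformers import get_constant_schedule_with_warmup

model = torch.nn.Linear(4, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
scheduler = get_constant_schedule_with_warmup(optimizer, num_warmup_steps=10)

# With module-level lr_lambda functions, the scheduler can round-trip through pickle.
restored = pickle.loads(pickle.dumps(scheduler))
```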
-
Kian Sierra McGettigan authored
* decoder forward pass is working * no model has forward pass returning attentions * decoder ngram changed to not mix batch size * current basic forward pass returns identical result * passed test_model attentions * passed test_encoder_decoder_model_generate * passed test_headmasking * removed old block * removed comments bug/fixme * removed bug comments * applied styling * applied fix-copies * applied ngram forward comments * corrected dimension notation * applied styling and comment fixes * changed asserts for raise ValueError * changed question gen test * updated hidden_states integration test * applied styling
-
Sylvain Gugger authored
* Mark pipeline tests to skip them easily * Mark the mixin as pipeline test * Update src/transformers/testing_utils.py Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com> --------- Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>
-
Arthur authored
* add `zero_mean_unit_var_norm` function * normalize before MEL computation * fixup * add simple test * quality * Update tests/models/whisper/test_feature_extraction_whisper.py Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> * fixup * use attention masks if padding was applied * Update based on review Co-authored-by: bofeng huang <bofenghuang7@gmail.com> --------- Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: bofeng huang <bofenghuang7@gmail.com>
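For reference, a minimal sketch of what per-example zero-mean/unit-variance normalization with attention masks typically looks like; this is an illustrative re-implementation (mirroring the Wav2Vec2-style utility), not the exact code added to the Whisper feature extractor.

```python
import numpy as np

def zero_mean_unit_var_norm(input_values, attention_mask=None, padding_value=0.0):
    """Normalize each example to zero mean and unit variance, ignoring padded frames."""
    normalized = []
    for i, vector in enumerate(input_values):
        if attention_mask is not None:
            length = int(attention_mask[i].sum())
            mean, var = vector[:length].mean(), vector[:length].var()
            out = (vector - mean) / np.sqrt(var + 1e-7)
            out[length:] = padding_value
        else:
            out = (vector - vector.mean()) / np.sqrt(vector.var() + 1e-7)
        normalized.append(out)
    return normalized
```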
-
- 01 Mar, 2023 6 commits
-
-
Yih-Dar authored
* force on the same device * fix tests --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Alara Dirik authored
Adds the ALIGN model to transformers. ALIGN is introduced in "Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision" by Chao Jia, Yinfei Yang, Ye Xia, Yi-Ting Chen, Zarana Parekh, Hieu Pham, Quoc V. Le, Yunhsuan Sung, Zhen Li, Tom Duerig.
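A short usage sketch for the newly added model; the `kakaobrain/align-base` checkpoint and the CLIP-style dual-encoder API shown here are assumptions based on how similar models are exposed, not details from this log.

```python
import requests
from PIL import Image
from transformers import AlignProcessor, AlignModel

# Checkpoint name and API shape are assumptions for illustration.
processor = AlignProcessor.from_pretrained("kakaobrain/align-base")
model = AlignModel.from_pretrained("kakaobrain/align-base")

url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)
inputs = processor(text=["a photo of a cat", "a photo of a dog"], images=image, return_tensors="pt")
outputs = model(**inputs)
print(outputs.logits_per_image.softmax(dim=1))  # image-text similarity scores
```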
-
Matt authored
* Temporary commit to stash everything so far * Temporary commit to stash everything so far * stash commit * Refactor from_pretrained * Fix final test, make fixup * Update dummies * Add model to TEST_FILES_WITH_NO_COMMON_TESTS * Update src/transformers/models/vision_text_dual_encoder/modeling_tf_vision_text_dual_encoder.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/vision_text_dual_encoder/modeling_tf_vision_text_dual_encoder.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/vision_text_dual_encoder/modeling_tf_vision_text_dual_encoder.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Update src/transformers/models/vision_text_dual_encoder/modeling_tf_vision_text_dual_encoder.py Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * Add TFVisionTextDualEncoder to utils/documentation_tests.txt * make fixup --------- Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
-
Yih-Dar authored
* Add a utility file to get information from test files --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Arthur authored
* fix reshaping Fixes #21523 * add test * styling * last fixes * Update src/transformers/models/convbert/modeling_convbert.py * code quality
-
Arthur authored
prepare for "__floordiv__ is deprecated and its behavior will change in a future version of pytorch" (#20211) * rounding_mode = "floor" instead of // to prevent behavioral change * add other TODO * use `torch_int_div` from pytorch_utils * same for tests * fix copies * style * use relative imports when needed * Co-authored-by: sgugger <sylvain.gugger@gmail.com>
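The deprecation in question concerns `//` on integer tensors; a quick example of the replacement this commit standardizes on (the `torch_int_div` helper in `pytorch_utils` presumably wraps the same call):

```python
import torch

a = torch.tensor([7, 9, 13])
b = torch.tensor([2, 4, 5])

# Deprecated form for integer tensors:
#   a // b
# Replacement with identical floor behavior and no deprecation warning:
print(torch.div(a, b, rounding_mode="floor"))  # tensor([3, 2, 2])
```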
-
- 28 Feb, 2023 9 commits
-
-
Sylvain Gugger authored
* Fix flaky test for log level * Fix other flaky test
-
Matt authored
* First commit for the improved PT-TF weight loading * Remove workarounds from TFEncoderDecoder tests * Allow a custom weight renaming function in from_pretrained and use that to clean up EncoderDecoder * make fixup * First attempt at visionencoderdecoder * Disable tensorfloat32 in tests to get consistent outputs * Quick fix to tf_vision_encoder_decoder tests * make fixup * Update Blenderbot tests * Remove unused arg in modeling_tf_opt * load_tf_sharded_weights had strict=True! This meant transfer learning was impossible, so I'm setting it to False. * Support prefixes when loading sharded TF checkpoints * make fixup * Add test to load sharded models with a weight prefix * Fix sharded weight loading test * Add a test for transfer from a sharded checkpoint * make fixup * Add test to check that crossloading from PT with a prefix works * Refactor from_pretrained in the encoderdecoder classes * Refactor from_pretrained in the encoderdecoder classes * missmatched -> mismatched * Explicitly check for None * No comments showing my very impressive and attractive knowledge of Py3.9+ * Disable TF32 across all TF tests
-
Yih-Dar authored
* Add PipelineTesterMixin * remove class PipelineTestCaseMeta * move validate_test_components * Add for ViT * Add to SPECIAL_MODULE_TO_TEST_MAP * style and quality * Add feature-extraction * update * raise instead of skip * add tiny_model_summary.json * more explicit * skip tasks not in mapping * add availability check * Add Copyright * A way to disable irrelevant tests * update with main * remove disable_irrelevant_tests * skip tests * better skip message * better skip message * Add all pipeline task tests * revert * Import PipelineTesterMixin * subclass test classes with PipelineTesterMixin * Add pipeline_model_mapping * Fix import after adding pipeline_model_mapping * Fix style and quality after adding pipeline_model_mapping * Fix one more import after adding pipeline_model_mapping * Fix style and quality after adding pipeline_model_mapping * Fix test issues * Fix import requirements * Fix mapping for MobileViTModelTest * Update * Better skip message * pipeline_model_mapping could not be None * Remove some PipelineTesterMixin * Fix typo * revert tests_fetcher.py * update * rename * revert * Remove PipelineTestCaseMeta from ZeroShotAudioClassificationPipelineTests * style and quality * test fetcher for all pipeline/model tests --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Anahita Bhiwandiwalla authored
* Add loss for BridgeTowerForMaskedLM and BridgeTowerForImageAndTextRetrieval * minor fix return_dict * implement test for loss computation --------- Co-authored-by: Tiep Le <97980157+tileintel@users.noreply.github.com> Co-authored-by: Tiep Le <tiep.le@intel.com>
-
Younes Belkada authored
* fix blip multi gpu * fix * final changes * adapt suggestions * fix failing slow test * forward contrib credits from testing and suggestions * reformat --------- Co-authored-by: akkikiki <akkikiki@users.noreply.github.com>
-
raghavanone authored
* Fix the issue of blip model returning loss even when the label is not provided * Fix ruff failure * Incorporate PR feedback * Incorporate PR feedback * Incorporate PR feedback * Incorporate PR feedback
-
Younes Belkada authored
* add v1 * add `Blip2Model` - add relevant functions - add tests - add to auto mapping * fix docs * fix doctest
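A rough sketch of using the new `Blip2Model`; the checkpoint name and the `get_image_features` helper are assumptions based on the commit's mention of "relevant functions", not verified against the final API.

```python
from PIL import Image
from transformers import AutoProcessor, Blip2Model

# Checkpoint name and helper method are assumptions for illustration.
processor = AutoProcessor.from_pretrained("Salesforce/blip2-opt-2.7b")
model = Blip2Model.from_pretrained("Salesforce/blip2-opt-2.7b")

image = Image.open("photo.jpg")
inputs = processor(images=image, return_tensors="pt")
image_features = model.get_image_features(**inputs)
```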
-
Younes Belkada authored
* fix torchquant issue * add tests
-
Yih-Dar authored
Let's give TF a bit more love ❤️ 🙏 Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
- 27 Feb, 2023 4 commits
-
-
Joao Gante authored
-
Younes Belkada authored
* add `accelerate` marker * add to docs * Update docs/source/en/testing.mdx
-
Arthur authored
* uint8 -> bool * fix copies * style * update test modeling common when checking attention buffers * style * use logical not on random mask instead of subtraction with 1 * remove torch uint8 * quality * remove modified modeling utils * Update based on review Co-authored-by: sgugger <sylvain.gugger@gmail.com> --------- Co-authored-by: sgugger <sylvain.gugger@gmail.com>
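The change swaps uint8 attention/causal mask buffers for bool ones, so inverting a mask becomes a logical not rather than a subtraction from 1. A small illustration of the two idioms:

```python
import torch

mask = torch.rand(2, 5) > 0.5  # random boolean mask

# Old uint8-style inversion:
#   inverted = 1 - mask.to(torch.uint8)
# Bool-friendly equivalent:
inverted = ~mask  # torch.logical_not(mask) also works
scores = torch.zeros(2, 5).masked_fill(inverted, float("-inf"))
```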
-
Arthur authored
* add pipeline * update init * add zero shot to init * update inits and correct checkpoints * update base to support input features * add tests * Update src/transformers/pipelines/zero_shot_audio_classification.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Update src/transformers/pipelines/zero_shot_audio_classification.py Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * update pipeline code * use tiny checkpoint * nits and expected value with tiny model * style * last nit on tests values * fix styling * fix collate fn that was casting to float * update --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
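A usage sketch for the new pipeline; the CLAP checkpoint, the example dataset, and the output format are assumptions for illustration rather than details confirmed by this log.

```python
from datasets import load_dataset
from transformers import pipeline

# Checkpoint and dataset names are assumptions; any CLAP-style audio-text model should fit.
classifier = pipeline("zero-shot-audio-classification", model="laion/clap-htsat-unfused")

audio = load_dataset("ashraq/esc50", split="train[:1]")[0]["audio"]["array"]
print(classifier(audio, candidate_labels=["dog barking", "vacuum cleaner"]))
```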
-
- 24 Feb, 2023 6 commits
-
-
Sanchit Gandhi authored
-
Kashif Rasul authored
* updated expected * prediction_length fix * prediction_length default value * default prediction_length 24 * revert back prediction_length default * move prediction_length test
-
Arthur authored
* fix history * input_features instead of input ids for TFWhisper doctest * use translate instead of transcribe
-
bofeng huang authored
* Return and rescale attention_mask * Add SpecAugment to Whisper modeling * Fix test * Update docstring * Add SpecAug related parameters to model config * Add the _mask_input_features function to doc * Fix quality * Apply suggestions from code review Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> * Remove dev comments * Add test * Resolve conflict * feat: mask {feature, time} prob fast tests * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com> Co-authored-by: sanchit-gandhi <sanchit@huggingface.co> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
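A sketch of how the SpecAugment-style masking added here would typically be enabled through the config; the parameter names follow the usual `mask_time_prob`/`mask_feature_prob` convention and should be treated as assumptions rather than quotes from the commit.

```python
from transformers import WhisperConfig, WhisperForConditionalGeneration

# Parameter names follow the common SpecAugment convention; treat them as assumptions.
config = WhisperConfig(
    apply_spec_augment=True,
    mask_time_prob=0.05,     # probability of masking spans along the time axis
    mask_feature_prob=0.05,  # probability of masking spans along the feature axis
)
model = WhisperForConditionalGeneration(config)  # masking applies to input features during training
```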
-
Shubhamai authored
* [flax] adding support for batch norm layers * fixing bugs related to pt+flax integration * cleanup, batchnorm support in sharded pt to flax * support for batchnorm tests in pt+flax integration * simplifying checking batch norm layer
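For context on what PT/Flax batch-norm support involves: Flax keeps running statistics in a separate `batch_stats` collection that conversion code has to map alongside `params`. A generic Flax sketch (not transformers code):

```python
import jax
import jax.numpy as jnp
import flax.linen as nn

class Block(nn.Module):
    @nn.compact
    def __call__(self, x, train: bool = False):
        x = nn.Dense(16)(x)
        # Running mean/var live in the "batch_stats" collection, separate from "params".
        x = nn.BatchNorm(use_running_average=not train)(x)
        return nn.relu(x)

x = jnp.ones((2, 8))
variables = Block().init(jax.random.PRNGKey(0), x)  # {"params": ..., "batch_stats": ...}
y = Block().apply(variables, x)  # eval: uses running statistics
y, updates = Block().apply(variables, x, train=True, mutable=["batch_stats"])  # train: updates them
```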
-
Connor Henderson authored
* fix: Change is_last chunk calc and add conditional break * format fix * account for 0 and full stride_rights, add comment * add new test * make style * update slow whisper asr test timestamps * use nested_simplify on output and round timestamp to hundredths place
-
- 23 Feb, 2023 4 commits
-
-
Stas Bekman authored
* [deepspeed tests] fix issues introduced by #21700 * fix * fix
-
ydshieh authored
-
Joao Gante authored
-
Naga Sai Abhinay authored
* Add subfolder support * Add kwarg docstring * formatting fix * Add test
-
- 22 Feb, 2023 5 commits
-
-
Sanchit Gandhi authored
* [SpeechT5HifiGan] Handle batched inputs * fix docstring * rebase and new ruff style
-
Yih-Dar authored
* fix * skip test_model_parallelism * skip test_model_parallelism --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Sylvain Gugger authored
* Respect documentation on passive log level * Fix test and set log level in examples * Add doc
-
Aaron Gokaslan authored
-
Kashif Rasul authored
* added loc and scale outputs from scalers * fix typo * fix tests * fixed formatting * initial StdScaler * move scaling to optional str * calculate std feature for scalers * undid change as it does not help * added StdScaler with weights * added input projection layer and d_model hyperparam * use linear proj * add back layernorm_embedding * add sin-cos pos embeddings * updated scalers * formatting * fix type * fixed test * fix repeated_past_values calculation * fix when keepdim=false * fix default_scale * backward compatibility of scaling config * update integration test expected output * fix style * fix docs * use the actual num_static_real_features in feature_dim calculation * clarified docs * Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * prediction_length is not optional * fix for reviewer * Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * get rid of un-needed new lines * fix doc * remove unneeded new lines * fix style * static_categorical_features and static_real_features are optional * fix integration test * Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * fixing docs for multivariate setting * documentation for generate --------- Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
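A compact sketch of what a weighted standard scaler returning `loc` and `scale` looks like, in the spirit of the StdScaler described above; this is an illustrative re-implementation under assumed tensor shapes, not the class added to the model file.

```python
import torch

def std_scale(data: torch.Tensor, weights: torch.Tensor, dim: int = 1, eps: float = 1e-10):
    """Weighted mean/std over `dim`; `weights` marks observed values (1) vs. padding (0)."""
    denominator = weights.sum(dim, keepdim=True).clamp_min(1.0)
    loc = (data * weights).sum(dim, keepdim=True) / denominator
    variance = (((data - loc) * weights) ** 2).sum(dim, keepdim=True) / denominator
    scale = torch.sqrt(variance + eps)
    return (data - loc) / scale, loc, scale

past_values = torch.randn(4, 24)              # (batch, context length)
observed_mask = torch.ones_like(past_values)  # observed-values mask
scaled, loc, scale = std_scale(past_values, observed_mask)
```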
-
- 21 Feb, 2023 1 commit
-
-
Yih-Dar authored
* fix tvlt ci * fix tvlt ci * fix tvlt ci --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-