Commits · 279008adc3976ee6b914c07c401527bbc178dffe · chenpangpang / transformers

24 Feb, 2023 2 commits

fix: Change is_last chunk calc and add conditional break in chunk_iter (#21612) · 279008ad

Connor Henderson authored Feb 24, 2023

* fix: Change is_last chunk calc and add conditional break

* format fix

* account for 0 and full stride_rights, add comment

* add new test

* make style

* update slow whisper asr test timestamps

* use nested_simplify on output and round timestamp to hundreths place

279008ad

Graphormer fix (#21699) · 4446b6b0

Clémentine Fourrier authored Feb 24, 2023

* Removed useless check for backend

* fix style check for graphormer

* Reverted change and corrected requires_backend for cython

* code qual

4446b6b0

23 Feb, 2023 10 commits

[deepspeed tests] fix issues introduced by #21700 (#21769) · 63306263
Stas Bekman authored Feb 23, 2023
```
* [deepspeed tests] fix issues introduced by #21700

* fix

* fix
```
63306263

Auto api Value Error addition to Troubleshoot (#21708) · 04d90ac4

Maria Khalusova authored Feb 23, 2023



* troubleshooting guide: added an error description for missing auto-mapping

* minor polishing

* changed the example

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/troubleshooting.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

04d90ac4

Added Type Hints for modeling_tf_encoder_decoder.py (#21673) · 0ffa22f9

Batese2001 authored Feb 23, 2023



* Ran Black formatting

* Added imports and reformatted

* Update src/transformers/models/encoder_decoder/modeling_tf_encoder_decoder.py

---------
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

0ffa22f9

Skip test_log_level for now · aa3787c8
ydshieh authored Feb 23, 2023

aa3787c8
Generate: Fix GIT batched captioning (#21738) · 1d4b7978
Joao Gante authored Feb 23, 2023

1d4b7978

[`GPTNeo`] Fix gradient checkpointing bug (#21733) · 78a93d17

Younes Belkada authored Feb 23, 2023



* fix bug

* forward contrib credits from discussions

* change logic

---------
Co-authored-by: edbeeching <edbeeching@users.noreply.github.com>

78a93d17

Fix 2 quicktour file doctest (#21742) · 36a6a1ad

Yih-Dar authored Feb 23, 2023



* Update expect output values - as Hub repo. files are updated

* Update expect output values - as librosa is from 0.9.2 to 0.10.0 on CI docker

* fix

* update one more

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

36a6a1ad

Update doctest GH workflow file (#21744) · ff143ae1
Yih-Dar authored Feb 23, 2023
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
ff143ae1
Make ImageProcessorMixin compatible with subfolder kwarg (#21725) · 448e050b
Naga Sai Abhinay authored Feb 23, 2023
```
* Add subfolder support

* Add kwarg docstring

* formatting fix

* Add test
```
448e050b
typos in french documentation (#21750) · 064f3748
Thomas Paviot authored Feb 23, 2023

064f3748

22 Feb, 2023 12 commits

Added "Open in Colab" to task guides (#21729) · 619d51e0
Maria Khalusova authored Feb 22, 2023
```
added Open in Colab to task guides
```
619d51e0

Fix to KerasMetricCallback when the model returns unstructured output (#21727) · d913f4aa

Matt authored Feb 22, 2023

* Stop doing dict-things to non-dict inputs

* Add a debug check

* Add a debug check

* Remove debug checks, looks good now!

* make fixup

d913f4aa

[SpeechT5HifiGan] Handle batched inputs (#21702) · 82e61f34
Sanchit Gandhi authored Feb 22, 2023
```
* [SpeechT5HifiGan] Handle batched inputs

* fix docstring

* rebase and new ruff style
```
82e61f34

Fix `GPTSanJapaneseModel` (#21731) · 09127c57

Yih-Dar authored Feb 22, 2023



* fix

* skip test_model_parallelism

* skip test_model_parallelism

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

09127c57

Fix `ErnieMEmbeddings` device issue (#21726) · aff87da1

Yih-Dar authored Feb 22, 2023



* remove .parameters()).device

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

aff87da1

Change doc example for `BigBirdForQuestionAnswering` (#21723) · 2f2b19ff

Yih-Dar authored Feb 22, 2023



Change doc example for BigBirdForQuestionAnswering
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2f2b19ff

Remove `gptsan_japanese` from doctest list to avoid GPU OOM (#21722) · 354b3383
Yih-Dar authored Feb 22, 2023
```
remove from doctest list to avoid GPU OOM
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
354b3383
Respect documentation on passive log level (#21700) · b19d64d8
Sylvain Gugger authored Feb 22, 2023
```
* Respect documentation on passive log level

* Fix test and set log level in examples

* Add doc
```
b19d64d8
Fix quality · ee6e71e2
Sylvain Gugger authored Feb 22, 2023

ee6e71e2
[`MBart`] Fix cross attention mask check (#21730) · 24b930ad
Younes Belkada authored Feb 22, 2023
```
fix typo
```
24b930ad
Apply ruff flake8-comprehensions (#21694) · 5e8c8eb5
Aaron Gokaslan authored Feb 22, 2023

5e8c8eb5

Time series transformer: input projection and Std scaler (#21020) · df06fb1f

Kashif Rasul authored Feb 22, 2023



* added loc and scale outputs from scalers

* fix typo

* fix tests

* fixed formatting

* initial StdScaler

* move scaling to optional str

* calculate std feature for scalers

* undid change as it does not help

* added StdScaler with weights

* added input projection layer and d_model hyperparam

* use linear proj

* add back layernorm_embedding

* add sin-cos pos embeddings

* updated scalers

* formatting

* fix type

* fixed test

* fix repeated_past_values cal.

* fix when keepdim=false

* fix default_scale

* backward compatibility of scaling config

* update integration test expected output

* fix style

* fix docs

* use the actual num_static_real_features in feature_dim cal

* clarified docs

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* prediction_length is not optional

* fix for reviewer

* Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* get rid of un-needed new lines

* fix doc

* remove unneeded new lines

* fix style

* static_categorical_features and static_real_features are optional

* fix integration test

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* fixing docs for multivariate setting

* documentation for generate

---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

df06fb1f

21 Feb, 2023 8 commits

Adding type hints to call() functions in this file (#21548) · bb5a2f2f

mollerup23 authored Feb 21, 2023



* Adding type hints to call() functions in this file

* make fixup

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

---------
Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

bb5a2f2f

Adding task guides to resources (#21704) · 78a53d59

Maria Khalusova authored Feb 21, 2023



* added resources: links to task guides that support these models

* minor polishing

* conflict resolved

* link fix

* Update docs/source/en/model_doc/vision-encoder-decoder.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

78a53d59

Fix TVLT (torch device issue) (#21710) · 03aaac35

Yih-Dar authored Feb 21, 2023



* fix tvlt ci

* fix tvlt ci

* fix tvlt ci

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

03aaac35

Fix `get_class_in_module` (#21709) · 4c6346cc

Yih-Dar authored Feb 21, 2023



Fix get_class_in_module
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

4c6346cc

Fix typo in `PROCESSOR_MAPPING_NAMES` and add tests (#21703) · ed6ceb76

Yih-Dar authored Feb 21, 2023



* Add test

* Fix GITProcessor

* Update

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ed6ceb76

remove position ids and token type ids from forward args in docstring (#21701) · 4deaa534
Arthur authored Feb 21, 2023

4deaa534

Fix axial positional encoding calculations for reformer.mdx (#21649) · c40e3581

Ishan Jindal authored Feb 20, 2023



* Update reformer.mdx

Fix axial positional encoding calculations

* Update docs/source/en/model_doc/reformer.mdx
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

c40e3581

Add WhisperTokenizerFast (#21222) · deafc243

Jonatan Kłosko authored Feb 21, 2023



* Add WhisperTokenizerFast

* Fixup

* Up

* Up

* Improve tests

* Update src/transformers/models/whisper/tokenization_whisper_fast.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Keep stride in whisper pipelien test

* Remove unknown token special case

* Reduce vocabulary size in tests

* Fix vocab size assertion

* Sync copied changes from WhisperTokenizer

* Skip pipeline tests

* Update assertion

* Remove Whisper tokenizer dependency on sentencepiece

* Format

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

deafc243

20 Feb, 2023 8 commits

Pass along revision in dynamic code fetch (#21698) · 8b3db33a
Sylvain Gugger authored Feb 20, 2023

8b3db33a
Fix-rag-finetune-project-requirement (#21697) · 4194e5f4
Arthur authored Feb 20, 2023
```
pin pytorch lightning requirement
```
4194e5f4
Add EfficientNet (#21563) · 49ab1623
Alara Dirik authored Feb 20, 2023
```
* Add EfficientNet to transformers
```
49ab1623
[`bnb`] fix `bnb` decoders bug (#21688) · c9a06714
Younes Belkada authored Feb 20, 2023
```
* fix `bnb` decoders bug

* make fixup
```
c9a06714

add GPTSAN model (reopen) (#21291) · f56174ac

tanreinama authored Feb 20, 2023

* add GPTSAN-Japanese

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN (update for review)

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* fix typo in comment text

* add GPTSAN

* add GPTSAN

* add GPTSAN

* add GPTSAN

* fix document and comments

* fix class name GPTSAN->GPTSan

* fix import and test for tokenizer

f56174ac

Fix quality · c87bbe1f
Sylvain Gugger authored Feb 20, 2023

c87bbe1f
Fix for non-contiguous label tensors in VisonEncoderDecoder (#21582) · 011cc17a
Morgan McGuire authored Feb 20, 2023
```
* add prints

* add shape

* add reshape

* clean up
```
011cc17a

add flax whisper implementation (#20479) · 2840272c

Andy Ehrenberg authored Feb 20, 2023



* add flax whisper implementation

* rever change to setup

* remove unused imports

* revert generation changes

* flax whisper docs

* docs

* import order

* import sorting

* isort

* add dummy objects

* doc formatting

* formatting

* remove trailing whitespaces

* fix flax whisper docs

* add generation logic to unlock flax whisper

* remove scans

* give credits to Flax Bart implementation

* remove unused imports

* add license

* remove assert

* more credits to Bart

* fix style

* formatting

* support left padding

* add flax whisper generation test

* remove copied from comments whenever not a full copy

* fix docstrings for logits processors

* revert change to FlaxForceTokensLogitsProcessor

* revert doc changes

* improve generation docs

* reorganize

* formatting

* cleanup docs

* add tests

* handle empty list case

* fix forced decoder ids in flax tests

* add flax whisper to inits

* upate dummy objects

* docs for FlaxAutoModelForSpeechSeq2Seq

* fix decoder_position_ids computation in pretrained model decode/__call__ fns

* add Copied from statements as necessary

* compute position_ids only in __call__ and decode methods of pretrained model subclasses

* improve readabilityof compute positional embeddings

* check dimensionality of input_features instead of hidden_states

* copied from statement for init_cache

* formatting

* fix copies

* fix copies

* pass attention mask to encoder layers

* fix decoder module outputs

* set dtype
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* smaller flax model for whisper test

* Update src/transformers/generation/flax_utils.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_flax_whisper.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update tests/models/whisper/test_modeling_flax_whisper.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* cleanup
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_flax_whisper.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* bias cleanup

* doc fix

* align style for force tokens processor

* readability

* fix input shape in tests

* revert FlaxGenerationMixin docstring

* formatting

* fix tests

* fix imports

* consistent encoder hidden states

* consistent hidden states

* input shapes

* typo

* partial class trick

* partial class for input shape

* base_class with correct input shape

* partial base classes

* match by name

* set main_input_name

* compare on names

* formatting

* remove unused import

* safer position ids computation

* safer position id computation

* Update src/transformers/models/whisper/modeling_flax_whisper.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update src/transformers/models/whisper/modeling_flax_whisper.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* remove identical inherited tests

* fix prompt ids in tests

* use generation config

* use jnp array

* better var names

* more explicit bias use

* import transformers

* formatting

* test formatting

* remove unused imports

* remove unused imports

* formatting

* isort

* docs

* fix ln orders for encoder hidden states

* whisper unique generation stuff

* flake

* use finfo for attention bias

* docs

* Update src/transformers/generation/flax_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* docs

* add timestamp flax test

* jit for timestamps

* formatting

* clean up timestamps processor

* formatting

* remove if_true

* cleanup

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

2840272c