Commits · 9ddf4f4f03608095224cd3354b62c6f7d0d4b009 · chenpangpang / transformers

25 Feb, 2023 1 commit

Fix resume_from_checkpoint for deepspeed (#21735) · 9ddf4f4f

Moshe Berchansky authored Feb 25, 2023



* Fix resume_from_checkpoint for deepspeed

Fix resume_from_checkpoint for deepspeed, by ensuring that the deepspeed engine is the one to load the checkpoint.

* Empty commit to trigger CI

* Removed deepspeed skipping 

Removed deepspeed skipping inside the _load_from_checkpoint function, as it is obsolete

* another adjustment

* Trigger CI

* trigger circleci

* style

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>

9ddf4f4f

24 Feb, 2023 12 commits

[SpeechT5] Fix HiFiGAN tests (#21788) · 3dae0d7b
Sanchit Gandhi authored Feb 24, 2023

3dae0d7b

[GPT2, ProphetNet] Fix gradient checkpointing bug (#21772) · 59c1d5b9

Yi Heng Lim authored Feb 24, 2023

* fix gradient checkpointing bug

* fix gradient checkpointing bug

* ran make fix-copies

* fixed bug

* fixed bug

59c1d5b9

[time series] updated expected values for integration test. (#21762) · ba0e370d

Kashif Rasul authored Feb 24, 2023

* updated expected

* prediction_length fix

* prediction_length default value

* default prediction_length 24

* revert back prediction_length default

* move prediction_length test

ba0e370d

Generate - update cookie cutters to not initialize cache with training and... · 440f3975
Joao Gante authored Feb 24, 2023
```
Generate - update cookie cutters to not initialize cache with training and gradient checkpointing (#21759)
```
440f3975

Fix-ci-whisper (#21767) · 087436c9

Arthur authored Feb 24, 2023

* fix history

* input_features instead of input ids for TFWhisport doctest

* use translate intead of transcribe

087436c9

[Whisper] Add SpecAugment (#21298) · c8545d2a

bofeng huang authored Feb 24, 2023



* Return and rescale attention_mask

* Add SpecAugment to Whisper modeling

* Fix test

* Update docstring

* Add SpecAug related parameters to model config

* Add the _mask_input_features function to doc

* Fix quality

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Remove dev comments

* Add test

* Resolve conflict

* feat: mask {feature, time} prob fast tests

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: sanchit-gandhi <sanchit@huggingface.co>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

c8545d2a

[Flax] Fix erroneous kwargs being passed to generate config (#21765) · 75bd49ff
Sanchit Gandhi authored Feb 24, 2023

75bd49ff
Different behavior in DistilBERT when using "inputs_embeds" (#21752) · 14f33205
Arthur authored Feb 24, 2023
```
* Different behavior in DistilBERT when using "inputs_embeds"
Fixes #21089

* fix failing test
```
14f33205
[Examples] Generalise run audio classification for log-mel models (#21756) · 13489248
Sanchit Gandhi authored Feb 24, 2023
```
* [Examples] Generalise run audio classification for log-mel models

* batch feature extractor

* make style
```
13489248

[Flax] adding support for batch norm layers (#21581) · f7ca656f

Shubhamai authored Feb 24, 2023

* [flax] adding support for batch norm layers

* fixing bugs related to pt+flax integration

* cleanup, batchnorm support in sharded pt to flax

* support for batchnorm tests in pt+flax integration

* simplifying checking batch norm layer

f7ca656f

fix: Change is_last chunk calc and add conditional break in chunk_iter (#21612) · 279008ad

Connor Henderson authored Feb 24, 2023

* fix: Change is_last chunk calc and add conditional break

* format fix

* account for 0 and full stride_rights, add comment

* add new test

* make style

* update slow whisper asr test timestamps

* use nested_simplify on output and round timestamp to hundreths place

279008ad

Graphormer fix (#21699) · 4446b6b0

Clémentine Fourrier authored Feb 24, 2023

* Removed useless check for backend

* fix style check for graphormer

* Reverted change and corrected requires_backend for cython

* code qual

4446b6b0

23 Feb, 2023 10 commits

[deepspeed tests] fix issues introduced by #21700 (#21769) · 63306263
Stas Bekman authored Feb 23, 2023
```
* [deepspeed tests] fix issues introduced by #21700

* fix

* fix
```
63306263

Auto api Value Error addition to Troubleshoot (#21708) · 04d90ac4

Maria Khalusova authored Feb 23, 2023



* troubleshooting guide: added an error description for missing auto-mapping

* minor polishing

* changed the example

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/troubleshooting.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

04d90ac4

Added Type Hints for modeling_tf_encoder_decoder.py (#21673) · 0ffa22f9

Batese2001 authored Feb 23, 2023



* Ran Black formatting

* Added imports and reformatted

* Update src/transformers/models/encoder_decoder/modeling_tf_encoder_decoder.py

---------
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

0ffa22f9

Skip test_log_level for now · aa3787c8
ydshieh authored Feb 23, 2023

aa3787c8
Generate: Fix GIT batched captioning (#21738) · 1d4b7978
Joao Gante authored Feb 23, 2023

1d4b7978

[`GPTNeo`] Fix gradient checkpointing bug (#21733) · 78a93d17

Younes Belkada authored Feb 23, 2023



* fix bug

* forward contrib credits from discussions

* change logic

---------
Co-authored-by: edbeeching <edbeeching@users.noreply.github.com>

78a93d17

Fix 2 quicktour file doctest (#21742) · 36a6a1ad

Yih-Dar authored Feb 23, 2023



* Update expect output values - as Hub repo. files are updated

* Update expect output values - as librosa is from 0.9.2 to 0.10.0 on CI docker

* fix

* update one more

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

36a6a1ad

Update doctest GH workflow file (#21744) · ff143ae1
Yih-Dar authored Feb 23, 2023
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
ff143ae1
Make ImageProcessorMixin compatible with subfolder kwarg (#21725) · 448e050b
Naga Sai Abhinay authored Feb 23, 2023
```
* Add subfolder support

* Add kwarg docstring

* formatting fix

* Add test
```
448e050b
typos in french documentation (#21750) · 064f3748
Thomas Paviot authored Feb 23, 2023

064f3748

22 Feb, 2023 12 commits

Added "Open in Colab" to task guides (#21729) · 619d51e0
Maria Khalusova authored Feb 22, 2023
```
added Open in Colab to task guides
```
619d51e0

Fix to KerasMetricCallback when the model returns unstructured output (#21727) · d913f4aa

Matt authored Feb 22, 2023

* Stop doing dict-things to non-dict inputs

* Add a debug check

* Add a debug check

* Remove debug checks, looks good now!

* make fixup

d913f4aa

[SpeechT5HifiGan] Handle batched inputs (#21702) · 82e61f34
Sanchit Gandhi authored Feb 22, 2023
```
* [SpeechT5HifiGan] Handle batched inputs

* fix docstring

* rebase and new ruff style
```
82e61f34

Fix `GPTSanJapaneseModel` (#21731) · 09127c57

Yih-Dar authored Feb 22, 2023



* fix

* skip test_model_parallelism

* skip test_model_parallelism

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

09127c57

Fix `ErnieMEmbeddings` device issue (#21726) · aff87da1

Yih-Dar authored Feb 22, 2023



* remove .parameters()).device

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

aff87da1

Change doc example for `BigBirdForQuestionAnswering` (#21723) · 2f2b19ff

Yih-Dar authored Feb 22, 2023



Change doc example for BigBirdForQuestionAnswering
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2f2b19ff

Remove `gptsan_japanese` from doctest list to avoid GPU OOM (#21722) · 354b3383
Yih-Dar authored Feb 22, 2023
```
remove from doctest list to avoid GPU OOM
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
354b3383
Respect documentation on passive log level (#21700) · b19d64d8
Sylvain Gugger authored Feb 22, 2023
```
* Respect documentation on passive log level

* Fix test and set log level in examples

* Add doc
```
b19d64d8
Fix quality · ee6e71e2
Sylvain Gugger authored Feb 22, 2023

ee6e71e2
[`MBart`] Fix cross attention mask check (#21730) · 24b930ad
Younes Belkada authored Feb 22, 2023
```
fix typo
```
24b930ad
Apply ruff flake8-comprehensions (#21694) · 5e8c8eb5
Aaron Gokaslan authored Feb 22, 2023

5e8c8eb5

Time series transformer: input projection and Std scaler (#21020) · df06fb1f

Kashif Rasul authored Feb 22, 2023



* added loc and scale outputs from scalers

* fix typo

* fix tests

* fixed formatting

* initial StdScaler

* move scaling to optional str

* calculate std feature for scalers

* undid change as it does not help

* added StdScaler with weights

* added input projection layer and d_model hyperparam

* use linear proj

* add back layernorm_embedding

* add sin-cos pos embeddings

* updated scalers

* formatting

* fix type

* fixed test

* fix repeated_past_values cal.

* fix when keepdim=false

* fix default_scale

* backward compatibility of scaling config

* update integration test expected output

* fix style

* fix docs

* use the actual num_static_real_features in feature_dim cal

* clarified docs

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* prediction_length is not optional

* fix for reviewer

* Update src/transformers/models/time_series_transformer/configuration_time_series_transformer.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* get rid of un-needed new lines

* fix doc

* remove unneeded new lines

* fix style

* static_categorical_features and static_real_features are optional

* fix integration test

* Update src/transformers/models/time_series_transformer/modeling_time_series_transformer.py
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* fixing docs for multivariate setting

* documentation for generate

---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

df06fb1f

21 Feb, 2023 5 commits

Adding type hints to call() functions in this file (#21548) · bb5a2f2f

mollerup23 authored Feb 21, 2023



* Adding type hints to call() functions in this file

* make fixup

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

* Update src/transformers/models/marian/modeling_tf_marian.py

---------
Co-authored-by: Matt <rocketknight1@gmail.com>
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

bb5a2f2f

Adding task guides to resources (#21704) · 78a53d59

Maria Khalusova authored Feb 21, 2023



* added resources: links to task guides that support these models

* minor polishing

* conflict resolved

* link fix

* Update docs/source/en/model_doc/vision-encoder-decoder.mdx
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

---------
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

78a53d59

Fix TVLT (torch device issue) (#21710) · 03aaac35

Yih-Dar authored Feb 21, 2023



* fix tvlt ci

* fix tvlt ci

* fix tvlt ci

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

03aaac35

Fix `get_class_in_module` (#21709) · 4c6346cc

Yih-Dar authored Feb 21, 2023



Fix get_class_in_module
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

4c6346cc

Fix typo in `PROCESSOR_MAPPING_NAMES` and add tests (#21703) · ed6ceb76

Yih-Dar authored Feb 21, 2023



* Add test

* Fix GITProcessor

* Update

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

ed6ceb76