- 06 Jun, 2024 16 commits
-
Alex Gorodnitskiy authored
Fix DonutSwinLayer attention mask device
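The fix concerns an attention mask living on a different device than the activations. A minimal sketch of the idea (illustrative only, not the actual DonutSwinLayer code; the function and tensor names here are hypothetical): move a precomputed mask onto the scores' device before the broadcasted add.

```python
import torch

def apply_attn_mask(attn_scores: torch.Tensor, attn_mask: torch.Tensor) -> torch.Tensor:
    # A mask built once (e.g. on CPU during window partitioning) may not live
    # on the same device as the attention scores; move it over first so the
    # add does not fail with a cross-device error on GPU.
    attn_mask = attn_mask.to(attn_scores.device)
    return attn_scores + attn_mask

scores = torch.zeros(2, 4, 4)                      # (heads, seq, seq)
mask = torch.full((4, 4), float("-inf")).triu(1)   # mask built separately
masked = apply_attn_mask(scores, mask)
```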
-
dependabot[bot] authored
Bump transformers in /examples/research_projects/bertabs Bumps [transformers](https://github.com/huggingface/transformers) from 3.5.1 to 4.38.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](https://github.com/huggingface/transformers/compare/v3.5.1...v4.38.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:production ...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Vu Huy Nguyen authored
* Add list check for image and question * Handle passing two lists and update docstring * Add tests * Add support for dataset * Add test for dataset as input * fixup * fix unprotected import * fix unprotected import * fix import again * fix param type
-
dependabot[bot] authored
Bump transformers in /examples/research_projects/codeparrot Bumps [transformers](https://github.com/huggingface/transformers) from 4.19.0 to 4.38.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](https://github.com/huggingface/transformers/compare/v4.19.0...v4.38.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:production ...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
amyeroberts authored
* Mark MobileNetV1ModelTest::test_batching_equivalence as flaky * Add link to issue * woops
-
Omar Salman authored
* Initial attempt * Updates: PR suggestions * Interpolate the relative position bias when interpolate_pos_encoding is True * Add slow tag for the added tests * Add in DATA2VEC_VISION_INPUTS_DOCSTRING
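Interpolating a relative position bias means resizing the 2D bias grid when the input resolution differs from pretraining. A hedged sketch of the core operation (illustrative only; the actual Data2VecVision code works on its relative_position_bias_table and index layout, and the shapes here are assumptions):

```python
import torch
import torch.nn.functional as F

def interpolate_rel_pos_bias(bias: torch.Tensor, new_size: tuple) -> torch.Tensor:
    # bias: (num_heads, h, w) grid of relative position biases at the
    # pretraining resolution; bicubic-resize it for a new resolution.
    return F.interpolate(
        bias.unsqueeze(0), size=new_size, mode="bicubic", align_corners=False
    ).squeeze(0)

bias = torch.randn(12, 13, 13)
resized = interpolate_rel_pos_bias(bias, (27, 27))
```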
-
Marc Sun authored
* fix accelerate tests for roberta xl * style
-
Baole Ai authored
* Fix _save_tpu: use _maybe_convert_to_cpu instead of to cpu. * fix lint
-
dependabot[bot] authored
Bump transformers in /examples/research_projects/bertology Bumps [transformers](https://github.com/huggingface/transformers) from 3.5.1 to 4.38.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](https://github.com/huggingface/transformers/compare/v3.5.1...v4.38.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:production ...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Huazhong Ji authored
-
Lucain authored
Switch from hf_hub_url to hf_hub_download in remaining occurrences
-
Raushan Turganbay authored
* fix special tokens in generation * fix test * add warning * fix the check * warn once * fix
-
Raushan Turganbay authored
* make mamba use cache * use cache naming as in mamba * fix musicgen
-
Zhiyuan Chen authored
-
Ranggi Hwang authored
* SwitchTransformer MoE layer performance improvement * make fixup * comments about shapes * make fixup
-
graham authored
no need for explicit EXTRA_TOKENS
-
- 05 Jun, 2024 12 commits
-
amyeroberts authored
Skip failing tests for now
-
Cyril Vallez authored
* Fix contrastive_search for new cache structure, and improve performance by removing inefficient torch.stack(torch.split(x, top_k, dim=0)) * Fix _contrastive_search for non-standard cache using ellipsis slicing * Fix all outputs.logits memory leaks for all decoding strategies! * Fix small error in _contrastive_search() * Make all necessary changes and revert for the new class * Apply coding style * Remove pipes in type hints for compatibility * correct type hint * apply style * Use DynamicCache by default and solve conflicts * Fix rebase issues * Add `_supports_dynamic_cache_class` in models for models that support DynamicCache but not other caches to make DynamicCache the default for more models * Create generation config to return legacy format by default, or to choose not to * style * Fix case when use_cache is False * Remove default DynamicCache in _assisted_decoding if assistant_model does not support it + fix _seen_tokens when cropping cache * Update prepare_inputs_for_generation() for case with empty DynamicCache * Correct return of args in _assisted_decoding * Remove EfficientDynamicCache as it is no longer needed * Correct mistake in generation config * Move cache logic of assisted decoding to AssistedCandidateGenerator.__init__ * change DynamicCache function names from "split" to "batch_split" for readability + apply coding style * Remove `_supports_dynamic_cache_class` attribute after rebase * Correct missing line lost in conflict resolution during rebasing * Add special case for Jamba * Fix jamba test * Coding style * coding style * Correct missing import in rebasing * Simplify _validate_model_kwargs based on removal of _supports_dynamic_cache attribute * Simplify code paths in _contrastive_search * coding style * Update docstrings of cache methods * Update prepare_inputs_for_generation() -> past_key_values are always Cache objects
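The inefficient pattern named in the first bullet, torch.stack(torch.split(x, top_k, dim=0)), regroups a (batch * top_k, ...) tensor into (batch, top_k, ...) by materializing a tuple of chunks and restacking them. A single reshape produces the same result without the intermediate copies; a small sketch of the equivalence:

```python
import torch

x = torch.arange(24.0).reshape(6, 4)  # (batch * top_k, hidden) with top_k = 3
top_k = 3

# Original pattern: split into chunks of top_k rows, then re-stack them.
slow = torch.stack(torch.split(x, top_k, dim=0))

# Equivalent single reshape: no intermediate tuple of tensors.
fast = x.view(-1, top_k, x.shape[-1])

assert torch.equal(slow, fast)
```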
-
Yih-Dar authored
fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Dhaivat Bhatt authored
-
bastrob authored
* add flaubert tokenization test, enrich inheritance in FlaubertTokenizer. * fix quality code ci * ensure parameter consistency * fix ci * fix copyright year and flatten vocab list. * fix style
-
Huazhong Ji authored
-
Vaibhav Srivastav authored
* doc: add info about wav2vec2 bert in older wav2vec2 models. * apply suggestions from review. * forward contrib credits from review --------- Co-authored-by: Sanchit Gandhi <sanchit-gandhi@users.noreply.github.com>
-
dependabot[bot] authored
Bump transformers in /examples/research_projects/deebert Bumps [transformers](https://github.com/huggingface/transformers) from 3.5.1 to 4.38.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](https://github.com/huggingface/transformers/compare/v3.5.1...v4.38.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:production ...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
amyeroberts authored
* Move label validation checks - fail early * Remove some formatting changes - add back labels change wav2vec2
-
Yih-Dar authored
* benchmark workflow * build --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
James Braza authored
Fixed torch definition error
-
Yury Sulsky authored
The StoppingCriteriaList allocates is_done without specifying dtype=torch.bool. On XLA this allocates a float tensor and causes a failure on the following line: is_done = is_done | criteria(input_ids, scores, **kwargs) by attempting to OR float with bool.
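A minimal sketch of the described fix (not the exact StoppingCriteriaList code): allocate the done-flags tensor as bool from the start, so OR-ing it with the bool result of a stopping criterion is well-typed on every backend, including XLA.

```python
import torch

batch_size = 4

# Allocating without a dtype would yield float zeros; OR-ing a float tensor
# with a bool tensor fails on XLA. Allocate as bool from the start.
is_done = torch.zeros(batch_size, dtype=torch.bool)

# Hypothetical per-sequence result of one stopping criterion.
criteria_hit = torch.tensor([True, False, False, True])
is_done = is_done | criteria_hit
```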
-
- 04 Jun, 2024 12 commits
-
dependabot[bot] authored
Bump transformers in /examples/research_projects/vqgan-clip Bumps [transformers](https://github.com/huggingface/transformers) from 4.26.0 to 4.38.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](https://github.com/huggingface/transformers/compare/v4.26.0...v4.38.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:production ...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
-
Yih-Dar authored
* build * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
amyeroberts authored
* Move out common validation * Add missing backbone config arguments
-
Younes Belkada authored
* deprecate blip * mention deprecation on docs
-
Yih-Dar authored
* fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Manuel Faysse authored
-
Jacklanda authored
✨ Add a newline before logging "***** Running {description} *****". Signed-off-by: jacklanda <yonyonlau@gmail.com>
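The change itself is one character, but a sketch makes the intent concrete (function and message names here are illustrative, not the Trainer's actual code): prefixing the banner with a newline keeps it visually separated from whatever was printed before it.

```python
import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

def log_run_banner(description: str) -> str:
    # The leading "\n" detaches the banner from prior log output,
    # e.g. a progress bar that did not end with a newline.
    msg = f"\n***** Running {description} *****"
    logger.info(msg)
    return msg

banner = log_run_banner("training")
```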
-
amyeroberts authored
* Fix pipeline tests - torch imports * Framework-dependent float conversion
-
Chujie Zheng authored
* fix logits dtype * Add bf16/fp16 tests for text_classification pipeline * Update test_pipelines_text_classification.py * fix * fix
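When a model runs in fp16/bf16, taking the softmax in half precision can lose accuracy in the returned scores. A hedged sketch of the idea behind the dtype fix (not the pipeline's actual postprocessing code): upcast the logits to float32 before the softmax.

```python
import torch

def postprocess_logits(logits: torch.Tensor) -> torch.Tensor:
    # Upcast half-precision logits to float32 before softmax so the
    # class probabilities are computed in full precision.
    return torch.softmax(logits.float(), dim=-1)

half_logits = torch.tensor([[0.5, -1.0, 2.0]], dtype=torch.float16)
probs = postprocess_logits(half_logits)
```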
-
Kristen Pereira authored
* Added interpolate pos encoding feature and test to deit * Added interpolate pos encoding feature and test for deit TF model * re-added accidentally deleted test for multi_gpu * storing only patch_size instead of entire config and removed commented code * Update modeling_tf_deit.py to remove extra line Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Raushan Turganbay authored
video-llava can handle more frames
-
Max Strobel authored
* fix(PatchTST): Wrong dropout used for PretrainHead * feat(PatchTST): remove unused config.dropout --------- Co-authored-by: Strobel Maximilian (IFAG PSS SIS SCE ACM) <Maximilian.Strobel@infineon.com>
-