Commits · 98e2d48e9af3631e8f8f2070198912fc5d6bc19e · chenpangpang / transformers

28 May, 2024 8 commits

Fix OWLv2 post_process_object_detection for multiple images (#31082) · 98e2d48e
Pavel Iakubovskii authored May 28, 2024
```
* Add test for multiple images

* [run slow] owlv2

* Fix box rescaling

* [run slow] owlv2
```
98e2d48e
Remove float64 cast for OwlVit and OwlV2 to support MPS device (#31071) · c31473ed
Pavel Iakubovskii authored May 28, 2024
```
Remove float64
```
c31473ed

fix from_pretrained in offline mode when model is preloaded in cache (#31010) · 936ab7ba

oOraph authored May 28, 2024



* Unit test to verify fix
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

* fix from_pretrained in offline mode when model is preloaded in cache
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

* minor: fmt
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

---------
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>
Co-authored-by: Raphael Glon <oOraph@users.noreply.github.com>

936ab7ba

Remove redundant backend checks in training_args.py (#30999) · 537deb78

Hengwen Tong authored May 28, 2024



* Remove backend checks in training_args.py

* Expilicit initialize the device

---------
Co-authored-by: tonghengwen <tonghengwen@cambricon.com>

537deb78

Update quicktour.md to fix broken link to Glossary (#31072) · dd4654ea

AP authored May 28, 2024

Update quicktour.md to fix broken link

Missing '/' in attention mask link in the transformers quicktour

dd4654ea

fix "piano" typo (#31027) · e18da4e3
Clint Adams authored May 28, 2024

e18da4e3
Remove `ninja` from docker image build (#31080) · 8e3b1fef
Yih-Dar authored May 28, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8e3b1fef

use `@main` (#31065) · 8f0f7271

Yih-Dar authored May 28, 2024



use main
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

8f0f7271

27 May, 2024 7 commits
- skip `test_model_parallelism` for 2 model test classes (#31067) · 9d35edbb
  Yih-Dar authored May 27, 2024
```
skip
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  9d35edbb
- Fix pad_to_max_length Whisper (#30787) · d355741e
  Yoach Lacombe authored May 27, 2024
```
* fix pad_to_max_length Whisper

* add tests

* make style
```
  d355741e
- Fix quanto tests (#31062) · b84cd675
  Marc Sun authored May 27, 2024
```
fix quanto tests
```
  b84cd675
- Update feature request label in template (#30940) · cd797778
  amyeroberts authored May 27, 2024
  
  cd797778
- Follow up: Fix link in dbrx.md (#30514) · 0a064dc0
  Eitan Turok authored May 27, 2024
```
* Fix link in dbrx.md

* remove "though this may not be up to date"

---------
Co-authored-by: Lysandre Debut <hi@lysand.re>
```
  0a064dc0
- unpin uv (#31055) · d7942d9d
  Yih-Dar authored May 27, 2024
```
[push-ci-image]
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  d7942d9d
- Redirect transformers_agents doc to agents (#31054) · 84c4b72e
  Aymeric Roucher authored May 27, 2024
  
  84c4b72e
24 May, 2024 14 commits

Paligemma- fix devices and dtype assignments (#31008) · bdb9106f
Pablo Montalvo authored May 24, 2024
```
* fix devices and dtype assignments

* [run-slow]paligemma
```
bdb9106f

Add split special tokens (#30772) · deba7655

Ita Zaporozhets authored May 24, 2024



* seems like `split_special_tokens` is used here

* split special token

* add new line at end of file

* moving split special token test to common tests

* added assertions

* test

* fixup

* add co-author

* passing rest of args to gptsan_japanese, fixing tests

* removing direct comparison of fast and slow models

* adding test support for UDOP and LayoutXLM

* ruff fix

* readd check if slow tokenizer

* modify test to handle bos tokens

* removing commented function

* trigger build

* applying review feedback - updated docstrings, var names, and simplified tests

* ruff fixes

* Update tests/test_tokenization_common.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* applying feedback, comments

* shutil temp directory fix

---------
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain>
Co-authored-by: itazap <itazap@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MacBook-Pro.local>

deba7655

added interpolation for vitmae model in pytorch as well as tf. (#30732) · e5103a76

BHUVAN M authored May 24, 2024



* added interpolation for vitmae model in pytorch as well as tf.

* Update modeling_vit_mae.py

irreugalr import fixed

* small changes and proper formatting

* changes suggested in review.

* modified decoder interpolate_func

* arguments and docstring fix

* Apply suggestions from code review

doc fixes
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

e5103a76

save the list of new model failures (#31013) · a3cdff41
Yih-Dar authored May 24, 2024
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a3cdff41
Quantization / TST: Fix remaining quantization tests (#31000) · 658b849a
Younes Belkada authored May 24, 2024
```
* Fix remaining quant tests

* Update test_quanto.py
```
658b849a
Fix resume_download future warning (#31007) · fd3c1280
Lucain authored May 24, 2024
```
* Fix resume_download future warning

* better like this

* Add regression test
```
fd3c1280

allow multi-gpu (#31011) · acbfaf69

Yih-Dar authored May 24, 2024



* allow multi-gpu

* allow multi-gpu

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

acbfaf69

FIX / TST: Fix expected results on Mistral AWQ test (#30971) · ae87f979
Marc Sun authored May 24, 2024
```
fix awq mistral test
```
ae87f979
[tests] make `test_model_parallelism` device-agnostic (#30844) · 04c7c176
Fanli Lin authored May 24, 2024
```
* enable on xpu

* fix style

* add comment and mps
```
04c7c176

Perceiver interpolate position embedding (#30979) · 42d8dd87

Yixiang Gao authored May 24, 2024



* add test that currently fails

* test passed

* all perceiver passed

* fixup, style, quality, repo-consistency, all passed

* Apply suggestions from code review: default to False + compute sqrt once only
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix a minor bracket

* replace dim with self._num_channels

* add arguments to the rest preprocessors

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

42d8dd87

pin `uv==0.1.45` (#31006) · 5855afd1

Yih-Dar authored May 24, 2024



* fix

* [push-ci-image]

* run with latest

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5855afd1

Do not trigger autoconversion if local_files_only (#31004) · 03935d30
Lucain authored May 24, 2024

03935d30

Fix training speed regression introduced by "optimize VRAM for calculating... · 21e259d8

Kevin Koehncke authored May 24, 2024

Fix training speed regression introduced by "optimize VRAM for calculating pos_bias in LayoutLM v2, v3 (#26139)" (#30988)

* Revert "optimize VRAM for calculating pos_bias in LayoutLM v2, v3 (#26139)"

This reverts commit a7e0ed82

.

* Instead of reverting commit, wrap indexing in torch.no_grad context

* Apply wrapping in LayoutLMv2

* Add comments explaining reason for no_grad

* Fix code format

---------
Co-authored-by: Kevin Koehncke <kevin.koehncke@uipath.com>

21e259d8

add prefix space ignored in llama #29625 (#30964) · 7f6e8741

Ita Zaporozhets authored May 24, 2024



* add prefix space ignored in llama #29625

* adding test with add_prefix_space=False

* ruff

---------
Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain>

7f6e8741

23 May, 2024 11 commits

Bugfix: WandbCallback uploads initial model checkpoint (#30897) · 6657fb5f

Matthias Gerstgrasser authored May 23, 2024

* fix wandb always uploading initial model

* Update comment.

* Optionally log initial model

* Revert "Optionally log initial model"

This reverts commit 9602cc1fad3feaf218f82a7339a194d3d2fbb946.

6657fb5f

Remove deprecated properties in tokenization_nllb.py and tokenization_nllb_fast.py (#29834) · 6d3d5b10

Yasmin Moslem authored May 23, 2024

* Fix typo in tokenization_nllb.py

Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability.

* Fix typo in tokenization_nllb_fast.py

Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability.

* Remove deprecated attributes in tokenization_nllb.py

Remove deprecated attributes: `lang_code_to_id`, `fairseq_tokens_to_ids`, `id_to_lang_code`, and `fairseq_ids_to_tokens`

* Remove deprecated attribute in tokenization_nllb_fast.py

Remove deprecated attribute `lang_code_to_id`

* Remove deprecated properties in tokenization_nllb.py

Remove deprecated properties - fix format

* Remove deprecated properties in tokenization_nllb_fast.py

Remove deprecated properties - fix format

* Update test_tokenization_nllb.py

* update test_tokenization_nllb.py

* Update tokenization_nllb.py

* Update test_tokenization_seamless_m4t.py

* Update test_tokenization_seamless_m4t.py

6d3d5b10

[Port] TensorFlow implementation of Mistral (#29708) · 965e98dc

Aritra Roy Gosthipaty authored May 23, 2024



* chore: initial commit

* chore: adding imports and inits

* chore: adding the causal and classification code

* chore: adding names to the layers

* chore: using single self attn layer

* chore: built the model and layers

* chore: start with testing

* chore: docstring change, transpose fix

* fix: rotary embedding

* chore: adding cache implementation

* remove unused torch

* chore: fixing the indexing issue

* make fix-copies

* Use modeling_tf_utils.keras

* make fixup

* chore: fixing tests

* chore: adding past key value logic

* chore: adding multi label classfication test

* fix: switching on the built parameters in the layers

* fixing repo consistency

* ruff formats

* style changes

* fix: tf and pt equivalence

* removing returns from docstrings

* fix docstrings

* fix docstrings

* removing todos

* fix copies

* fix docstring

* fix docstring

* chore: using easier rotate_half

* adding integration tests

* chore: addressing review related to rotary embedding layer

* review changes

* [run-slow] mistral

* skip: test save load after resize token embedding

* style

---------
Co-authored-by: Matt <rocketknight1@gmail.com>

965e98dc

Update 4 `MptIntegrationTests` expected outputs (#30989) · 2a89673f

Yih-Dar authored May 23, 2024



* fix

* fix

* fix

* fix

* fix

* [run-slow] mpt

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2a89673f

Add a check that warmup_setps is either 0 or >= 1 (#30764) · 892b13d3

Yasmin Moslem authored May 23, 2024



* Add a check that warmup_setps is either 0 or >= 1

Update training_args.py to add a check that warmup_setps is either 0 or >= 1. Otherwise, raise an error.

* Update src/transformers/training_args.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

892b13d3

[tests] add `torch.use_deterministic_algorithms` for XPU (#30774) · 21339a52
Fanli Lin authored May 23, 2024
```
* add xpu check

* add marker

* add documentation

* update doc

* fix ci

* remove from global init

* fix
```
21339a52

Fix accelerate failing tests (#30836) · 8366b572

Marc Sun authored May 23, 2024

* Fix accelerate tests

* fix clip

* skip dbrx tests

* fix GPTSan

* fix M2M100Model

* same fix as jamba

* fix mt5

* Fix T5Model

* Fix umt5 model

* fix switch_transformers

* fix whisper

* fix gptsan again

* fix siglip recent test

* skip siglip tests

* wrong place fixed

8366b572

FIX / Docs: Minor changes in quantization docs (#30985) · 5a74ae6d

Younes Belkada authored May 23, 2024



* Change in quantization docs

* Update overview.md

* Update docs/source/en/quantization/overview.md
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

5a74ae6d

Finish adding support for torch.compile dynamic shapes (#30919) · 046c2ad7
Benjamin Warner authored May 23, 2024
```
add torch.compile dynamic support
```
046c2ad7
test_custom_4d_attention_mask skip with sliding window attn (#30833) · 6739e1d2
Poedator authored May 23, 2024

6739e1d2

Docs / Quantization: refactor quantization documentation (#30942) · 87a35181

Younes Belkada authored May 23, 2024



* refactor quant docs

* delete file

* rename to overview

* fix

* fix table

* fix

* add content

* fix library versions

* fix table

* fix table

* fix table

* fix table

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* replace to quantization_config

* fix aqlm snippet

* add DLAI courses

* fix

* fix table

* fix bulet points

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

87a35181