Commits · 94d416f018e3599affe53dbe43f26d9aded2fe29 · chenpangpang / transformers

28 May, 2024 15 commits

FIX: Add `accelerate` as a hard requirement (#31090) · 94d416f0
Younes Belkada authored May 28, 2024
```
add accelerate
```
94d416f0
Render chat template tojson filter as unicode (#31041) · 22dab246
Sigbjørn Skjæret authored May 28, 2024
```
* Render chat template tojson filter as unicode

* ruff--
```
22dab246

Docs / PEFT: Add PEFT API documentation (#31078) · 4f98b144

Younes Belkada authored May 28, 2024

* add peft references

* add peft references

* Update docs/source/en/peft.md

* Update docs/source/en/peft.md

4f98b144

Watermark: fix tests (#30961) · 779bc360

Raushan Turganbay authored May 28, 2024



* fix tests

* style

* Update tests/generation/test_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

779bc360

Fix failing tokenizer tests (#31083) · a3c7b59e
Lysandre Debut authored May 28, 2024
```
* Fix failing tokenizer tests

* Use small tokenizer

* Fix remaining reference
```
a3c7b59e
[SuperPoint, PaliGemma] Update docs (#31025) · 90da0b1c
NielsRogge authored May 28, 2024
```
* Update docs

* Add PaliGemma resources

* Address comment

* Update docs
```
90da0b1c
Fix typo in trainer.py (#31048) · 66add161
Sina Taslimi authored May 28, 2024

66add161
Fix OWLv2 post_process_object_detection for multiple images (#31082) · 98e2d48e
Pavel Iakubovskii authored May 28, 2024
```
* Add test for multiple images

* [run slow] owlv2

* Fix box rescaling

* [run slow] owlv2
```
98e2d48e
Remove float64 cast for OwlVit and OwlV2 to support MPS device (#31071) · c31473ed
Pavel Iakubovskii authored May 28, 2024
```
Remove float64
```
c31473ed

fix from_pretrained in offline mode when model is preloaded in cache (#31010) · 936ab7ba

oOraph authored May 28, 2024



* Unit test to verify fix
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

* fix from_pretrained in offline mode when model is preloaded in cache
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

* minor: fmt
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

---------
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>
Co-authored-by: Raphael Glon <oOraph@users.noreply.github.com>

936ab7ba

Remove redundant backend checks in training_args.py (#30999) · 537deb78

Hengwen Tong authored May 28, 2024



* Remove backend checks in training_args.py

* Expilicit initialize the device

---------
Co-authored-by: tonghengwen <tonghengwen@cambricon.com>

537deb78

Update quicktour.md to fix broken link to Glossary (#31072) · dd4654ea

AP authored May 28, 2024

Update quicktour.md to fix broken link

Missing '/' in attention mask link in the transformers quicktour

dd4654ea

fix "piano" typo (#31027) · e18da4e3
Clint Adams authored May 28, 2024

e18da4e3
Remove `ninja` from docker image build (#31080) · 8e3b1fef
Yih-Dar authored May 28, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
8e3b1fef

use `@main` (#31065) · 8f0f7271

Yih-Dar authored May 28, 2024



use main
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

8f0f7271

27 May, 2024 7 commits
- skip `test_model_parallelism` for 2 model test classes (#31067) · 9d35edbb
  Yih-Dar authored May 27, 2024
```
skip
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  9d35edbb
- Fix pad_to_max_length Whisper (#30787) · d355741e
  Yoach Lacombe authored May 27, 2024
```
* fix pad_to_max_length Whisper

* add tests

* make style
```
  d355741e
- Fix quanto tests (#31062) · b84cd675
  Marc Sun authored May 27, 2024
```
fix quanto tests
```
  b84cd675
- Update feature request label in template (#30940) · cd797778
  amyeroberts authored May 27, 2024
  
  cd797778
- Follow up: Fix link in dbrx.md (#30514) · 0a064dc0
  Eitan Turok authored May 27, 2024
```
* Fix link in dbrx.md

* remove "though this may not be up to date"

---------
Co-authored-by: Lysandre Debut <hi@lysand.re>
```
  0a064dc0
- unpin uv (#31055) · d7942d9d
  Yih-Dar authored May 27, 2024
```
[push-ci-image]
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  d7942d9d
- Redirect transformers_agents doc to agents (#31054) · 84c4b72e
  Aymeric Roucher authored May 27, 2024
  
  84c4b72e
24 May, 2024 14 commits

Paligemma- fix devices and dtype assignments (#31008) · bdb9106f
Pablo Montalvo authored May 24, 2024
```
* fix devices and dtype assignments

* [run-slow]paligemma
```
bdb9106f

Add split special tokens (#30772) · deba7655

Ita Zaporozhets authored May 24, 2024



* seems like `split_special_tokens` is used here

* split special token

* add new line at end of file

* moving split special token test to common tests

* added assertions

* test

* fixup

* add co-author

* passing rest of args to gptsan_japanese, fixing tests

* removing direct comparison of fast and slow models

* adding test support for UDOP and LayoutXLM

* ruff fix

* readd check if slow tokenizer

* modify test to handle bos tokens

* removing commented function

* trigger build

* applying review feedback - updated docstrings, var names, and simplified tests

* ruff fixes

* Update tests/test_tokenization_common.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* applying feedback, comments

* shutil temp directory fix

---------
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain>
Co-authored-by: itazap <itazap@us...

deba7655

added interpolation for vitmae model in pytorch as well as tf. (#30732) · e5103a76

BHUVAN M authored May 24, 2024



* added interpolation for vitmae model in pytorch as well as tf.

* Update modeling_vit_mae.py

irreugalr import fixed

* small changes and proper formatting

* changes suggested in review.

* modified decoder interpolate_func

* arguments and docstring fix

* Apply suggestions from code review

doc fixes
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

e5103a76

save the list of new model failures (#31013) · a3cdff41
Yih-Dar authored May 24, 2024
```
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
a3cdff41
Quantization / TST: Fix remaining quantization tests (#31000) · 658b849a
Younes Belkada authored May 24, 2024
```
* Fix remaining quant tests

* Update test_quanto.py
```
658b849a
Fix resume_download future warning (#31007) · fd3c1280
Lucain authored May 24, 2024
```
* Fix resume_download future warning

* better like this

* Add regression test
```
fd3c1280

allow multi-gpu (#31011) · acbfaf69

Yih-Dar authored May 24, 2024



* allow multi-gpu

* allow multi-gpu

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

acbfaf69

FIX / TST: Fix expected results on Mistral AWQ test (#30971) · ae87f979
Marc Sun authored May 24, 2024
```
fix awq mistral test
```
ae87f979
[tests] make `test_model_parallelism` device-agnostic (#30844) · 04c7c176
Fanli Lin authored May 24, 2024
```
* enable on xpu

* fix style

* add comment and mps
```
04c7c176

Perceiver interpolate position embedding (#30979) · 42d8dd87

Yixiang Gao authored May 24, 2024



* add test that currently fails

* test passed

* all perceiver passed

* fixup, style, quality, repo-consistency, all passed

* Apply suggestions from code review: default to False + compute sqrt once only
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix a minor bracket

* replace dim with self._num_channels

* add arguments to the rest preprocessors

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

42d8dd87

pin `uv==0.1.45` (#31006) · 5855afd1

Yih-Dar authored May 24, 2024



* fix

* [push-ci-image]

* run with latest

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

5855afd1

Do not trigger autoconversion if local_files_only (#31004) · 03935d30
Lucain authored May 24, 2024

03935d30

Fix training speed regression introduced by "optimize VRAM for calculating... · 21e259d8

Kevin Koehncke authored May 24, 2024

Fix training speed regression introduced by "optimize VRAM for calculating pos_bias in LayoutLM v2, v3 (#26139)" (#30988)

* Revert "optimize VRAM for calculating pos_bias in LayoutLM v2, v3 (#26139)"

This reverts commit a7e0ed82

.

* Instead of reverting commit, wrap indexing in torch.no_grad context

* Apply wrapping in LayoutLMv2

* Add comments explaining reason for no_grad

* Fix code format

---------
Co-authored-by: Kevin Koehncke <kevin.koehncke@uipath.com>

21e259d8

add prefix space ignored in llama #29625 (#30964) · 7f6e8741

Ita Zaporozhets authored May 24, 2024



* add prefix space ignored in llama #29625

* adding test with add_prefix_space=False

* ruff

---------
Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain>

7f6e8741

23 May, 2024 4 commits

Bugfix: WandbCallback uploads initial model checkpoint (#30897) · 6657fb5f

Matthias Gerstgrasser authored May 23, 2024

* fix wandb always uploading initial model

* Update comment.

* Optionally log initial model

* Revert "Optionally log initial model"

This reverts commit 9602cc1fad3feaf218f82a7339a194d3d2fbb946.

6657fb5f

Remove deprecated properties in tokenization_nllb.py and tokenization_nllb_fast.py (#29834) · 6d3d5b10

Yasmin Moslem authored May 23, 2024

* Fix typo in tokenization_nllb.py

Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability.

* Fix typo in tokenization_nllb_fast.py

Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability.

* Remove deprecated attributes in tokenization_nllb.py

Remove deprecated attributes: `lang_code_to_id`, `fairseq_tokens_to_ids`, `id_to_lang_code`, and `fairseq_ids_to_tokens`

* Remove deprecated attribute in tokenization_nllb_fast.py

Remove deprecated attribute `lang_code_to_id`

* Remove deprecated properties in tokenization_nllb.py

Remove deprecated properties - fix format

* Remove deprecated properties in tokenization_nllb_fast.py

Remove deprecated properties - fix format

* Update test_tokenization_nllb.py

* update test_tokenization_nllb.py

* Update tokenization_nllb.py

* Update test_tokenization_seamless_m4t.py

* Update test_tokenization_seamless_m4t.py

6d3d5b10

[Port] TensorFlow implementation of Mistral (#29708) · 965e98dc

Aritra Roy Gosthipaty authored May 23, 2024



* chore: initial commit

* chore: adding imports and inits

* chore: adding the causal and classification code

* chore: adding names to the layers

* chore: using single self attn layer

* chore: built the model and layers

* chore: start with testing

* chore: docstring change, transpose fix

* fix: rotary embedding

* chore: adding cache implementation

* remove unused torch

* chore: fixing the indexing issue

* make fix-copies

* Use modeling_tf_utils.keras

* make fixup

* chore: fixing tests

* chore: adding past key value logic

* chore: adding multi label classfication test

* fix: switching on the built parameters in the layers

* fixing repo consistency

* ruff formats

* style changes

* fix: tf and pt equivalence

* removing returns from docstrings

* fix docstrings

* fix docstrings

* removing todos

* fix copies

* fix docstring

* fix docstring

* chore: using easier rotate_half

* adding integration tests

* chore: addressing review related to rotary embedding layer

* review changes

* [run-slow] mistral

* skip: test save load after resize token embedding

* style

---------
Co-authored-by: Matt <rocketknight1@gmail.com>

965e98dc

Update 4 `MptIntegrationTests` expected outputs (#30989) · 2a89673f

Yih-Dar authored May 23, 2024



* fix

* fix

* fix

* fix

* fix

* [run-slow] mpt

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

2a89673f