Commits · f4f696255f318346f3cd660f984c48de531ed7e1 · chenpangpang / transformers

03 Jun, 2024 1 commit

Ahmed Moubtahij authored Jun 03, 2024



* token healing impl + trie with extensions

* make fixup

* prefix-robust space tokenization

* examples readme and requirements

* make fixup

* allow input prompt and model

* redundant defaults

* Specialized Trie

* make fixup

* updated tests with new inherited Tree

* input ids to auto device_map

* rm unused import

* Update src/transformers/generation/utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* naming convention

* Revert "naming convention"

This reverts commit dd39d9c5b7a969e2d8a8d2a8e54f121b82dc44f0.

* naming convention

* last -hopefully- changes

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

39b2ff69

31 May, 2024 2 commits
- Add streaming, various fixes (#30838) · 9837a254
  Aymeric Roucher authored May 31, 2024
```
* Implement streaming run in ReAct agents
* Allow additional imports in code agents
* Python interpreter: support classes and exceptions, fixes
```
- Fix quantized cache output (#31143) · 48cada87
  Marc Sun authored May 31, 2024
  
30 May, 2024 2 commits
- fix get_scheduler when name is warmup_stable_decay (#31128) · cda9c82a
  zspo authored May 30, 2024
```
fix get_scheduler args
```
- FIX / Quantization: Add extra validation for bnb config (#31135) · 5e5c4d62
  Younes Belkada authored May 30, 2024
```
add validation for bnb config
```
29 May, 2024 2 commits

Add on_optimizer_step to callback options (#31095) · 5c882535

Dhruv Pai authored May 29, 2024

* Modified test

* Added on_optimizer_step to callbacks

* Move callback after step is called

* Added on optimizer step callback


Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016) · c3044ec2

Lucain authored May 29, 2024

* Fix has_file in offline mode

* harmonize env variable for offline mode

* Switch to HF_HUB_OFFLINE

* fix test

* revert test_offline to test TRANSFORMERS_OFFLINE

* Add new offline test

* merge conflicts

* docs


28 May, 2024 8 commits

Deprecate low use models (#30781) · a564d10a

amyeroberts authored May 28, 2024

* Deprecate models
- graphormer
- time_series_transformer
- xlm_prophetnet
- qdqbert
- nat
- ernie_m
- tvlt
- nezha
- mega
- jukebox
- vit_hybrid
- x_clip
- deta
- speech_to_text_2
- efficientformer
- realm
- gptsan_japanese

* Fix up

* Fix speech2text2 imports

* Make sure message isn't indented

* Fix docstrings

* Correctly map for deprecated models from model_type

* Uncomment out

* Add back time series transformer and x-clip

* Import fix and fix-up

* Fix up with updated ruff


TST: Fix instruct-blip tests (#31088) · 3264be41
Younes Belkada authored May 28, 2024
```
* fix flan t5 tests

* better format
```
skip `test_multi_gpu_data_parallel_forward` for `vit` and `deit` (#31086) · 3af7bf30
Yih-Dar authored May 28, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```

Watermark: fix tests (#30961) · 779bc360

Raushan Turganbay authored May 28, 2024



* fix tests

* style

* Update tests/generation/test_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>


Fix failing tokenizer tests (#31083) · a3c7b59e
Lysandre Debut authored May 28, 2024
```
* Fix failing tokenizer tests

* Use small tokenizer

* Fix remaining reference
```
Fix OWLv2 post_process_object_detection for multiple images (#31082) · 98e2d48e
Pavel Iakubovskii authored May 28, 2024
```
* Add test for multiple images

* [run slow] owlv2

* Fix box rescaling

* [run slow] owlv2
```

fix from_pretrained in offline mode when model is preloaded in cache (#31010) · 936ab7ba

oOraph authored May 28, 2024



* Unit test to verify fix
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

* fix from_pretrained in offline mode when model is preloaded in cache
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

* minor: fmt
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>

---------
Signed-off-by: Raphael Glon <oOraph@users.noreply.github.com>
Co-authored-by: Raphael Glon <oOraph@users.noreply.github.com>


Remove `ninja` from docker image build (#31080) · 8e3b1fef
Yih-Dar authored May 28, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```

27 May, 2024 3 commits
- skip `test_model_parallelism` for 2 model test classes (#31067) · 9d35edbb
  Yih-Dar authored May 27, 2024
```
skip
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
- Fix pad_to_max_length Whisper (#30787) · d355741e
  Yoach Lacombe authored May 27, 2024
```
* fix pad_to_max_length Whisper

* add tests

* make style
```
- Fix quanto tests (#31062) · b84cd675
  Marc Sun authored May 27, 2024
```
fix quanto tests
```
24 May, 2024 8 commits

Add split special tokens (#30772) · deba7655

Ita Zaporozhets authored May 24, 2024



* seems like `split_special_tokens` is used here

* split special token

* add new line at end of file

* moving split special token test to common tests

* added assertions

* test

* fixup

* add co-author

* passing rest of args to gptsan_japanese, fixing tests

* removing direct comparison of fast and slow models

* adding test support for UDOP and LayoutXLM

* ruff fix

* readd check if slow tokenizer

* modify test to handle bos tokens

* removing commented function

* trigger build

* applying review feedback - updated docstrings, var names, and simplified tests

* ruff fixes

* Update tests/test_tokenization_common.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* applying feedback, comments

* shutil temp directory fix

---------
Co-authored-by: Arthur Zucker <arthur.zucker@gmail.com>
Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain>
Co-authored-by: itazap <itazap@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MacBook-Pro.local>


added interpolation for vitmae model in pytorch as well as tf. (#30732) · e5103a76

BHUVAN M authored May 24, 2024



* added interpolation for vitmae model in pytorch as well as tf.

* Update modeling_vit_mae.py

irregular import fixed

* small changes and proper formatting

* changes suggested in review.

* modified decoder interpolate_func

* arguments and docstring fix

* Apply suggestions from code review

doc fixes
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>


Quantization / TST: Fix remaining quantization tests (#31000) · 658b849a
Younes Belkada authored May 24, 2024
```
* Fix remaining quant tests

* Update test_quanto.py
```
Fix resume_download future warning (#31007) · fd3c1280
Lucain authored May 24, 2024
```
* Fix resume_download future warning

* better like this

* Add regression test
```
FIX / TST: Fix expected results on Mistral AWQ test (#30971) · ae87f979
Marc Sun authored May 24, 2024
```
fix awq mistral test
```
[tests] make `test_model_parallelism` device-agnostic (#30844) · 04c7c176
Fanli Lin authored May 24, 2024
```
* enable on xpu

* fix style

* add comment and mps
```

Perceiver interpolate position embedding (#30979) · 42d8dd87

Yixiang Gao authored May 24, 2024



* add test that currently fails

* test passed

* all perceiver passed

* fixup, style, quality, repo-consistency, all passed

* Apply suggestions from code review: default to False + compute sqrt once only
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* fix a minor bracket

* replace dim with self._num_channels

* add arguments to the rest preprocessors

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>


add prefix space ignored in llama #29625 (#30964) · 7f6e8741

Ita Zaporozhets authored May 24, 2024



* add prefix space ignored in llama #29625

* adding test with add_prefix_space=False

* ruff

---------
Co-authored-by: Ita Zaporozhets <itazaporozhets@Itas-MBP.localdomain>


23 May, 2024 8 commits

Remove deprecated properties in tokenization_nllb.py and tokenization_nllb_fast.py (#29834) · 6d3d5b10

Yasmin Moslem authored May 23, 2024

* Fix typo in tokenization_nllb.py

Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability.

* Fix typo in tokenization_nllb_fast.py

Change `adder_tokens_decoder` into `added_tokens_decoder` and improve the warning's readability.

* Remove deprecated attributes in tokenization_nllb.py

Remove deprecated attributes: `lang_code_to_id`, `fairseq_tokens_to_ids`, `id_to_lang_code`, and `fairseq_ids_to_tokens`

* Remove deprecated attribute in tokenization_nllb_fast.py

Remove deprecated attribute `lang_code_to_id`

* Remove deprecated properties in tokenization_nllb.py

Remove deprecated properties - fix format

* Remove deprecated properties in tokenization_nllb_fast.py

Remove deprecated properties - fix format

* Update test_tokenization_nllb.py

* update test_tokenization_nllb.py

* Update tokenization_nllb.py

* Update test_tokenization_seamless_m4t.py

* Update test_tokenization_seamless_m4t.py


[Port] TensorFlow implementation of Mistral (#29708) · 965e98dc

Aritra Roy Gosthipaty authored May 23, 2024



* chore: initial commit

* chore: adding imports and inits

* chore: adding the causal and classification code

* chore: adding names to the layers

* chore: using single self attn layer

* chore: built the model and layers

* chore: start with testing

* chore: docstring change, transpose fix

* fix: rotary embedding

* chore: adding cache implementation

* remove unused torch

* chore: fixing the indexing issue

* make fix-copies

* Use modeling_tf_utils.keras

* make fixup

* chore: fixing tests

* chore: adding past key value logic

* chore: adding multi label classification test

* fix: switching on the built parameters in the layers

* fixing repo consistency

* ruff formats

* style changes

* fix: tf and pt equivalence

* removing returns from docstrings

* fix docstrings

* fix docstrings

* removing todos

* fix copies

* fix docstring

* fix docstring

* chore: using easier rotate_half

* adding integration tests

* chore: addressing review related to rotary embedding layer

* review changes

* [run-slow] mistral

* skip: test save load after resize token embedding

* style

---------
Co-authored-by: Matt <rocketknight1@gmail.com>


Update 4 `MptIntegrationTests` expected outputs (#30989) · 2a89673f

Yih-Dar authored May 23, 2024



* fix

* fix

* fix

* fix

* fix

* [run-slow] mpt

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>


[tests] add `torch.use_deterministic_algorithms` for XPU (#30774) · 21339a52
Fanli Lin authored May 23, 2024
```
* add xpu check

* add marker

* add documentation

* update doc

* fix ci

* remove from global init

* fix
```

Fix accelerate failing tests (#30836) · 8366b572

Marc Sun authored May 23, 2024

* Fix accelerate tests

* fix clip

* skip dbrx tests

* fix GPTSan

* fix M2M100Model

* same fix as jamba

* fix mt5

* Fix T5Model

* Fix umt5 model

* fix switch_transformers

* fix whisper

* fix gptsan again

* fix siglip recent test

* skip siglip tests

* wrong place fixed


test_custom_4d_attention_mask skip with sliding window attn (#30833) · 6739e1d2
Poedator authored May 23, 2024


Quantized KV Cache (#30483) · d583f131

Raushan Turganbay authored May 23, 2024



* clean-up

* Update src/transformers/cache_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/cache_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/cache_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup

* Update tests/quantization/quanto_integration/test_quanto.py
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>

* Update src/transformers/generation/configuration_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* more suggestions

* mapping if torch available

* run tests & add 'support_quantized' flag

* fix jamba test

* revert, will be fixed by another PR

* codestyle

* HQQ and versatile cache classes

* final update

* typo

* make tests happy

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>


Using assistant in AutomaticSpeechRecognitionPipeline with different encoder size (#30637) · eb1a77bb

Kamil Akesbi authored May 23, 2024



* fix input to generate in pipeline

* fixup

* pass input_features to generate with assistant

* error if model and assistant with different enc size

* fix

* apply review suggestions

* use self.config.is_encoder_decoder

* pass inputs to generate directly

* add slow tests

* Update src/transformers/generation/utils.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_automatic_speech_recognition.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_automatic_speech_recognition.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_automatic_speech_recognition.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_automatic_speech_recognition.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_automatic_speech_recognition.py
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* apply review

* Update src/transformers/generation/utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update tests/pipelines/test_pipelines_automatic_speech_recognition.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* apply code review

* update attributes encoder_xyz to check

* Update src/transformers/generation/utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* add slow test

* solve conflicts

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>


22 May, 2024 6 commits

Paligemma causal attention mask (#30967) · a25f7d3c

Pablo Montalvo authored May 22, 2024



* PaliGemma working causal attention

* Formatting

* Style

* Docstrings + remove commented code

* Update docstring for PaliGemma Config

* PaliGemma - add separator ind to model/labels

* Refactor + docstring paligemma processor method

* Style

* return token type ids when tokenizing labels

* use token type ids when building causal mask

* add token type ids to tester

* remove separator from config

* fix style

* don't ignore separator

* add processor documentation

* simplify tokenization

* fix causal mask

* style

* fix label propagation, revert suffix naming

* fix style

* fix labels tokenization

* [run-slow]paligemma

* add eos if suffixes are present

* [run-slow]paligemma

* [run-slow]paligemma

* add missing tokens to fast version

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix style

* [run-slow]paligemma

---------
Co-authored-by: Peter Robicheaux <peter@roboflow.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>


[Whisper] Strip prompt before finding common subsequence (#27836) · 0948c827
Sanchit Gandhi authored May 22, 2024

Generation: get special tokens from model config (#30899) · b1065aa0
Raushan Turganbay authored May 22, 2024
```
* fix

* let's do this way?

* codestyle

* update

* add tests
```

🚨 out_indices always a list (#30941) · dff54ad2

amyeroberts authored May 22, 2024

* out_indices always a list

* Update src/transformers/utils/backbone_utils.py

* Update src/transformers/utils/backbone_utils.py

* Move type casting

* nit


Paligemma - fix slow tests, add bf16 and f16 slow tests (#30851) · 250ae9f7

Pablo Montalvo authored May 22, 2024

* fix slow tests, add bf16 and f16 slow tests

* few fixes

* [run-slow]paligemma

* add gate decorator

* [run-slow]paligemma

* add missing gating

* [run-slow]paligemma

* [run-slow]paligemma


Avoid extra chunk in speech recognition (#29539) · 15185084
Jonatan Kłosko authored May 22, 2024
