Commits · 25245ec26dc29bcf6102e1b4ddd0dfd02e720cf5 · chenpangpang / transformers

07 Jun, 2024 6 commits

Rename test_model_common_attributes -> test_model_get_set_embeddings (#31321) · 25245ec2

amyeroberts authored Jun 07, 2024

* Rename to test_model_common_attributes
The method name is misleading - it is testing being able to get and set embeddings, not common attributes to all models

* Explicitly skip

25245ec2

interpolation added for TVP. (#30863) · 3b9174f2

BHUVAN M authored Jun 07, 2024

* Update TVP model to interpolate pre-trained image pad prompter encodings

* feat: Add 2D positional embeddings interpolation in TvpVisualInputEmbedding

* added required comments

* Update TVP model to interpolate pre-trained image pad prompter encodings

* feat: Add 2D positional embeddings interpolation in TvpVisualInputEmbedding

* added required comments

* docstring and argument fix

* doc fixes and test case fix suggested in review.

* varibale typo fix

* styling and name fixes for padding interpolation flag.

3b9174f2

Remove ConversationalPipeline and Conversation object (#31165) · 065729a6

Matt authored Jun 07, 2024

* Remove ConversationalPipeline and Conversation object, as they have been deprecated for some time and are due for removal

* Update not-doctested.txt

* Fix JA and ZH docs

* Fix JA and ZH docs some more

* Fix JA and ZH docs some more

065729a6

Implement JSON dump conversion for torch_dtype in TrainingArguments (#31224) · 60861fe1

조준래 authored Jun 07, 2024



* Implement JSON dump conversion for torch_dtype in TrainingArguments

* Add unit test for converting torch_dtype in TrainingArguments to JSON

* move unit test for converting torch_dtype into TrainerIntegrationTest class

* reformating using ruff

* convert dict_torch_dtype_to_str to private method _dict_torch_dtype_to_str

---------
Co-authored-by: jun.4 <jun.4@kakaobrain.com>

60861fe1

Extend save_pretrained to offloaded models (#27412) · ff689f57

Benjamin Badger authored Jun 07, 2024



* added hidden subset

* debugged hidden subset contrastive search

* added contrastive search compression

* debugged compressed contrastive search

* memory reduction for contrastive search

* debugged mem red

* added low memory option feature

* debugged mem optmimization output stack

* debugged mem optmimization output stack

* debugged low mem

* added low mem cache

* fixed 2047 tensor view

* debugged 2042 past key val inputs

* reformatted tensors

* changed low mem output

* final clean

* removed subset hidden csearch

* fixed hidden device

* fixed hidden device

* changed compressor dtype

* removed hstate compression

* integrated csearch in generate

* test csearch integration into generation

exit()

* fixed csearch kwarg integration with generation

* final wrap and added doc

* Update src/transformers/generation/utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* added debug print

* direct hstate cat

* direct hstate cat

* direct hstate cat debug

* direct hstate cat debug

* expanded full hidden state stack

* expanded full hidden state stack

* matched dims for hstates

* matched dims for hstates

* logits fix

* equality test

* equality hidden debug

* debug

* added prints for debug

* added prints for debug

* equality check

* switched squeeze dim

* input format debug

* tracing top_k_ids

* removed trace

* added test context

* added jitter

* added jitter

* added jitter

* returned state

* rebuilt past key value reconstruction

* debugged

* cleaned traces

* added selection for pkv

* changed output to dict

* cleaned

* cleaned

* cleaned up contrastive search test

* moved low_memory kwarg

* debugged

* changed low mem test batch size to 1

* removed output

* debugged test input shape

* reformatted csearch test

* added trace

* removed unsqueeze on final forward pass

* replaced unsqueeze with view

* removed traces

* cleaned

* debugged model kwargs

* removed special models from test

* ran make quality

* Update src/transformers/generation/configuration_utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/configuration_utils.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* refactored

* refactored

* refactored

* make fixup

* renamed flag sequential

* renamed flag sequential

* iterative onloading

* black style and test utils

* added traces for integrated test

* debugged

* added traces

* make style

* removed traces, make style

* included suggestions and added test

* debugged test

* added offload module check and make style

* is_accelerate_available and make style

* added test decorator

* changed test model and config spec

* added offload condition

* added lazy loading for each shard

* debugged

* modified sharding

* debugged

* added traces

* removed safe serialization

* no index overload;

* trace on safe save ptrs

* added ptr condition

* debugged

* debugged ptr

* moved module map init

* remake shard only for offloaded modules

* refactored

* debugged

* refactored

* debugged

* cleaned and make style

* cleaned and make style

* added trace

* sparse module map

* debugged

* removed module map conditional

* refactored

* debug

* debugged

* added traces

* added shard mem trace

* added shard mem trace

* removed underlying storage check

* refactored

* memory leak removal and make style

* cleaned

* swapped test decs and make style

* added mem checks and make style

* added free mem warning

* implemented some suggestions

* moved onloading to accelerate

* refactored for accelerate integration

* cleaned test

* make style

* debugged offload map name

* cleaned and make style

* replaced meta device check for sharding

* cleaned and make style

* implemented some suggestions

* more suggestions

* update warning
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* more suggestions

* make style

* new make style

* Update src/transformers/modeling_utils.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* Update src/transformers/modeling_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

ff689f57

Fix jetmoe model (#31279) · 8bcf9c8d
Cyril Vallez authored Jun 07, 2024
```
* Fix jetmoe model

* Remove skip-tests
```
8bcf9c8d

06 Jun, 2024 7 commits

Enable HF pretrained backbones (#31145) · bdf36dcd

amyeroberts authored Jun 06, 2024

* Enable load HF or tim backbone checkpoints

* Fix up

* Fix test - pass in proper out_indices

* Update docs

* Fix tvp tests

* Fix doc examples

* Fix doc examples

* Try to resolve DPT backbone param init

* Don't conditionally set to None

* Add condition based on whether backbone is defined

* Address review comments

bdf36dcd

Pipeline VQA: Add support for list of images and questions as pipeline input (#31217) · f9296249

Vu Huy Nguyen authored Jun 06, 2024

* Add list check for image and question

* Handle passing two lists and update docstring

* Add tests

* Add support for dataset

* Add test for dataset as input

* fixup

* fix unprotected import

* fix unprotected import

* fix import again

* fix param type

f9296249

Mark MobileNetV1ModelTest::test_batching_equivalence as flaky (#31258) · c53fcd83
amyeroberts authored Jun 06, 2024
```
* Mark MobileNetV1ModelTest::test_batching_equivalence as flaky

* Add link to issue

* woops
```
c53fcd83

Enable dynamic resolution input for Beit (#31053) · 68118397

Omar Salman authored Jun 06, 2024

* Initial attempt

* Updates: PR suggestions

* Interpolate the relative position bias when interpolate_pos_encoding is True

* Add slow tag for the added tests

* Add in DATA2VEC_VISION_INPUTS_DOCSTRING

68118397

fix accelerate tests for roberta xl (#31288) · 99895ae5
Marc Sun authored Jun 06, 2024
```
* fix accelerate tests for roberta xl

* style
```
99895ae5
Generation: fix handling of special tokens (#31254) · 5fabd1e8
Raushan Turganbay authored Jun 06, 2024
```
* fix special tokens in generatioon

* fix test

* add warning

* fix the check

* warn once

* fix
```
5fabd1e8
Make mamba use cache (#31116) · 7729b774
Raushan Turganbay authored Jun 06, 2024
```
* make mamba use cache

* uss cache naming as in mamba

* fix musicgen
```
7729b774

05 Jun, 2024 2 commits

Skip failing JetMOE generation tests (#31266) · 940fde8d
amyeroberts authored Jun 05, 2024
```
Skip failing tests for now
```
940fde8d

Add missing Flaubert tokenizer tests (#30492) · 464d986b

bastrob authored Jun 05, 2024

* add flaubert tokenization test, enrich inheritance in FlaubertTokenizer.

* fix quality code ci

* ensure parameter consistency

* fix ci

* fix copyright year and flatten vocab list.

* fix style

464d986b

04 Jun, 2024 7 commits

Fix `MistralIntegrationTest` (#31231) · fd3238b4

Yih-Dar authored Jun 04, 2024



* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

fd3238b4

Fix pipeline tests - torch imports (#31227) · 4ba66fdb
amyeroberts authored Jun 04, 2024
```
* Fix pipeline tests - torch imports

* Frameowrk dependant float conversion
```
4ba66fdb

fix bf16 issue in text classification pipeline (#30996) · 6b22a8f2

Chujie Zheng authored Jun 04, 2024

* fix logits dtype

* Add bf16/fp16 tests for text_classification pipeline

* Update test_pipelines_text_classification.py

* fix

* fix

6b22a8f2

Add dynamic resolution input/interpolate position embedding to deit (#31131) · de460e28

Kristen Pereira authored Jun 04, 2024



* Added interpolate pos encoding feature and test to deit

* Added interpolate pos encoding feature and test for deit TF model

* readded accidentally delted test for multi_gpu

* storing only patch_size instead of entire config and removed commented code

* Update modeling_tf_deit.py to remove extra line
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

de460e28

Video-LLaVa: handle any number of frames (#31221) · d64e4da7
Raushan Turganbay authored Jun 04, 2024
```
video-llava can handle more frames
```
d64e4da7
Fix sentence fragment within test comments (#31218) · e83cf581
DomHudson authored Jun 04, 2024

e83cf581

Pass device in Logits Processor's init (#29804) · 83238eee

Raushan Turganbay authored Jun 04, 2024



* add device in logits processor

* remove device when not needed

* codestyle

* tests

* forgot `melody` version

* Update src/transformers/models/whisper/generation_whisper.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* codestyle

* updates

---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

83238eee

03 Jun, 2024 6 commits

Fix GPU OOM for `mistral.py::Mask4DTestHard` (#31212) · 8a1a23ae

Yih-Dar authored Jun 03, 2024



* build

* build

* build

* build

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

8a1a23ae

fix the get_size_with_aspect_ratio in max_size situation (#30902) · 874ac129

Sangbum Daniel Choi authored Jun 04, 2024



* fix the get_size_with_aspect_ratio in max_size situation

* make fix-up

* add more general solution

* consider when max_size is not defined

* fix typo

* fix typo

* simple fix

* fix error

* fix if else error

* fix error of size overwrite

* fix yolos image processing

* fix detr image processing

* make

* add longest related test script

* Update src/transformers/models/yolos/image_processing_yolos.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add more test

* add test script about longest size

* remove deprecated

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

874ac129

Add Qwen2 GGUF loading support (#31175) · e4628434

Isotr0py authored Jun 03, 2024

* add qwen2 gguf support

* Update docs

* fix qwen2 tokenizer

* add qwen2 gguf test

* fix typo in qwen2 gguf test

* format code

* Remove mistral, clarify the error message

* format code

* add typing and update docstring

e4628434

Fix `test_compile_static_cache` (#30991) · df848acc

Yih-Dar authored Jun 03, 2024



* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

df848acc

Ignore non-causal mask in more cases with SDPA (#30138) · 221aaec6

fxmarty authored Jun 03, 2024

* update non-causal mask for sdpa

* add test

* update docstrings

* add one more test

* fix cross attention bug

* gentler atol/rtol

221aaec6

Token healing (#30081) · 39b2ff69

Ahmed Moubtahij authored Jun 03, 2024



* token healing impl + trie with extensions

* make fixup

* prefix-robust space tokenization

* examples readme and requirements

* make fixup

* allow input prompt and model

* redundant defaults

* Specialized Trie

* make fixup

* updated tests with new inherited Tree

* input ids to auto device_map

* rm unused import

* Update src/transformers/generation/utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* naming convention

* Revert "naming convention"

This reverts commit dd39d9c5b7a969e2d8a8d2a8e54f121b82dc44f0.

* naming convention

* last -hopefully- changes

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

39b2ff69

31 May, 2024 2 commits
- Add streaming, various fixes (#30838) · 9837a254
  Aymeric Roucher authored May 31, 2024
```
* Implement streaming run in ReAct agents
* Allow additional imports in code agents
* Python interpreter: support classes and exceptions, fixes
```
  9837a254
- Fix quantized cache output (#31143) · 48cada87
  Marc Sun authored May 31, 2024
  
  48cada87
30 May, 2024 2 commits
- fix get_scheduler when name is warmup_stable_decay (#31128) · cda9c82a
  zspo authored May 30, 2024
```
fix get_scheduler args
```
  cda9c82a
- FIX / Quantization: Add extra validation for bnb config (#31135) · 5e5c4d62
  Younes Belkada authored May 30, 2024
```
add validation for bnb config
```
  5e5c4d62
29 May, 2024 2 commits

Add on_optimizer_step to callback options (#31095) · 5c882535

Dhruv Pai authored May 29, 2024

* Modified test

* Added on_optimizer_step to callbacks

* Move callback after step is called

* Added on optimizer step callback

5c882535

Use `HF_HUB_OFFLINE` + fix has_file in offline mode (#31016) · c3044ec2

Lucain authored May 29, 2024

* Fix has_file in offline mode

* harmonize env variable for offline mode

* Switch to HF_HUB_OFFLINE

* fix test

* revert test_offline to test TRANSFORMERS_OFFLINE

* Add new offline test

* merge conflicts

* docs

c3044ec2

28 May, 2024 6 commits

Deprecate low use models (#30781) · a564d10a

amyeroberts authored May 28, 2024

* Deprecate models
- graphormer
- time_series_transformer
- xlm_prophetnet
- qdqbert
- nat
- ernie_m
- tvlt
- nezha
- mega
- jukebox
- vit_hybrid
- x_clip
- deta
- speech_to_text_2
- efficientformer
- realm
- gptsan_japanese

* Fix up

* Fix speech2text2 imports

* Make sure message isn't indented

* Fix docstrings

* Correctly map for deprecated models from model_type

* Uncomment out

* Add back time series transformer and x-clip

* Import fix and fix-up

* Fix up with updated ruff

a564d10a

TST: Fix instruct-blip tests (#31088) · 3264be41
Younes Belkada authored May 28, 2024
```
* fix flan t5 tests

* better format
```
3264be41
skip `test_multi_gpu_data_parallel_forward` for `vit` and `deit` (#31086) · 3af7bf30
Yih-Dar authored May 28, 2024
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
3af7bf30

Watermark: fix tests (#30961) · 779bc360

Raushan Turganbay authored May 28, 2024



* fix tests

* style

* Update tests/generation/test_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

779bc360

Fix failing tokenizer tests (#31083) · a3c7b59e
Lysandre Debut authored May 28, 2024
```
* Fix failing tokenizer tests

* Use small tokenizer

* Fix remaining reference
```
a3c7b59e
Fix OWLv2 post_process_object_detection for multiple images (#31082) · 98e2d48e
Pavel Iakubovskii authored May 28, 2024
```
* Add test for multiple images

* [run slow] owlv2

* Fix box rescaling

* [run slow] owlv2
```
98e2d48e