Commits · bd5091df8db7cea1a9f94f797fc11487f840ade1 · chenpangpang / transformers

05 Jun, 2024 7 commits

Reduce by 2 the memory requirement in `generate()`

(#30536) · bd5091df

Cyril Vallez authored Jun 05, 2024

* Fix contrastive_search for new cache structure, and improve performance by removing inneficient torch.stack(torch.split(x, top_k, dim=0))

* Fix _contrastive_search for non-standard cache using ellipsis slicing

* Fix all outputs.logits memory leaks for all decoding strategies!

* Fix small error in _contrastive_search()

* Make all necessary change and revert for the new class

* Apply coding style

* Remove pipes in type hints for compatibility

* correct type hint

* apply style

* Use DynamicCache by default and solve conflicts

* Fix rebase issues

* Add `_supports_dynamic_cache_class` in models for models that support DynamicCache but not other caches to make DynamicCache the default for more models

* Create generation config to return legacy format by default, or to choose not to

* style

* Fix case when use_cache is False

* Remove default DynamicCache in assiste_decoding if assistant_model does not support it + fix _seen_tokens when cropping cache

* Update prepare_inputs_for_generation() for case with empty DynamicCache

* Correct return of args in _assisted_decoding

* Remove EfficientDynamicCache as it is no longer needed

* Correct mistake in generation config

* Move cache logic of assisted decoding to AssistedCandidateGenerator.__init__

* change DynamicCache function names from "split" to "batch_split" for readability + apply coding style

* Remove `_supports_dynamic_cache_class` attribute after rebase

* Correct missing line lost in conflict resolution during rebasing

* Add special case for Jamba

* Fix jamba test

* Coding style

* coding style

* Correct missing import in rebasing

* Simplify _validate_model_kwargs based on removal of _supports_dynamic_cache attribute

* Simplify code paths in _contrastive_search

* coding style

* Update docstrings of cache methods

* Update prepare_inputs_for_generation() -> past_key_values are always Cache objects

bd5091df

Fix circular reference issue in CLIPTokenizerFast (#31075) · b72752f0
Dhaivat Bhatt authored Jun 05, 2024

b72752f0

Add missing Flaubert tokenizer tests (#30492) · 464d986b

bastrob authored Jun 05, 2024

* add flaubert tokenization test, enrich inheritance in FlaubertTokenizer.

* fix quality code ci

* ensure parameter consistency

* fix ci

* fix copyright year and flatten vocab list.

* fix style

464d986b

enable deterministic mode for npu (#31253) · 41cf4097
Huazhong Ji authored Jun 05, 2024

41cf4097

Early labels validation (#31240) · 54659048

amyeroberts authored Jun 05, 2024

* Move label validation checks - fail early

* Remove some formatting changes - add back labels change wav2vec2

54659048

Fixing `name 'torch' is not defined` in `bitsandbytes` integration (#31243) · 63fb253d
James Braza authored Jun 04, 2024
```
Fixed torch definition error
```
63fb253d

Specify dtype=torch.bool to avoid xla error (#31191) · 66875ac0

Yury Sulsky authored Jun 05, 2024

The StoppingCriteriaList allocates is_done without specifying dtype=torch.bool. On XLA this allocates a float tensor and causes a failure on the following line:

is_done = is_done | criteria(input_ids, scores, **kwargs)

by attempting to OR float with bool.

66875ac0

04 Jun, 2024 10 commits

Move out common backbone config param validation (#31144) · 99de3a84
amyeroberts authored Jun 04, 2024
```
* Move out common validation

* Add missing backbone config arguments
```
99de3a84
Blip: Deprecate `BlipModel` (#31235) · 485d913d
Younes Belkada authored Jun 04, 2024
```
* deprecate blip

* mention deprecation on docs
```
485d913d
add no split modules for xlmrobertaxl (#31223) · 2965b204
Manuel Faysse authored Jun 04, 2024

2965b204

Add new line switch before logging ***** Running {description} ***** (#31225) · 821b772a

Jacklanda authored Jun 04, 2024

✨

 Add new line switch before logging "***** Running {description} *****".
Signed-off-by: jacklanda <yonyonlau@gmail.com>

821b772a

Fix pipeline tests - torch imports (#31227) · 4ba66fdb
amyeroberts authored Jun 04, 2024
```
* Fix pipeline tests - torch imports

* Frameowrk dependant float conversion
```
4ba66fdb

fix bf16 issue in text classification pipeline (#30996) · 6b22a8f2

Chujie Zheng authored Jun 04, 2024

* fix logits dtype

* Add bf16/fp16 tests for text_classification pipeline

* Update test_pipelines_text_classification.py

* fix

* fix

6b22a8f2

Add dynamic resolution input/interpolate position embedding to deit (#31131) · de460e28

Kristen Pereira authored Jun 04, 2024



* Added interpolate pos encoding feature and test to deit

* Added interpolate pos encoding feature and test for deit TF model

* readded accidentally delted test for multi_gpu

* storing only patch_size instead of entire config and removed commented code

* Update modeling_tf_deit.py to remove extra line
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

de460e28

Video-LLaVa: handle any number of frames (#31221) · d64e4da7
Raushan Turganbay authored Jun 04, 2024
```
video-llava can handle more frames
```
d64e4da7

fix(PatchTST): Wrong dropout used for PretainHead (#31117) · 36ade4a3

Max Strobel authored Jun 04, 2024



* fix(PatchTST): Wrong dropout used for PretainHead

* feat(PatchTST): remove unused config.dropout

---------
Co-authored-by: Strobel Maximilian (IFAG PSS SIS SCE ACM) <Maximilian.Strobel@infineon.com>

36ade4a3

Pass device in Logits Processor's init (#29804) · 83238eee

Raushan Turganbay authored Jun 04, 2024



* add device in logits processor

* remove device when not needed

* codestyle

* tests

* forgot `melody` version

* Update src/transformers/models/whisper/generation_whisper.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* codestyle

* updates

---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

83238eee

03 Jun, 2024 13 commits

Set greater_is_better to False if metric_for_best_model ends with "loss" (#31142) · df5abae8
miivanov90 authored Jun 03, 2024
```
* update to not(endswith(loss))

* ruff formatting
```
df5abae8
Cohere: Fix copied from (#31213) · 924c46d4
Younes Belkada authored Jun 03, 2024
```
Update modeling_cohere.py
```
924c46d4
Rename sanity_evaluation to eval_on_start (#31192) · c6c78733
Qubitium authored Jun 03, 2024
```
* Rename sanity_evaluation to eval_on_start

* move arg back to last
```
c6c78733
Fix typo in utils (#31169) · c230504b
Bojun Feng authored Jun 03, 2024
```
fix typo
```
c230504b

fix the get_size_with_aspect_ratio in max_size situation (#30902) · 874ac129

Sangbum Daniel Choi authored Jun 04, 2024



* fix the get_size_with_aspect_ratio in max_size situation

* make fix-up

* add more general solution

* consider when max_size is not defined

* fix typo

* fix typo

* simple fix

* fix error

* fix if else error

* fix error of size overwrite

* fix yolos image processing

* fix detr image processing

* make

* add longest related test script

* Update src/transformers/models/yolos/image_processing_yolos.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* add more test

* add test script about longest size

* remove deprecated

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

874ac129

Add Qwen2 GGUF loading support (#31175) · e4628434

Isotr0py authored Jun 03, 2024

* add qwen2 gguf support

* Update docs

* fix qwen2 tokenizer

* add qwen2 gguf test

* fix typo in qwen2 gguf test

* format code

* Remove mistral, clarify the error message

* format code

* add typing and update docstring

e4628434

🚨 [Mistral and friends] Update MLP (#31057) · 70c87138
NielsRogge authored Jun 03, 2024
```
Update MLP
```
70c87138
SlidingWindowCache: reduce differences to other Cache classes (#30970) · d475f767
Joao Gante authored Jun 03, 2024
```
* tmp commit

* sliding window with fewer differences

* make fixup + rebase

* missing overwrite
```
d475f767

Ignore non-causal mask in more cases with SDPA (#30138) · 221aaec6

fxmarty authored Jun 03, 2024

* update non-causal mask for sdpa

* add test

* update docstrings

* add one more test

* fix cross attention bug

* gentler atol/rtol

221aaec6

Fix Cannot convert [array()] to EagerTensor of dtype int64 (#31109) · f4f69625

Pavithra Devi M authored Jun 03, 2024

While running the model.prepare_tf_dataset() method,
it raises the error below:
```
TypeError: Cannot convert [array([322.,   1.])] to EagerTensor of dtype int64
```

This happens, in  "DataCollatorForSeq2Seq" function when we are try
to convert the labels to tensors. While converting the labels to tensors,
the labels can be in the format of list of list or list of ndarrays.
There is no problem converting the list of list lables. There is a problem
when the list of ndarrays are float values(like below).

```
[array([322.,   1.])]
```

so the exception raises while trying to convert this label to tensors using
below code.

```
batch["labels"] = tf.constant(batch["labels"], dtype=tf.int64)
```

The labels are always integer values, so this got converted to float
values in the label padding operation below.
```
batch["labels"] = [
                    call(label)
                    if padding_side == "right"
                    else np.concatenate([[self.label_pad_token_id] * (max_label_length - len(label)), label])
                    for label in labels
                    ]
```
Here we have 2 cases:
1 - Concatenating an array having integer padding token value with labels.
2 - Concatenating an empty array with labels.

----------------------------------------------------------------------------------------
case 1: Concatenating an array having integer padding token value with labels.
WORKS EXPECTED:
----------------------------------------------------------------------------------------
```
label = np.array([233, 1])
max_label_length = 4
label_pad_token_id = -100
np.concatenate([[label_pad_token_id] * (max_label_length - len(label)), label])
o/p:
array([-100, -100,  233,    1])
```

----------------------------------------------------------------------------------------
Case 2: Concatenating an empty array with labels.
GIVES THE ISSUE:
This scenorio can happen when the label has the maximum label length -- No padding needed.
----------------------------------------------------------------------------------------
```
label = np.array([233, 1])
max_label_length = 2
label_pad_token_id = -100
np.concatenate([[label_pad_token_id] * (max_label_length - len(label)), label])
o/p:
array([233.,   1.])
```

----------------------------------------------------------------------------------------
Solution:
----------------------------------------------------------------------------------------
We need to concatenate a ndarray of dtype int with labels.

AFTER FIX:
----------
case 1:
```

label = np.array([233, 1])
max_label_length = 4
label_pad_token_id = -100
np.concatenate([np.array([label_pad_token_id] * (max_label_length - len(label)), dtype=np.int64),label])

o/p:
array([-100, -100,  233,    1])
```

case 2:
```

label = np.array([233, 1])
max_label_length = 2
label_pad_token_id = -100
np.concatenate([np.array([label_pad_token_id] * (max_label_length - len(label)), dtype=np.int64),label])

o/p:
array([233,   1])
```

f4f69625

[`GemmaModel`] fix small typo (#31202) · 1749841a
Arthur authored Jun 03, 2024
```
* fixes

* fix-copies
```
1749841a

Token healing (#30081) · 39b2ff69

Ahmed Moubtahij authored Jun 03, 2024



* token healing impl + trie with extensions

* make fixup

* prefix-robust space tokenization

* examples readme and requirements

* make fixup

* allow input prompt and model

* redundant defaults

* Specialized Trie

* make fixup

* updated tests with new inherited Tree

* input ids to auto device_map

* rm unused import

* Update src/transformers/generation/utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* naming convention

* Revert "naming convention"

This reverts commit dd39d9c5b7a969e2d8a8d2a8e54f121b82dc44f0.

* naming convention

* last -hopefully- changes

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

39b2ff69

Remove copied froms for deprecated models (#31153) · 5b5b48b1
amyeroberts authored Jun 03, 2024
```
* Remove copied froms for deprecated models

* Remove automatically in script
```
5b5b48b1

31 May, 2024 6 commits

Diff converter v2 (#30868) · 96eb0628

Arthur authored May 31, 2024

* current working example!

* commit regex and result file

* update

* nit

* push the conversion file

* oups

* roadmap and nits

* attempt diffs for 3 files

* persimmon

* nit

* add diff file that is the same as the modeling_llama.py

* fix rope nits

* updates

* updates with converted versions

* give some breathing space to the code

* delete

* update

* update

* push the actual result

* update regex patterns

* update regex patterns

* fix some issues

* fix some issues

* fix some issues

* updates

* updates

* updates

* updates

* updates

* revert changes done to llama

* updates

* update gemma

* updates

* oups

* current state

* current state

* update

* ouiiii

* nit

* clear diffs

* nit

* fixup

* update

* doc 🚀

* 🔥

* for now use gemma

* deal with comments

* style

* handle funtions

* deal with assigns

* todos

* process inheritage

* keep decorators?

* 🤗

* deal with duplicates

* fixup

* correctly remove duplicate code

* run ruff post script

* ruff deals pretty well with imports, let's leave it to him

* ah maybe not lol

* for now remove all imports from child.

* nit

* conversion of llama

* okay

* convert starcoder2

* synch with main

* update llama diff

* updates

* https://docs.astral.sh/ruff/rules/redefined-while-unused/

 fixes the imports, bit needs later version of ruff

* updates

* okay actual state

* non zero exit

* update!

* revert unrelated

* remove other diff files

* updates

* cleanup

* update

* less diff!

* stash

* current updates

* updates

* No need for call

* finished fining deps

* update

* current changes

* current state

* current state

* new status

* nit

* finally

* fixes

* nits

* order is now expected

* use logger info instead of prints

* fixup

* up

* nit

* update

* nits

* update

* correct merge

* update

* update

* update

* add warning

* update caution message

* update

* better merging strategy

* copy class statements :wink

* fixups

* nits

* update

* Apply suggestions from code review
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* nits

* smaller header

* do cleanup some stuff

* even simpler header?

* fixup

* updates

* ruff

* update examples

* nit

* TODO

* state

* OUUUUUUF

* current state

* nits

* final state

* add a readme

* fixup

* remove diff llama

* fix

* nit

* dummy noy funny

* ruff format tests src utils --check

* everless diffs

* less diffs and fix test

* fixes

* naming nit?

* update converter and add supper example

* nits

* updated for function signatures

* update

* update

* add converted dummies

* autoformat

* single target assign fix

* fixup

* fix some imports

* fixes

* don't push them

* `# noqa: F841`

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

96eb0628

Added description of quantization_config (#31133) · 372baec2

Vallepu Vamsi Krishna authored May 31, 2024

* Description of quantization_config

Added missing description about quantization_config in replace_with_bnb_linear for better readability.

* Removed trailing spaces

372baec2

Instance segmentation examples (#31084) · cdc81311

Pavel Iakubovskii authored May 31, 2024



* Initial setup

* Metrics

* Overfit on two batches

* Train 40 epochs

* Memory leak debugging

* Trainer fine-tuning

* Draft

* Fixup

* Trained end-to-end

* Add requirements

* Rewrite evaluator

* nits

* Add readme

* Add instance-segmentation to the table

* Support void masks

* Remove sh

* Update docs

* Add pytorch test

* Add accelerate test

* Update examples/pytorch/instance-segmentation/README.md

* Update examples/pytorch/instance-segmentation/run_instance_segmentation.py

* Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py

* Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py

* Update examples/pytorch/instance-segmentation/run_instance_segmentation.py

* Fix consistency oneformer

* Fix imports

* Fix imports sort

* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>

* Update examples/pytorch/instance-segmentation/run_instance_segmentation.py
Co-authored-by: Sangbum Daniel Choi <34004152+SangbumChoi@users.noreply.github.com>

* Add resources to docs

* Update examples/pytorch/instance-segmentation/README.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update examples/pytorch/instance-segmentation/README.md
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Remove explicit model_type argument

* Fix tests

* Update readme

* Note about other models

---------
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>
Co-authored-by: Sangbum Daniel Choi <34004152+SangbumChoi@users.noreply.github.com>
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

cdc81311

Add streaming, various fixes (#30838) · 9837a254

Aymeric Roucher authored May 31, 2024

* Implement streaming run in ReAct agents
* Allow additional imports in code agents
* Python interpreter: support classes and exceptions, fixes

9837a254

[trainer] add sanity evaluation option (#31146) · f8e6ba45

Marc Sun authored May 31, 2024



* add sanity evaluation

* fix

* Apply suggestions from code review
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

* fix

---------
Co-authored-by: Zach Mueller <muellerzr@gmail.com>

f8e6ba45

Quantization: Enhance bnb error message (#31160) · fc5d3e11
Younes Belkada authored May 31, 2024
```
enhance error message
```
fc5d3e11

30 May, 2024 2 commits
- fix get_scheduler when name is warmup_stable_decay (#31128) · cda9c82a
  zspo authored May 30, 2024
```
fix get_scheduler args
```
  cda9c82a
- FIX / Quantization: Add extra validation for bnb config (#31135) · 5e5c4d62
  Younes Belkada authored May 30, 2024
```
add validation for bnb config
```
  5e5c4d62
29 May, 2024 2 commits
- Add on_optimizer_step to callback options (#31095) · 5c882535
  Dhruv Pai authored May 29, 2024
```
* Modified test

* Added on_optimizer_step to callbacks

* Move callback after step is called

* Added on optimizer step callback
```
  5c882535
- Fix faulty rstrip in module loading (#31108) · 0bef4a27
  Matt authored May 29, 2024
  
  0bef4a27