- 03 Jun, 2024 10 commits
-
-
Isotr0py authored
* add qwen2 gguf support * Update docs * fix qwen2 tokenizer * add qwen2 gguf test * fix typo in qwen2 gguf test * format code * Remove mistral, clarify the error message * format code * add typing and update docstring
-
Yih-Dar authored
* fix * fix * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
NielsRogge authored
Update MLP
-
Joao Gante authored
* tmp commit * sliding window with fewer differences * make fixup + rebase * missing overwrite
-
fxmarty authored
* update non-causal mask for sdpa * add test * update docstrings * add one more test * fix cross attention bug * gentler atol/rtol
-
Pavithra Devi M authored
While running the `model.prepare_tf_dataset()` method, the error below is raised:
```
TypeError: Cannot convert [array([322., 1.])] to EagerTensor of dtype int64
```
This happens in `DataCollatorForSeq2Seq` when we try to convert the labels to tensors. At that point the labels can be either a list of lists or a list of ndarrays. Converting a list of lists causes no problem; the problem arises when a list of ndarrays holds float values, like below:
```
[array([322., 1.])]
```
The exception is then raised while converting this label to a tensor with the code below:
```
batch["labels"] = tf.constant(batch["labels"], dtype=tf.int64)
```
The labels are always integer values, so they must have been converted to float values in the label padding operation below:
```
batch["labels"] = [
    call(label)
    if padding_side == "right"
    else np.concatenate([[self.label_pad_token_id] * (max_label_length - len(label)), label])
    for label in labels
]
```
Here we have 2 cases:
1. Concatenating an array holding an integer padding token value with the labels.
2. Concatenating an empty array with the labels.

Case 1: concatenating an array holding an integer padding token value with the labels. WORKS AS EXPECTED:
```
label = np.array([233, 1])
max_label_length = 4
label_pad_token_id = -100
np.concatenate([[label_pad_token_id] * (max_label_length - len(label)), label])
o/p: array([-100, -100, 233, 1])
```
Case 2: concatenating an empty array with the labels. GIVES THE ISSUE. This scenario happens when a label already has the maximum label length, so no padding is needed:
```
label = np.array([233, 1])
max_label_length = 2
label_pad_token_id = -100
np.concatenate([[label_pad_token_id] * (max_label_length - len(label)), label])
o/p: array([233., 1.])
```
Solution: we need to concatenate an ndarray of integer dtype with the labels.

AFTER THE FIX, case 1:
```
label = np.array([233, 1])
max_label_length = 4
label_pad_token_id = -100
np.concatenate([np.array([label_pad_token_id] * (max_label_length - len(label)), dtype=np.int64), label])
o/p: array([-100, -100, 233, 1])
```
case 2:
```
label = np.array([233, 1])
max_label_length = 2
label_pad_token_id = -100
np.concatenate([np.array([label_pad_token_id] * (max_label_length - len(label)), dtype=np.int64), label])
o/p: array([233, 1])
```
-
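The dtype promotion described in the commit above can be reproduced in isolation. A minimal sketch in plain NumPy (independent of the collator code), showing the empty-padding case before and after the fix:

```python
import numpy as np

label = np.array([233, 1])
label_pad_token_id = -100

# Before the fix: when no padding is needed the pad list is empty,
# np.asarray([]) defaults to float64, and concatenate promotes the
# int64 labels to float64 as a result.
buggy = np.concatenate([[label_pad_token_id] * 0, label])
print(buggy.dtype)  # float64

# After the fix: build the padding as an int64 ndarray explicitly,
# so the result stays int64 even when the padding is empty.
fixed = np.concatenate(
    [np.array([label_pad_token_id] * 0, dtype=np.int64), label]
)
print(fixed.dtype)  # int64
```

The later `tf.constant(batch["labels"], dtype=tf.int64)` call then no longer sees float arrays.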
Arthur authored
* fixes * fix-copies
-
Ahmed Moubtahij authored
* token healing impl + trie with extensions * make fixup * prefix-robust space tokenization * examples readme and requirements * make fixup * allow input prompt and model * redundant defaults * Specialized Trie * make fixup * updated tests with new inherited Tree * input ids to auto device_map * rm unused import * Update src/transformers/generation/utils.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * naming convention * Revert "naming convention" This reverts commit dd39d9c5b7a969e2d8a8d2a8e54f121b82dc44f0. * naming convention * last -hopefully- changes --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
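Token healing, as referenced in the commits above, trims the prompt's trailing token and constrains the next generation step to vocabulary entries that extend the removed text. The lookup side of that relies on a prefix trie over the vocabulary. A minimal illustrative sketch of such a trie (hypothetical class, not the actual implementation in `src/transformers/generation/`):

```python
class Trie:
    """Character trie supporting prefix-extension lookup over a vocabulary."""

    def __init__(self, words=()):
        self.root = {}
        for word in words:
            self.add(word)

    def add(self, word):
        node = self.root
        for ch in word:
            node = node.setdefault(ch, {})
        node["__end__"] = True  # sentinel marking a complete vocabulary entry

    def extensions(self, prefix):
        """Return all stored words that start with `prefix`."""
        node = self.root
        for ch in prefix:
            if ch not in node:
                return []
            node = node[ch]
        # Depth-first walk collecting complete words under this node.
        results, stack = [], [(node, prefix)]
        while stack:
            current, text = stack.pop()
            for key, child in current.items():
                if key == "__end__":
                    results.append(text)
                else:
                    stack.append((child, text + key))
        return results


vocab_trie = Trie(["▁the", "▁then", "▁there", "▁cat"])
print(sorted(vocab_trie.extensions("▁the")))  # ['▁the', '▁then', '▁there']
```

The candidate set returned by `extensions` is what the healing step would use to restrict sampling after the trailing token is removed.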
-
amyeroberts authored
* Remove copied froms for deprecated models * Remove automatically in script
-
CharlesCNorton authored
Corrected a typo in security.md. Changed `use_safetenstors` to `use_safetensors` in the section discussing the usage of safe formats for loading models to prevent arbitrary code execution.
-
- 31 May, 2024 10 commits
-
-
Arthur authored
* current working example! * commit regex and result file * update * nit * push the conversion file * oups * roadmap and nits * attempt diffs for 3 files * persimmon * nit * add diff file that is the same as the modeling_llama.py * fix rope nits * updates * updates with converted versions * give some breathing space to the code * delete * update * update * push the actual result * update regex patterns * update regex patterns * fix some issues * fix some issues * fix some issues * updates * updates * updates * updates * updates * revert changes done to llama * updates * update gemma * updates * oups * current state * current state * update * ouiiii * nit * clear diffs * nit * fixup * update * doc
🚀 * 🔥 * for now use gemma * deal with comments * style * handle functions * deal with assigns * todos * process inheritance * keep decorators? * 馃 * deal with duplicates * fixup * correctly remove duplicate code * run ruff post script * ruff deals pretty well with imports, let's leave it to him * ah maybe not lol * for now remove all imports from child. * nit * conversion of llama * okay * convert starcoder2 * synch with main * update llama diff * updates * https://docs.astral.sh/ruff/rules/redefined-while-unused/ fixes the imports, but needs a later version of ruff * updates * okay actual state * non zero exit * update! * revert unrelated * remove other diff files * updates * cleanup * update * less diff! * stash * current updates * updates * No need for call * finished finding deps * update * current changes * current state * current state * new status * nit * finally * fixes * nits * order is now expected * use logger info instead of prints * fixup * up * nit * update * nits * update * correct merge * update * update * update * add warning * update caution message * update * better merging strategy * copy class statements :wink: * fixups * nits * update * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> * nits * smaller header * do cleanup some stuff * even simpler header? * fixup * updates * ruff * update examples * nit * TODO * state * OUUUUUUF * current state * nits * final state * add a readme * fixup * remove diff llama * fix * nit * dummy not funny * ruff format tests src utils --check * even less diffs * less diffs and fix test * fixes * naming nit? * update converter and add super example * nits * updated for function signatures * update * update * add converted dummies * autoformat * single target assign fix * fixup * fix some imports * fixes * don't push them * `# noqa: F841` --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Vallepu Vamsi Krishna authored
* Description of quantization_config Added missing description about quantization_config in replace_with_bnb_linear for better readability. * Removed trailing spaces
-
Pavel Iakubovskii authored
* Initial setup * Metrics * Overfit on two batches * Train 40 epochs * Memory leak debugging * Trainer fine-tuning * Draft * Fixup * Trained end-to-end * Add requirements * Rewrite evaluator * nits * Add readme * Add instance-segmentation to the table * Support void masks * Remove sh * Update docs * Add pytorch test * Add accelerate test * Update examples/pytorch/instance-segmentation/README.md * Update examples/pytorch/instance-segmentation/run_instance_segmentation.py * Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py * Update examples/pytorch/instance-segmentation/run_instance_segmentation_no_trainer.py * Update examples/pytorch/instance-segmentation/run_instance_segmentation.py * Fix consistency oneformer * Fix imports * Fix imports sort * Apply suggestions from code review Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update examples/pytorch/instance-segmentation/run_instance_segmentation.py Co-authored-by:
Sangbum Daniel Choi <34004152+SangbumChoi@users.noreply.github.com> * Add resources to docs * Update examples/pytorch/instance-segmentation/README.md Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Update examples/pytorch/instance-segmentation/README.md Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * Remove explicit model_type argument * Fix tests * Update readme * Note about other models --------- Co-authored-by:
NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by:
Sangbum Daniel Choi <34004152+SangbumChoi@users.noreply.github.com> Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Aymeric Roucher authored
* Implement streaming run in ReAct agents * Allow additional imports in code agents * Python interpreter: support classes and exceptions, fixes
-
Marc Sun authored
* add sanity evaluation * fix * Apply suggestions from code review Co-authored-by:
Zach Mueller <muellerzr@gmail.com> * fix --------- Co-authored-by:
Zach Mueller <muellerzr@gmail.com>
-
Younes Belkada authored
enhance error message
-
Asif Ajrof authored
The `mask` variable is not defined; it is probably a writing mistake and should be `segmentation_map`. `segmentation_map` should be a 1-channel image rather than RGB. On a different note, the `mask_url` is the same as `raw_image`; a better example could be provided.
-
Marc Sun authored
-
Yih-Dar authored
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Arthur authored
* helper * Apply suggestions from code review Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> * updates * more doc --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 30 May, 2024 4 commits
-
-
Younes Belkada authored
remove `IS_GITHUB_CI`
-
Younes Belkada authored
Replace all occurrences of `load_in_8bit` with bnb config
-
zspo authored
fix get_scheduler args
-
Younes Belkada authored
add validation for bnb config
-
- 29 May, 2024 12 commits
-
-
Yih-Dar authored
* remove * build --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Dhruv Pai authored
* Modified test * Added on_optimizer_step to callbacks * Move callback after step is called * Added on optimizer step callback
-
Joao Gante authored
* add Raushan * add Raushan
-
Younes Belkada authored
Update overview.md
-
Yih-Dar authored
* fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
-
Zach Mueller authored
-
Matt authored
-
Matt authored
* Fix env.py in cases where torch is not present * Simplify the fix (and avoid some issues)
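Making `env.py` robust when torch is not installed typically comes down to probing for the package before importing it. A minimal sketch of that general pattern (illustrative only, with hypothetical helper names, not the exact code from this fix):

```python
import importlib.util


def is_torch_available() -> bool:
    """Check whether torch can be imported, without importing it."""
    return importlib.util.find_spec("torch") is not None


def torch_version() -> str:
    """Report the torch version, or a placeholder when torch is absent."""
    if not is_torch_available():
        return "not installed"
    import torch  # safe: the spec was found above

    return torch.__version__


print(torch_version())
```

Keeping the `import torch` behind the availability check is what lets the environment report run on machines without torch.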
-
Huazhong Ji authored
* Improve `transformers-cli env` reporting * move the line `"Using GPU in script?": "<fill in>"` into the if conditional statement * same option for npu
-
Lucain authored
* Fix has_file in offline mode * harmonize env variable for offline mode * Switch to HF_HUB_OFFLINE * fix test * revert test_offline to test TRANSFORMERS_OFFLINE * Add new offline test * merge conflicts * docs
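The harmonization above makes `HF_HUB_OFFLINE` the canonical switch while keeping `TRANSFORMERS_OFFLINE` working for backward compatibility. A minimal sketch of how such an environment check might look (illustrative, not the library's exact code):

```python
import os

_TRUTHY = {"1", "ON", "YES", "TRUE"}


def is_offline_mode() -> bool:
    """True when either offline environment variable is set to a truthy value.

    HF_HUB_OFFLINE is the harmonized name; TRANSFORMERS_OFFLINE is kept
    for backward compatibility.
    """
    for var in ("HF_HUB_OFFLINE", "TRANSFORMERS_OFFLINE"):
        if os.environ.get(var, "").upper() in _TRUTHY:
            return True
    return False


os.environ["HF_HUB_OFFLINE"] = "1"
print(is_offline_mode())  # True
```

Accepting both variables is what lets the old `TRANSFORMERS_OFFLINE=1` workflows keep working after the switch.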
-
Younes Belkada authored
* add mistral v3 conversion script * Update src/transformers/models/mistral/convert_mistral_weights_to_hf.py Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com> * fixup --------- Co-authored-by:
Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
Raushan Turganbay authored
* quanto latest version was refactored * add error msg * incorrect compare sign * Update src/transformers/cache_utils.py Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by:
amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
- 28 May, 2024 4 commits
-
-
amyeroberts authored
* Deprecate models - graphormer - time_series_transformer - xlm_prophetnet - qdqbert - nat - ernie_m - tvlt - nezha - mega - jukebox - vit_hybrid - x_clip - deta - speech_to_text_2 - efficientformer - realm - gptsan_japanese * Fix up * Fix speech2text2 imports * Make sure message isn't indented * Fix docstrings * Correctly map for deprecated models from model_type * Uncomment out * Add back time series transformer and x-clip * Import fix and fix-up * Fix up with updated ruff
-
Younes Belkada authored
Update _redirects.yml
-
Younes Belkada authored
* fix flan t5 tests * better format
-
Jonny Li authored
-