- 08 Mar, 2024 5 commits
-
-
Clémentine Fourrier authored
-
Wang, Yi authored
* fix image-to-text batch incorrect output issue
* add ci test
* update ci test
---------
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi <yi.a.wang@intel.com>
-
Fanli Lin authored
* add sacremoses check
* fix style
* for FlaubertTokenizer
* HerbertTokenizer fix
* add typeHint
* Update src/transformers/testing_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
* make less skipped
* make quality
* remove import
---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
-
Joao Gante authored
* left-padding test revisited
* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
Pedro Cuenca authored
Potential typo in mlx support
-
- 07 Mar, 2024 9 commits
-
-
Nick DeGroot authored
* 🐛 Fix vision encoder decoder positional arg
* ✅ Add test for VisionEncoderDecoder with LayoutLMv3 encoder
---------
Co-authored-by: Nick DeGroot <1966472+nickthegroot@users.noreply.github.com>
-
Alvaro Bartolome authored
* Set `inputs` as kwarg in `TextClassificationPipeline`. This aligns the `TextClassificationPipeline` with the rest of the pipelines and makes calls such as `pipeline(**{"inputs": "text"})` possible, which previously failed because `*args` was being used instead.
* Add `noqa: C409` on `tuple([inputs],)`. Even though it is discouraged by the linter, the cast `tuple(list(...),)` is required here, as otherwise the original list in `inputs` would be transformed into a `tuple` and elements 1...N would be ignored by the `Pipeline`.
* Run `ruff format`
* Simplify `tuple` conversion with `(inputs,)`
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
---------
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>
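The tuple point in the entry above can be sketched in plain Python (illustrative helper names, not the actual pipeline internals): calling `tuple()` on a list spreads its elements into separate positional arguments, while a one-element tuple `(inputs,)` keeps the whole list as a single argument.

```python
# Why `(inputs,)` rather than `tuple(inputs)` — a minimal sketch.

def to_args_spread(inputs):
    # tuple() iterates over the list: each text becomes its own element
    return tuple(inputs)

def to_args_wrapped(inputs):
    # one-element tuple: the whole list stays a single argument
    return (inputs,)

texts = ["first text", "second text"]
print(to_args_spread(texts))   # ('first text', 'second text')
print(to_args_wrapped(texts))  # (['first text', 'second text'],)
```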
-
amyeroberts authored
* Fall back to pytorch model for now * Fix up
-
Alex Ishida authored
Add support for loading safetensors files saved with metadata format mlx.
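For context on where that metadata lives: the documented safetensors layout is an 8-byte little-endian header length followed by a JSON header whose optional `__metadata__` entry carries string key/value pairs (MLX writes e.g. `{"format": "mlx"}` there). A hedged sketch, with `read_safetensors_metadata` as an illustrative helper rather than a real API:

```python
import json
import struct

def read_safetensors_metadata(raw: bytes) -> dict:
    # first 8 bytes: little-endian u64 giving the JSON header's length
    (header_len,) = struct.unpack("<Q", raw[:8])
    header = json.loads(raw[8 : 8 + header_len].decode("utf-8"))
    # "__metadata__" is optional free-form string metadata
    return header.get("__metadata__", {})

# Build a tiny in-memory file (empty tensor table) for demonstration.
header_bytes = json.dumps({"__metadata__": {"format": "mlx"}}).encode("utf-8")
blob = struct.pack("<Q", len(header_bytes)) + header_bytes
print(read_safetensors_metadata(blob))  # {'format': 'mlx'}
```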
-
Raushan Turganbay authored
* flava multimodal add attn mask * make style * check mask is not None
-
Ashok Pon Kumar authored
Signed-off-by: Ashok Pon Kumar Sree Prakash <ashokponkumar@gmail.com>
-
Lysandre Debut authored
Revert "Automatic safetensors conversion when lacking these files (#29390)" This reverts commit a69cbf4e.
-
Joao Gante authored
-
regisss authored
* Enable BLIP for auto VQA * Make style * Add VQA to BLIP pipeline tests
-
- 06 Mar, 2024 13 commits
-
-
Park Jun authored
* Fix: Disable torch.autocast in RotaryEmbedding of Gemma and LLaMa for MPS devices
* Update src/transformers/models/gemma/modeling_gemma.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* Update llama and gemma RoPE to use the CPU on MPS devices
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
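The point of the fix above is that the rotary-embedding angle tables should be computed in full precision rather than under autocast. A toy sketch of the rotation itself (names are illustrative, not the transformers implementation):

```python
import math

def rope_tables(dim: int, pos: int, base: float = 10000.0):
    # angle tables, computed in full float precision
    inv_freq = [base ** (-2 * i / dim) for i in range(dim // 2)]
    cos = [math.cos(pos * f) for f in inv_freq]
    sin = [math.sin(pos * f) for f in inv_freq]
    return cos, sin

def apply_rope(x, cos, sin):
    # rotate the two halves of the vector by the position-dependent angle
    half = len(x) // 2
    x1, x2 = x[:half], x[half:]
    rotated = [a * c - b * s for a, b, c, s in zip(x1, x2, cos, sin)]
    rotated += [b * c + a * s for a, b, c, s in zip(x1, x2, cos, sin)]
    return rotated

cos, sin = rope_tables(4, 0)
print(apply_rope([1.0, 2.0, 3.0, 4.0], cos, sin))  # position 0 leaves x unchanged
```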
-
Glen Taggart authored
Substantially reduce memory usage in _update_causal_mask for large batches by using .expand instead of .repeat [needs tests+sanity check] (#29413)
* try to fix gemma mem use
* fix: handle attention mask dim==2 case
* remove logits=logits.float()
* clean up + add llama
* apply formatting
* readability edit: swap order of items being multiplied
* revert change unrelated to PR
* revert black autoformat
* switch to one .to
* Accept style edits
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
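The view-versus-copy distinction behind the memory saving above can be illustrated with NumPy analogues (a sketch, not the torch code): `np.broadcast_to` is analogous to torch's `.expand` and allocates no new data, while `np.tile` is analogous to `.repeat` and materializes a full copy per batch row.

```python
import numpy as np

mask = np.zeros((1, 4096), dtype=np.int8)  # a 2D attention-mask-like array
batch = 512

expanded = np.broadcast_to(mask, (batch, 4096))  # view: no new allocation
tiled = np.tile(mask, (batch, 1))                # copy: batch x 4096 bytes

print(expanded.strides[0])  # 0 — the batch dimension reuses the same row
print(tiled.nbytes)         # 2097152 — fully materialized
```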
-
Alvaro Bartolome authored
-
Moshe Berchansky authored
* added the max_matching_ngram_size parameter into the GenerationConfig, for the PromptLookupCandidateGenerator
* switched back to keyword arguments
* added PromptLookupCandidateGenerator docstring for its parameters
* ruff reformat
* Update src/transformers/generation/configuration_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
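For readers unfamiliar with prompt lookup decoding, a hedged sketch of the idea and of `max_matching_ngram_size`'s role (illustrative code, not the transformers implementation): search the existing tokens for the longest trailing n-gram, capped at that size, that occurred earlier, and propose the tokens that followed it as draft candidates.

```python
def prompt_lookup_candidates(tokens, max_matching_ngram_size=2, num_candidates=3):
    # try the longest allowed n-gram first, shrinking on failure
    for n in range(max_matching_ngram_size, 0, -1):
        if n > len(tokens) - 1:
            continue
        tail = tokens[-n:]
        # scan earlier positions (excluding the tail itself) for a match
        for start in range(len(tokens) - n - 1, -1, -1):
            if tokens[start : start + n] == tail:
                follow = tokens[start + n : start + n + num_candidates]
                if follow:
                    return follow
    return []

seq = [5, 8, 2, 3, 9, 8, 2]
print(prompt_lookup_candidates(seq))  # [3, 9, 8] — the tokens after the earlier "8 2"
```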
-
Joao Gante authored
-
Zach Mueller authored
* Fix test failure * use item
-
Ofir Zafrir authored
-
Joao Gante authored
-
Joao Gante authored
-
Matthew Hoffman authored
* Fix TrainingArguments regression with torch <2.0.0 for dataloader_prefetch_factor. dataloader_prefetch_factor was added to TrainingArguments in #28498 with the default value None, but versions of torch <2.0.0 do not accept None and will raise an error if num_workers == 0 and prefetch_factor != 2
* Add is_torch_available() check
* Use is_torch_greater_or_equal_than_2_0; add back check for dataloader_prefetch_factor
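The guard described above can be sketched as follows (an illustrative helper under assumed semantics, not the TrainingArguments code): since torch < 2.0.0 rejects `prefetch_factor=None`, the kwarg is only forwarded when it is actually set.

```python
def dataloader_kwargs(torch_version, num_workers, prefetch_factor=None):
    kwargs = {"num_workers": num_workers}
    major = int(torch_version.split(".")[0])
    if major >= 2:
        # torch >= 2.0.0 accepts None and validates it internally
        kwargs["prefetch_factor"] = prefetch_factor
    elif prefetch_factor is not None:
        # older torch: only pass it when explicitly set (it also requires
        # num_workers > 0 to have any effect)
        kwargs["prefetch_factor"] = prefetch_factor
    return kwargs

print(dataloader_kwargs("1.13.1", num_workers=0))  # no prefetch_factor key passed
print(dataloader_kwargs("2.2.0", num_workers=2, prefetch_factor=4))
```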
-
Younes Belkada authored
* add accelerate docs
* Apply suggestions from code review
Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>
* Update starcoder2.md
* add correct generation
---------
Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>
-
Younes Belkada authored
* add docs on exllamav2 + AWQ * Update docs/source/en/quantization.md
-
Fanli Lin authored
* use require_torch_gpu * enable on XPU * fix
-
- 05 Mar, 2024 13 commits
-
-
AI4Harmony authored
* Update ko _toctree.yml
* Create ko: generation_strategies.md
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
* Apply suggestions from code review
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>
-
Michael authored
* [i18n-zh] Translate add_new_pipeline.md into Chinese * apply suggestions from Fan-Lin
-
Lysandre Debut authored
* Automatic safetensors conversion when lacking these files * Remove debug * Thread name * Typo * Ensure that raises do not affect the main thread
-
Logan Adams authored
* Update to pull function from proper lib * Fix ruff formatting error * Remove accidentally added file
-
AleksanderWWW authored
* Fix bug with passing capture_* args to neptune callback * ruff happy? * instantiate (frozen)set only once * code review * code review 2 * ruff happy? * code review
-
Arthur authored
* initial-commit * start cleaning * small nits * small nits * current updates * add kernels * small refactoring little step * add comments * styling * nit * nits * Style * Small changes * Push dummy mambda simple slow * nit * Use original names * Use original names and remove norm * Updates for inference params * Style nd updates * nits * Match logits * Add a test * Add expected generated text * nits doc, imports and styling * style * oups * dont install kernels, invite users to install the required kernels * let use use the original packages * styling * nits * fix some copieds * update doc * fix-copies * styling done * nits * fix import check * run but wrong cuda ress * mamba CUDA works :) * fix the fast path * config naming nits * conversion script is not required at this stage * finish fixing the fast path: generation make sense now! * nit * Let's start working on the CIs * style * better style * more nits * test nit * quick fix for now * nits * nit * nit * nit * nits * update test rest * fixup * update test * nit * some fixes * nits * update test values * fix styling * nit * support peft * integrations tests require torchg * also add slow markers * styling * chose forward wisely * nits * update tests * fix gradient checkpointing * fixup * nit * fix doc * check copies * fix the docstring * fix some more tests * style * fix beam search * add init schene * update * nit * fix * fixup the doc * fix the doc * fixup * tentative update but slow is no longer good * nit * should we always use float32? * nits * revert wrong changes * res in float32 * cleanup * skip fmt for now * update generation values * update test values running original model * fixup * update tests + rename inference_params to cache_params + make sure training does not use cache_params * small nits * more nits * fix final CIs * style * nit doc * I hope final doc nits * nit * 🫠 * final touch! * fix torch import * Apply suggestions from code review Co-authored-by: Lysandre Debut <hi@lysand.re> * Apply suggestions from code review * fix fix and fix * fix base model prefix! * nit * Update src/transformers/models/mamba/__init__.py * Update docs/source/en/model_doc/mamba.md Co-authored-by: Lysandre Debut <hi@lysand.re> * nit --------- Co-authored-by: Lysandre Debut <hi@lysand.re>
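For orientation on what the slow path in the Mamba port computes, a scalar toy sketch of the state-space recurrence (illustrative only; real Mamba uses per-channel, input-dependent parameters and fused CUDA scan kernels):

```python
def ssm_scan(xs, A=0.5, B=1.0, C=2.0):
    # linear recurrence h_t = A*h_{t-1} + B*x_t, readout y_t = C*h_t,
    # run sequentially over the sequence
    h = 0.0
    ys = []
    for x in xs:
        h = A * h + B * x  # state update (the cached state between steps)
        ys.append(C * h)   # readout
    return ys

print(ssm_scan([1.0, 0.0, 0.0]))  # [2.0, 1.0, 0.5] — impulse response decays by A
```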
-
Joao Gante authored
-
Arthur authored
* fix udop imports * sort imports
-
Arthur authored
* style * revert with RP * nit * exact revert
-
Arthur Zucker authored
-
Arthur authored
* update * ... * nits * arf * 🧼 * beat the last guy * style everyone
-
Fanli Lin authored
* use torch_device
* Update tests/pipelines/test_pipelines_text_generation.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
* fix style
---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
-
Joshua Lochner authored
Update starcoder2 paper link
-