Commits · 6d67837f06fb8e3155a5c5b0dd57cd09841bc9f9 · chenpangpang / transformers

11 Mar, 2024 3 commits

Add Fill-in-the-middle training objective example - PyTorch (#27464) · 6d67837f

Tanay Mehta authored Mar 11, 2024

* add: initial script to train clm fim

* fix: if training model from scratch, new tokens will be added and embeddings resized

* fix: fixed attention_mask errors when generating FIM data

* fix: file formatted using black

* add: run_fim_no_trainer.py and fixed some comments in run_fim.py

* add: added fim examples to the README.md and ran code fixup

* fix: little bug in both fim training scripts

* fix: remove comment from notebook and added a note on fim related params

* fix: minor typo in README

* add: suggested minor changes to README and run_fim.py

* add: gradient_accumulation_steps and gradient_checkpointing args

* add: improved model embedding resizing

* add: pad_to_multiple_of and attn_implementation params

* add: requested minor changes

* add: deepspeed zero compatibility

* add: resize embeddings layer with zero3 support for fim model initialization

6d67837f

[`Docs`] fixed minor typo (#29555) · d80c9a34
j-gc authored Mar 11, 2024

d80c9a34
[`Mamba doc`] Post merge updates (#29472) · 4f27ee93
Arthur authored Mar 11, 2024
```
* post merge update

* nit

* oups
```
4f27ee93

08 Mar, 2024 13 commits

feat: use `warning_advice` for tensorflow warning (#29540) · 0290ec19
Winston H authored Mar 09, 2024
```
feat: use `warning_advice` instead of tensorflow warning
```
0290ec19

Fix eval thread fork bomb (#29538) · 469c1328

Zach Mueller authored Mar 08, 2024

* Fix eval thread fork bomb

* Keep eval dl persistent and prepare after so free_memory doesn't destroy it

* Add note

* Quality

469c1328

[tests] use the correct `n_gpu` in... · 3f6973db

Fanli Lin authored Mar 08, 2024

[tests] use the correct `n_gpu` in `TrainerIntegrationTest::test_train_and_eval_dataloaders` for XPU (#29307)

* fix n_gpu

* fix style

3f6973db

Fix WhisperNoSpeechDetection when input is full silence (#29065) · 1ba89dc2
Yoach Lacombe authored Mar 08, 2024
```
fix total silence input with no_speech_threshold
```
1ba89dc2
fix typos in FSDP config parsing logic in `TrainingArguments` (#29189) · 697f05ba
Yun Dai authored Mar 08, 2024
```
fix FSDP config
```
697f05ba
Make sliding window size inclusive in eager attention (#29519) · 608fa549
Jonatan Kłosko authored Mar 08, 2024
```
* Make sliding window size inclusive in eager attention

* Fix tests
```
608fa549

StableLM: Fix dropout argument type error (#29236) · f386c51a

liangjs authored Mar 08, 2024



* fix stablelm dropout argument type error

* fix docs of _flash_attention_forward

* fix all docs of _flash_attention_forward

* fix docs of _flash_attention_forward in starcoder2

---------
Co-authored-by: oliang <oliang@tencent.com>

f386c51a

[tests] use `torch_device` instead of `auto` for model testing (#29531) · 1ea3ad1a

Fanli Lin authored Mar 08, 2024



* use torch_device

* skip for XPU

* Update tests/generation/test_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

1ea3ad1a

Typo fix in error message (#29535) · 14536c33
Clémentine Fourrier authored Mar 08, 2024

14536c33

fix image-to-text batch incorrect output issue (#29342) · 8ee1d472

Wang, Yi authored Mar 08, 2024



* fix image-to-text batch incorrect output issue
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>

* add ci test
Signed-off-by: Wang, Yi <yi.a.wang@intel.com>

* update ci test
Signed-off-by: Wang, Yi <yi.a.wang@intel.com>

---------
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi <yi.a.wang@intel.com>

8ee1d472

[tests] add the missing `require_sacremoses` decorator (#29504) · 8e589c83

Fanli Lin authored Mar 08, 2024



* add sacremoses check

* fix style

* for FlaubertTokenizer

* HerbertTokenizer fix

* add typeHint

* Update src/transformers/testing_utils.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* make less skipped

* make quality

* remove import

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

8e589c83

Generate: left-padding test, revisited (#29515) · bc764f42

Joao Gante authored Mar 08, 2024



* left-padding test revisited

* Apply suggestions from code review
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

bc764f42

Typo in mlx tensor support (#29509) · 631fa7bf
Pedro Cuenca authored Mar 08, 2024
```
Potential typo in mlx support
```
631fa7bf

07 Mar, 2024 9 commits

Fix `VisionEncoderDecoder` Positional Arg (#29497) · b338a6c3

Nick DeGroot authored Mar 07, 2024

* 🐛 Fix vision encoder decoder positional arg

* ✅

 Add test for VisionEncoderDecoder with LayoutLMv3 encoder

---------
Co-authored-by: Nick DeGroot <1966472+nickthegroot@users.noreply.github.com>

b338a6c3

Set `inputs` as kwarg in `TextClassificationPipeline` (#29495) · ddf177ee

Alvaro Bartolome authored Mar 07, 2024



* Set `inputs` as kwarg in `TextClassificationPipeline`

This change has been done to align the `TextClassificationPipeline` with the rest of the pipelines, and to be able to e.g. `pipeline(**{"inputs": "text"})` which wouldn't be possible since the `*args` were being used instead.

* Add `noqa: C409` on `tuple([inputs],)`

Even though is discouraged by the linter, the cast `tuple(list(...),)` is required here, as otherwise the original list in `inputs` will be transformed into a `tuple` and the elements 1...N will be ignored by the `Pipeline`

* Run `ruff format`

* Simplify `tuple` conversion with `(inputs,)`
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

---------
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

ddf177ee

test_generation_config_is_loaded_with_model - fall back to pytorch model for now (#29521) · 4ed9ae62
amyeroberts authored Mar 07, 2024
```
* Fall back to pytorch model for now

* Fix up
```
4ed9ae62
Add support for metadata format MLX (#29335) · 45c06510
Alex Ishida authored Mar 07, 2024
```
Add support for loading safetensors files saved with metadata format mlx.
```
45c06510
Flava multimodal add attention mask (#29446) · 923733c2
Raushan Turganbay authored Mar 07, 2024
```
* flava multimodal add attn mask

* make style

* check mask is not None
```
923733c2
fix: Avoid error when fsdp_config is missing xla_fsdp_v2 (#29480) · 9288e759
Ashok Pon Kumar authored Mar 07, 2024
```
Signed-off-by: Ashok Pon Kumar Sree Prakash <ashokponkumar@gmail.com>
```
9288e759
Revert "Automatic safetensors conversion when lacking these files (#2… (#29507) · f6133d76
Lysandre Debut authored Mar 07, 2024
```
Revert "Automatic safetensors conversion when lacking these files (#29390)"

This reverts commit a69cbf4e.
```
f6133d76
v4.39 deprecations 🧼 (#29492) · ffe60fdc
Joao Gante authored Mar 07, 2024

ffe60fdc
Enable BLIP for auto VQA (#29499) · 979fccc9
regisss authored Mar 07, 2024
```
* Enable BLIP for auto VQA

* Make style

* Add VQA to BLIP pipeline tests
```
979fccc9

06 Mar, 2024 13 commits

Fix: Disable torch.autocast in RotaryEmbedding of Gemma and LLaMa for MPS device (#29439) · d45f47ab

Park Jun authored Mar 07, 2024



* Fix: Disable torch.autocast in RotaryEmbedding of Gemma and LLaMa for MPS devices

* Update src/transformers/models/gemma/modeling_gemma.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update llama ang gemma rope use cpu in mps device

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

d45f47ab

Substantially reduce memory usage in _update_causal_mask for large batches by... · 2a939f20

Glen Taggart authored Mar 06, 2024


Substantially reduce memory usage in _update_causal_mask for large batches by using .expand instead of .repeat [needs tests+sanity check] (#29413)

* try to fix gemma mem use

* fix: handle attention mask dim==2 case

* remove logits=logits.float()

* clean up + add llama

* apply formatting

* readability edit: swap order of items being multiplied

* revert change unrelated to PR

* revert black autoformat

* switch to one .to

* Accept style edits
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

2a939f20

Fix `TextGenerationPipeline.__call__` docstring (#29491) · 965cf677
Alvaro Bartolome authored Mar 06, 2024

965cf677

added the max_matching_ngram_size to GenerationConfig (#29131) · 19fb1e22

Moshe Berchansky authored Mar 06, 2024



* added the max_matching_ngram_size parameter into the GenerationConfig, for the PromptLookupCandidateGenerator

* switched back to keyword arguments

* added PromptLookupCandidateGenerator docstring for its parameters

* ruff reformat

* Update src/transformers/generation/configuration_utils.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

19fb1e22

Generate: torch.compile-ready generation config preparation (#29443) · ddb4fda3
Joao Gante authored Mar 06, 2024

ddb4fda3
Fix test failure on DeepSpeed (#29444) · 9322576e
Zach Mueller authored Mar 06, 2024
```
* Fix test failure

* use item
```
9322576e
Avoid dummy token in PLD to optimize performance (#29445) · 0a5b0516
Ofir Zafrir authored Mar 06, 2024

0a5b0516
Generate: get generation mode from the generation config instance 🧼 (#29441) · 700d48fb
Joao Gante authored Mar 06, 2024

700d48fb
Generate: add tests for caches with `pad_to_multiple_of` (#29462) · 41f7b7ae
Joao Gante authored Mar 06, 2024

41f7b7ae

Fix TrainingArguments regression with torch <2.0.0 for dataloader_prefetch_factor (#29447) · 2890116a

Matthew Hoffman authored Mar 06, 2024

* Fix TrainingArguments regression with torch <2.0.0 for dataloader_prefetch_factor

dataloader_prefetch_factor was added to TrainingArguments in #28498 with the default value None, but  versions of torch<2.0.0 do not accept None and will raise an error if num_workers == 0 and prefetch_factor != 2

* Add is_torch_available() check

* Use is_torch_greater_or_equal_than_2_0

add back check for dataloader_prefetch_factor

2890116a

[`docs`] Add starcoder2 docs (#29454) · b27aa206

Younes Belkada authored Mar 06, 2024



* add accelerate docs

* Apply suggestions from code review
Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

* Update starcoder2.md

* add correct generation

---------
Co-authored-by: Loubna Ben Allal <44069155+loubnabnl@users.noreply.github.com>

b27aa206

[`Docs` / `Awq`] Add docs on exllamav2 + AWQ (#29474) · 2a002d07
Younes Belkada authored Mar 06, 2024
```
* add docs on exllamav2 + AWQ

* Update docs/source/en/quantization.md
```
2a002d07
[FIX] `offload_weight()` takes from 3 to 4 positional arguments but 5 were given (#29457) · 00bf4427
Fanli Lin authored Mar 06, 2024
```
* use require_torch_gpu

* enable on XPU

* fix
```
00bf4427

05 Mar, 2024 2 commits

🌐

[i18n-KO] Translated generation_strategies.md to Korean (#29086) · 7b01579f

AI4Harmony authored Mar 06, 2024



* Update ko _toctree.yml

* Create ko: generation_strategies.md

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

* Apply suggestions from code review
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Jungnerd <46880056+jungnerd@users.noreply.github.com>

7b01579f

[i18n-zh] Translate add_new_pipeline.md into Chinese (#29432) · 638c423c
Michael authored Mar 06, 2024
```
* [i18n-zh] Translate add_new_pipeline.md into Chinese

* apply suggestions from Fan-Lin
```
638c423c