Commits · 5eeaef921f70acd68073d1066ccb09d7c6e6f475 · chenpangpang / transformers

22 Aug, 2023 10 commits

Adds `TRANSFORMERS_TEST_BACKEND` (#25655) · 5eeaef92

Alex McKinney authored Aug 22, 2023

* Adds `TRANSFORMERS_TEST_BACKEND`
Allows specifying an arbitrary additional import that follows the first `import torch`.
This is useful for some custom backends that require additional imports to trigger backend registration with upstream torch.
See https://github.com/pytorch/benchmark/pull/1805 for a similar change in `torchbench`.

* Update src/transformers/testing_utils.py
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* Adds real backend example to documentation

---------
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

5eeaef92
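The mechanism described in this commit message can be sketched roughly as follows. This is a minimal illustration, not the code from `src/transformers/testing_utils.py`: the helper name `import_test_backend` is hypothetical, and the preceding `import torch` is left out so the snippet runs with the standard library only.

```python
import importlib
import os


def import_test_backend(env_var: str = "TRANSFORMERS_TEST_BACKEND"):
    """Import the module named by `env_var`, if it is set.

    Hypothetical helper: returns the imported module, or None when the
    variable is unset or empty.
    """
    backend = os.environ.get(env_var)
    if not backend:
        return None
    try:
        # Importing the module is what lets a custom backend register itself.
        return importlib.import_module(backend)
    except ModuleNotFoundError as exc:
        raise ModuleNotFoundError(
            f"Failed to import `{env_var}` '{backend}'; "
            "it should be an importable module path."
        ) from exc


# Stand-in usage: `json` plays the role of a real backend module here.
os.environ["TRANSFORMERS_TEST_BACKEND"] = "json"
module = import_test_backend()
```

In a real test run the variable would name the backend's own package, set in the shell before invoking the test suite.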

removing unnecessary extra parameter (#25643) · fd56f7f0
Rafael Padilla authored Aug 22, 2023

fd56f7f0

Fix bloom add prefix space (#25652) · e20fab0b

Arthur authored Aug 22, 2023

* properly support Sequence of pretokenizers

* actual fix

* make sure the fix works. Tests are not working for sure!

* hacky way

* add TODO

* update

* add a todo

* nits

* rename test

* nits

* rename test

e20fab0b

TF 2.14 compatibility (#25630) · 62396cff

Matt authored Aug 22, 2023

* Update the TF pin and see if anything breaks

* make fixup

* make fixup

* make fixup

62396cff

Put IDEFICS in the right section of the doc (#25650) · 36291906
Sylvain Gugger authored Aug 22, 2023

36291906
Pass the proper token to PEFT integration in auto classes (#25649) · edb28722
Sylvain Gugger authored Aug 22, 2023

edb28722
[MINOR:TYPO] (#25646) · 88e51ba3
Christopher Akiki authored Aug 22, 2023
```
[MINOR:TYPO] Update tokenization_auto.py
```
88e51ba3

[DOCS] MusicGen Docs Update (#25510) · 6a314ea7

Blake Wyatt authored Aug 22, 2023

* docs: note token limitations for MusicGen

* docs: note token limitations for MusicGen

* docs: fix token count with token limitations for MusicGen

6a314ea7

Add Number Normalisation for SpeechT5 (#25447) · 182b8374

Tanay Mehta authored Aug 22, 2023

* add: NumberNormalizer works for integers, floats, common currencies, negative numbers and percentages

* fix: renamed number normalizer class and added normalization to SpeechT5Processor

* fix: restyled with black and ruff, should pass code quality tests

* fix: moved normalization to tokenizer and other small changes to normalizer

* add: test for normalization and changed the existing full tokenizer test

* fix: tokenization tests now pass, made changes to existing tokenization where normalization is covered; added normalize arg to func signature

* fix: changed default normalize setting to False, modified the tests a bit

* fix: added support for comma separated numbers, tokenization on the fly with kwargs and normalizer getter setter funcs

182b8374
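To illustrate the kind of normalisation these bullets describe, here is a small sketch that spells out integers, decimals, negative numbers, percentages and comma-separated numbers. It is an assumption-laden illustration, not the normalizer added to the SpeechT5 tokenizer: all names are hypothetical, currencies are omitted, and values of one million or more are rejected.

```python
import re

_ONES = [
    "zero", "one", "two", "three", "four", "five", "six", "seven", "eight",
    "nine", "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen",
    "sixteen", "seventeen", "eighteen", "nineteen",
]
_TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
         "eighty", "ninety"]


def int_to_words(n: int) -> str:
    """Spell out an integer in the range 0 <= n < 1_000_000."""
    if not 0 <= n < 1_000_000:
        raise ValueError("this sketch only handles 0 <= n < 1,000,000")
    if n < 20:
        return _ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return _TENS[tens] + ("" if ones == 0 else " " + _ONES[ones])
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        tail = "" if rest == 0 else " " + int_to_words(rest)
        return _ONES[hundreds] + " hundred" + tail
    thousands, rest = divmod(n, 1000)
    tail = "" if rest == 0 else " " + int_to_words(rest)
    return int_to_words(thousands) + " thousand" + tail


def number_to_words(token: str) -> str:
    """Spell out one numeric token such as '-3', '1,250' or '4.5%'."""
    suffix = ""
    if token.endswith("%"):
        token, suffix = token[:-1], " percent"
    prefix = ""
    if token.startswith("-"):
        token, prefix = token[1:], "minus "
    whole, _, frac = token.replace(",", "").partition(".")
    words = int_to_words(int(whole))
    if frac:
        # Decimal digits are read one at a time.
        words += " point " + " ".join(_ONES[int(d)] for d in frac)
    return prefix + words + suffix


# Optional sign, digits with optional thousands separators, optional
# decimal part, optional percent sign.
_NUMBER = re.compile(r"-?\d+(?:,\d{3})*(?:\.\d+)?%?")


def normalize_numbers(text: str) -> str:
    """Replace every numeric token in `text` with its spelled-out form."""
    return _NUMBER.sub(lambda m: number_to_words(m.group()), text)
```

For example, `normalize_numbers("It costs 1,250 and rose 4.5%")` yields `"It costs one thousand two hundred fifty and rose four point five percent"`. Per the commit message, normalisation in the tokenizer is off by default (`normalize` defaults to `False`).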

Support specifying revision in push_to_hub (#25578) · 58c36bea
Joe Mifsud authored Aug 22, 2023
```
Support revision in push_to_hub
```
58c36bea

21 Aug, 2023 12 commits

Add Pop2Piano (#21785) · 450a181d

Susnato Dhar authored Aug 21, 2023



* init commit

* config updated also some modeling

* Processor and Model config combined

* extraction pipeline (up to before spectrogram & mel_conditioner) added but not properly tested

* model loading successful!

* feature extractor done!

* FE can now be called from HF

* postprocessing added in fe file

* same as prev commit

* Pop2PianoConfig doc done

* cfg docs slightly changed

* fe docs done

* batched

* batched working!

* temp

* v1

* checking

* trying to go with generate

* with generate and model tests passed

* before rebasing

* .

* tests done docs done remaining others & nits

* nits

* LogMelSpectogram shifted to FeatureExtractor

* is_tf removed from pop2piano/init

* import solved

* tokenization tests added

* minor fixed regarding modeling_pop2piano

* tokenizer changed to only return midi_object and other changes

* Updated paper abstract(Camera-ready version) (#2)

* more comments and nits

* ruff changes

* code quality fix

* sg comments

* t5 change added and rebased

* comments except batching

* batching done

* comments

* small doc fix

* example removed from modeling

* ckpt

* forward is compatible with fe and generation done

* comments

* comments

* code-quality fix(maybe)

* ckpts changed

* doc file changed from mdx to md

* test fixes

* tokenizer test fix

* changes

* nits done main changes remaining

* code modified

* Pop2PianoProcessor added with tests

* other comments

* added Pop2PianoProcessor to dummy_objects

* added require_onnx to modeling file

* changes

* update .md file

* remove extra line in index.md

* back to the main index

* added pop2piano to index

* Added tokenizer.__call__ with valid args and batch_decode and aligned the processor part too

* changes

* added return types to 2 tokenizer methods

* the PR build test might work now

* added backends

* PR build fix

* vocab added

* comments

* refactored vocab into 1 file

* added conversion script

* comments

* essentia version changed in .md

* comments

* more tokenizer tests added

* minor fix

* tests extended for outputs acc check

* small fix

---------
Co-authored-by: Jongho Choi <sweetcocoa@snu.ac.kr>

450a181d

fix documentation for CustomTrainer (#25635) · 6f041fcb
mchau authored Aug 21, 2023
```
fix doc
```
6f041fcb
🚨🚨🚨 changing default threshold and applying threshold before the rescale (#25608) · 8608bf20
Rafael Padilla authored Aug 21, 2023
```
changing position of score threshold and its default value
```
8608bf20
Skip doctest for some recent files (#25631) · 2df24228
Yih-Dar authored Aug 21, 2023
```
update
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
2df24228
fix ACT_FN (#25627) · 2582bbde
Arthur authored Aug 21, 2023

2582bbde
correct TTS pipeline docstrings snippet (#25587) · 2c1bcbf5
Yoach Lacombe authored Aug 21, 2023
```
* correct TTS pipeline docstrings snippet

* add text_to_audio.py pipelines to documentation tests
```
2c1bcbf5
Added paper links in logitprocess.py (#25482) · e769ca3d
Pranith Pashikanti authored Aug 21, 2023

e769ca3d
v4.33.0.dev0 · 5c67682b
Sylvain Gugger authored Aug 21, 2023

5c67682b
Fix test_modeling_mpt typo in model id (#25606) · 2f8acfea
Francisco Kurucz authored Aug 21, 2023
```
Fix model id in get_large_model_config on file test_modeling_mpt
```
2f8acfea
Run doctest for new files (#25588) · f09db47a
Yih-Dar authored Aug 21, 2023
```
fix
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
f09db47a
Fix PEFT integration failures on nightly CI (#25624) · 9627c3da
Younes Belkada authored Aug 21, 2023
```
fix PEFT integration failures
```
9627c3da
Ignore all exceptions from signal in dynamic code (#25623) · f92cc703
Sylvain Gugger authored Aug 21, 2023

f92cc703

19 Aug, 2023 1 commit
Hotfix · 1982dd3b
ydshieh authored Aug 19, 2023

1982dd3b
18 Aug, 2023 15 commits

reattach hooks when using `resize_token_embeddings` (#25596) · 6b82d936
Marc Sun authored Aug 18, 2023
```
* reattach hooks

* fix style
```
6b82d936

new model: IDEFICS via HuggingFaceM4 (#24796) · 6c811a32

Stas Bekman authored Aug 18, 2023



* rename

* restore

* mappings

* unedited tests+docs

* docs

* fixes

* fix auto-sync breakage

* cleanup

* wip

* wip

* add fetch_images

* remove einops dependency

* update

* fix

* fix

* fix

* fix

* fix

* re-add

* add batching

* rework

* fix

* improve

* add Leo as I am extending his work

* cleanup

* fix

* cleanup

* slow-test

* fix

* fix

* fixes

* deal with warning

* rename modified llama classes

* rework fetch_images

* alternative implementation

* cleanup

* strict version

* cleanup

* [`IDEFICS`] Fix idefics ci (#25056)

* Fix IDEFICS CI

* fix test file

* fixup

* some changes to make tests pass

* fix

* fixup

* Update src/transformers/models/idefics/configuration_idefics.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

---------
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* remove compat checks

* style

* explain that Idefics is not for training from scratch

* require pt>=2.0

* fix idefics vision config (#25092)

* fix idefics vision config

* fixup

* clean

* Update src/transformers/models/idefics/configuration_idefics.py

---------
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* cleanup

* style

* cleanup

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* upcase

* sequence of images

* handle the case with no images

* Update src/transformers/image_processing_utils.py
Co-authored-by: Victor SANH <victorsanh@gmail.com>

* support pure lm take 2

* support tokenizer options

* parameterize num_channels

* fix upcase

* s|IdeficsForCausalLM|IdeficsForVisionText2Text|g

* manual to one line

* addressing review

* unbreak

* remove clip dependency

* fix test

* consistency

* PIL import

* Idefics prefix

* Idefics prefix

* hack to make tests work

* style

* fix

* fix

* revert

* try/finally

* cleanup

* clean up

* move

* [`IDEFICS`] Fix idefics config refactor (#25149)

* refactor config

* nuke init weights

* more refactor

* oops

* remove visual question answering pipeline support

* Update src/transformers/models/idefics/clip.py
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>

* Update src/transformers/models/idefics/modeling_idefics.py

* cleanup

* mv clip.py vision.py

* tidyup

---------
Co-authored-by: Stas Bekman <stas00@users.noreply.github.com>
Co-authored-by: Stas Bekman <stas@stason.org>

* fix

* license

* condition on pt

* fix

* style

* fix

* rm torchvision dependency, allow custom transforms

* address review

* rework device arg

* add_eos_token

* s/transforms/transform/

* fix top level imports

* fix return value

* cleanup

* cleanup

* fix

* style

* license

* license

* Update src/transformers/models/idefics/image_processing_idefics.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add a wrapper to freeze vision layers

* tidyup

* use the correct std/mean settings

* parameterize values from config

* add tests/models/idefics/test_image_processing_idefics.py

* add test_processor_idefics.py

* cleanup

* cleanups

* fix

* fix

* move to the right group

* style

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* add perceiver config

* reset

* missing arg docs

* Apply suggestions from code review
Co-authored-by: Leo Tronchon <leo.tronchon@gmail.com>

* address review comments

* inject automatic end of utterance tokens (#25218)

* inject automatic end of utterance tokens

* fix

* fix

* fix

* rework to not use the config

* not end_of_utterance_token at the end

* Update src/transformers/models/idefics/processing_idefics.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* address review

* Apply suggestions from code review
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/image_processing_utils.py
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>

* [`Idefics`] add image_embeddings option in generate-related methods (#25442)

* add image_embeddings option in generate-related methods

* style

* rename image_embeddings and allow perceiver embeddings precomputation

* compute embeddings within generate

* make is_encoder_decoder= True the default in config

* nested if else fix

* better triple check

* switch if elif order for pixel values / img embeds

* update model_kwargs perceiver only at the end

* use _prepare_model_inputs instead of encoder_decoder logic

* fix comment typo

* fix config default for is_encoder_decoder

* style

* add typehints

* precompute in forward

* doc builder

* style

* pop instead of get image hidden states

* Trigger CI

* Update src/transformers/models/idefics/modeling_idefics.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* Update src/transformers/models/idefics/modeling_idefics.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix * + indentation + style

* simplify a bit the use_resampler logic using comments

* update docstrings

* Trigger CI

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fix rebase changes

* unbreak #25237 - to be fixed in follow up PRs

* is_composition = False

* no longer needed

---------
Co-authored-by: leot13 <leo.tronchon@gmail.com>
Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Victor SANH <victorsanh@gmail.com>
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

6c811a32

[i18n-KO] Translated `perf_train_tpu_tf.md` to Korean (#25433) · 4d64157e

Hyeonseo Yun authored Aug 19, 2023



* docs: ko: perf_train_tpu_tf.md

* feat: nmt and manual edit perf_train_tpu_tf.md

* fix: resolve suggestions
Co-authored-by: Sangam Lee <74291999+augustinLib@users.noreply.github.com>
Co-authored-by: Kim haewon <ehdvkf02@naver.com>
Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>

---------
Co-authored-by: Sangam Lee <74291999+augustinLib@users.noreply.github.com>
Co-authored-by: Kim haewon <ehdvkf02@naver.com>
Co-authored-by: Kihoon Son <75935546+kihoon71@users.noreply.github.com>

4d64157e

Make TTS automodels importable (#25595) · 6f4424bb

Omar Sanseviero authored Aug 18, 2023

* Add auto model for spectrogram/waveform

* Add doc and install

* Add dummy objects

* Did I miss anything?

6f4424bb

[`PEFT`] Peft integration alternative design (#25077) · faed2ca4

Younes Belkada authored Aug 18, 2023



* a draft version

* v2 integration

* fix

* make it more generic and works for IA3

* add set adapter and multiple adapters support

* fixup

* adapt a bit

* oops

* oops

* oops

* adapt more

* fix

* add more refactor

* now works with model class

* change it to instance method as it causes issues with `jit`.

* add CR

* change method name

* add `add_adapter` method

* clean up

* Update src/transformers/adapters/peft_mixin.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* add moe utils

* fixup

* Update src/transformers/adapters/peft_mixin.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* adapt

* oops

* fixup

* add is_peft_available

* remove `requires_backend`

* trainer compatibility

* fixup + docstring

* more details

* trigger CI

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* Update src/transformers/modeling_utils.py

* fixup + is_main_process

* added `save_peft_format` in save_pretrained

* up

* fix nits here and there

* nits here and there.

* docs

* revert `encoding="utf-8"`

* comment

* added slow tests before the PEFT release.

* fixup and nits

* let's be on the safe zone

* added more comments

* v1 docs

* add remaining docs

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* move to `lib_integrations`

* fixup

* this time fixup

* Apply suggestions from code review
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>

* address final comments

* refactor to use `token`

* add PEFT to DockerFile for slow tests.

* added pipeline support.

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

faed2ca4

[`TokenizerFast`] Fix setting prefix space in __init__ (#25563) · ef153425

Arthur authored Aug 18, 2023

* properly support Sequence of pretokenizers

* actual fix

* make sure the fix works. Tests are not working for sure!

* hacky way

* add TODO

* update

* add a todo

ef153425

fix z3 init when using accelerate launcher (#25589) · 636acc75
Sourab Mangrulkar authored Aug 18, 2023

636acc75
[Time series Informer] fix dtype of cumsum (#25431) · 8d2f953f
Kashif Rasul authored Aug 18, 2023
```
* fix dtype of cumsum

* add comment
```
8d2f953f

[`Llama`] remove prompt and fix prefix finetuning (#25565) · bc3e20dc

Arthur authored Aug 18, 2023

* nit

* update

* make sure use_default_system_prompt is saved

* update checkpointing

* consistency

* use_default_system_prompt for test

bc3e20dc

[`split_special_tokens`] Add support for `split_special_tokens` argument to encode (#25081) · 30b3c46f

Arthur authored Aug 18, 2023

* draft changes

* update and add tests

* styling for no

* move test

* path to usable model

* update test

* small update

* update bertbased tokenizers

* don't use kwargs for _tokenize

* don't use kwargs for _tokenize

* fix copies

* update

* update test for special tokenizers

* fixup

* skip two tests

* remove pdb breakpoint()

* wowo

* rewrite custom tests

* nits

* revert change in target keys

* fix markup lm

* update documentation of the argument

30b3c46f

Replaces calls to `.cuda` with `.to(torch_device)` in tests (#25571) · 9d7afd25

Alex McKinney authored Aug 18, 2023



* Replaces calls to `.cuda` with `.to(torch_device)` in tests
`torch.Tensor.cuda()` is a pre-0.4 solution for changing a tensor's device. It is recommended to prefer `.to(...)` for greater flexibility and error handling. Furthermore, this makes the tests more consistent with other tests (which tend to use `.to(torch_device)`) and ensures the correct device backend is used (if `torch_device` is neither `cpu` nor `cuda`).

* addressing review comments

* more formatting changes in Bloom test

* `make style`

* Update tests/models/bloom/test_modeling_bloom.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixes style failures

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

9d7afd25

Added missing parenthesis in call to is_fsdp_enabled (#25585) · c45aab75
Martin Malmsten authored Aug 18, 2023
```
Calling function is_fsdp_enabled instead of checking if it is not None
```
c45aab75
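The class of bug this commit fixes can be sketched as below. The function body is a hypothetical stand-in (the real `is_fsdp_enabled` inspects the distributed setup), and the exact original condition is not shown in the commit message, so the "buggy" line is an assumption based on its wording.

```python
def is_fsdp_enabled() -> bool:
    """Hypothetical stand-in that pretends FSDP is disabled."""
    return False


# Without parentheses the name refers to the function object itself,
# which is never None, so the check passes regardless of the real state.
buggy_check = is_fsdp_enabled is not None

# With parentheses the function is actually called and its result is used.
fixed_check = is_fsdp_enabled()
```

Here `buggy_check` is `True` even though the stand-in reports that FSDP is disabled, while `fixed_check` is `False`.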

[`Docs` / `BetterTransformer` ] Added more details about flash attention + SDPA (#25265) · 940d1a76

Younes Belkada authored Aug 18, 2023



* added more details about flash attention

* correct and add more details

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* few modifs

* more details

* up

* Apply suggestions from code review
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

* adapt from suggestion

* Apply suggestions from code review
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

* trigger CI

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* fix nits and copies

* add new section

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

940d1a76

Suggestions on Pipeline_webserver (#25570) · 08e32519

Kihoon Son authored Aug 18, 2023



* Suggestions on Pipeline_webserver

docs: reorder the warning tip for pseudo-code
Co-Authored-By: Wonhyeong Seo <wonhseo@kakao.com>

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/ko/pipeline_webserver.md
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>

---------
Co-authored-by: Wonhyeong Seo <wonhseo@kakao.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

08e32519

Fix typo in example code (#25583) · 659ab042
Amélie T. Reymond authored Aug 17, 2023
```
`lang_code_to_id("en_XX")` => `lang_code_to_id["en_XX"]`

lang_code_to_id is a dict
```
659ab042
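A short sketch of why the original snippet failed, using a plain dict with made-up ids in place of a real tokenizer's `lang_code_to_id` mapping:

```python
# Hypothetical mapping; real tokenizers assign their own ids.
lang_code_to_id = {"en_XX": 0, "ro_RO": 1}

# Calling a dict raises TypeError, because dicts are not callable.
try:
    lang_code_to_id("en_XX")
except TypeError as err:
    call_error = str(err)

# Subscripting is the correct way to look up a key.
token_id = lang_code_to_id["en_XX"]
```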

17 Aug, 2023 2 commits
add warning for 8bit optimizers (#25575) · 4a27c13f
Marc Sun authored Aug 17, 2023
```
* add warning for 8bit optimizers

* protect import
```
4a27c13f
Skip `test_contrastive_generate` for `TFXLNet` (#25574) · 427adc89
Yih-Dar authored Aug 17, 2023
```
* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
427adc89