Commits · 9333bf0769561c048700377c2e0813221ab9d2c9 · chenpangpang / transformers

24 Oct, 2023 12 commits

[docs] Performance docs refactor p.2 (#26791) · 9333bf07

Maria Khalusova authored Oct 24, 2023



* initial edits

* improvements for clarity and flow

* improvements for clarity and flow, removed the repeated section

* removed two docs that had no content

* Revert "removed two docs that had no content"

This reverts commit e98fa2fa0d8e171163f15cb8a04bdada1053543b.

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* feedback addressed

* more feedback addressed

* feedback addressed

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

9333bf07

Fix config silent copy in from_pretrained (#27043) · 13ef14e1
Patrick von Platen authored Oct 24, 2023
```
* Fix config modeling utils

* fix more

* fix attn mask bug

* Update src/transformers/modeling_utils.py
```
13ef14e1

Device agnostic testing (#25870) · 9da45171

Alex McKinney authored Oct 24, 2023



* adds agnostic decorators and availability fns

* renaming decorators and fixing imports

* updating some representative example tests
bloom, opt, and reformer for now

* wip device agnostic functions

* lru cache to device checking functions

* adds `TRANSFORMERS_TEST_DEVICE_SPEC`
if present, imports the target file and updates device to function
mappings

* comments `TRANSFORMERS_TEST_DEVICE_SPEC` code

* extra checks on device name

* `make style; make quality`

* updates default functions for agnostic calls

* applies suggestions from review

* adds `is_torch_available` guard

* Add spec file to docs, rename function dispatch names to backend_*

* add backend import to docs example for spec file

* change instances of  to

* Move register backend to before device check as per @statelesshz changes

* make style

* make opt test require fp16 to run

---------
Co-authored-by: arsalanu <arsalanu@graphcore.ai>
Co-authored-by: arsalanu <hzji210@gmail.com>

9da45171
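
The `TRANSFORMERS_TEST_DEVICE_SPEC` bullet above describes importing a target file and updating device-to-function mappings. A minimal sketch of that pattern, using only the standard library, might look like the following. The names `DEVICE_NAME`, `MANUAL_SEED_FN`, `BACKEND_MANUAL_SEED`, `load_device_spec`, and `backend_manual_seed` are illustrative assumptions, not a confirmed copy of the library's code.

```python
import importlib.util
import os

# Hypothetical dispatch table: device name -> function that seeds that backend.
BACKEND_MANUAL_SEED = {"default": lambda seed: f"default:{seed}"}


def load_device_spec(env_var="TRANSFORMERS_TEST_DEVICE_SPEC"):
    """Import the file named by `env_var` (if set) and register its functions."""
    path = os.environ.get(env_var)
    if not path:
        return None  # no spec file given, keep the defaults

    spec = importlib.util.spec_from_file_location("device_spec", path)
    if spec is None or spec.loader is None:
        raise ValueError(f"Cannot import device spec file: {path}")
    module = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(module)

    # Assumed convention: the spec file must name the device it describes.
    if not hasattr(module, "DEVICE_NAME"):
        raise AttributeError("Device spec file must define DEVICE_NAME")
    device = module.DEVICE_NAME

    # Update the mapping only for functions the spec file actually provides.
    if hasattr(module, "MANUAL_SEED_FN"):
        BACKEND_MANUAL_SEED[device] = module.MANUAL_SEED_FN
    return device


def backend_manual_seed(device, seed):
    """Dispatch to the device-specific function, falling back to the default."""
    fn = BACKEND_MANUAL_SEED.get(device, BACKEND_MANUAL_SEED["default"])
    return fn(seed)
```

Keeping the mapping in a plain dictionary means a test can call `backend_manual_seed(device, seed)` without knowing which backend is active.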

Add fuyu device map (#26949) · 41496b95
Marc Sun authored Oct 24, 2023
```
* add _no_split_modules

* style

* fix _no_split_modules

* add doc
```
41496b95
add info on TRL docs (#27024) · b18e3140
Leandro von Werra authored Oct 24, 2023
```
* add info on TRL docs

* add TRL link

* tweak text

* tweak text
```
b18e3140
Safe import of rgb_to_id from FE modules (#27037) · cb0c6806
amyeroberts authored Oct 24, 2023
```
Safe import from FE modules
```
cb0c6806
[`TFxxxxForSequenceClassifciation`] Fix the eager mode after #25085 (#25751) · 7bde5d63
Arthur authored Oct 24, 2023
```
* TODOS

* Switch .shape -> shape_list

---------
Co-authored-by: Matt <rocketknight1@gmail.com>
```
7bde5d63

Normalize only if needed (#26049) · e2d6d5ce

Michal Jamroz authored Oct 24, 2023



* Normalize only if needed

* Update examples/pytorch/image-classification/run_image_classification.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* if else in one line

* within block

* one more place, sorry for mess

* import order

* Update examples/pytorch/image-classification/run_image_classification.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

* Update examples/pytorch/image-classification/run_image_classification_no_trainer.py
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

---------
Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>

e2d6d5ce

Add descriptive docstring to WhisperTimeStampLogitsProcessor (#25642) · 576e2823

JP authored Oct 24, 2023



* adding in logit examples for Whisper processor

* adding in updated logits processor for Whisper

* adding in cleaned version of logits processor for Whisper

* adding docstrings for whisper processor

* making sure the formatting is correct

* adding logits after doc builder

* Update src/transformers/generation/logits_process.py

Adding in suggested fix to the LogitProcessor description.
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/logits_process.py
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/logits_process.py

Removing tip per suggestion.
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* Update src/transformers/generation/logits_process.py

Removing redundant code per suggestion.
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>

* adding in revised version

* adding in version with timestamp examples

* Update src/transformers/generation/logits_process.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* enhanced paragraph on behavior of processor

* fixing doc quality issue

* removing the word poem from example

* adding in updated docstring

* adding in new version of file after doc-builder

---------
Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

576e2823

Add `default_to_square_for_size` to `CLIPImageProcessor` (#26965) · fc142bd7
Yih-Dar authored Oct 24, 2023
```
* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
fc142bd7

Xuehai Pan authored Oct 24, 2023

* Register ModelOutput as supported torch pytree nodes

* Test ModelOutput as supported torch pytree nodes

* Update type hints for pytree unflatten functions

cc7803c0

Fix key dtype in GPTJ and CodeGen (#26836) · ede051f1
fxmarty authored Oct 24, 2023
```
* fix key dtype in gptj and codegen

* delay the key cast to a later point

* fix
```
ede051f1

23 Oct, 2023 18 commits

🌐 [i18n-ZH] Translate create_a_model.md into Chinese (#27026) · 32f799db
Yeyang authored Oct 23, 2023
```
docs(zh): translate create_a_model.md
```
32f799db
Fix little typo (#27028) · 25c022d7
Mert Yanık authored Oct 24, 2023

25c022d7

Bugfix device map detr model (#26849) · f370bebd

Pedro Gabriel Gengo Lourenço authored Oct 23, 2023



* Fixed replace_batch_norm when on meta device

* lint fix

* Adding coauthor
Co-authored-by: Pi Esposito <piero.skywalker@gmail.com>

* Removed tests

* Remove unused deps

* Try to fix copy issue

* try fix copy one more time

* Reverted import changes

---------
Co-authored-by: Pi Esposito <piero.skywalker@gmail.com>

f370bebd

translate `preprocessing.md` to Chinese (#26955) · b0d1d7f7

jiaqiw09 authored Oct 24, 2023



* translate preprocessing.md to Chinese

* update files fixing problems mentioned in review

* update files fixing problems mentioned in review

---------
Co-authored-by: jiaqiw <wangjiaqi50@huawei.com>

b0d1d7f7

🌐 [i18n-ZH] Translate multilingual into Chinese (#26935) · 19ae0505

Yeyang authored Oct 23, 2023



translate multilingual into Chinese
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

19ae0505

Remove ambiguous `padding_mask` and instead use a 2D->4D Attn Mask Mapper (#26792) · 33f98cfd

Patrick von Platen authored Oct 23, 2023



* [Attn Mask Converter] refactor attn mask

* up

* Apply suggestions from code review
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

* improve

* rename

* better cache

* renaming

* improve more

* improve

* fix bug

* finalize

* make style & make fix-copies

* correct more

* start moving attention_mask

* fix llama

* improve falcon

* up

* improve more

* improve more

* Update src/transformers/models/owlv2/modeling_owlv2.py

* make style

* make style

* rename to converter

* Apply suggestions from code review

---------
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

33f98cfd
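
The commit title above refers to mapping a 2D attention mask to a 4D one. As a rough illustration of the idea, the sketch below expands a `[batch, seq_len]` padding mask (1 = real token, 0 = padding) into a `[batch, 1, seq_len, seq_len]` additive mask that is also causal. The function name `expand_mask_to_4d` and the use of nested lists with `-inf` are assumptions made to keep the example dependency-free. Tensor implementations commonly use the dtype's minimum value instead.

```python
NEG_INF = float("-inf")


def expand_mask_to_4d(attention_mask, causal=True):
    """Expand a 2D padding mask into a 4D additive attention mask.

    attention_mask: list of rows, each a list of 0/1 flags of equal length.
    Returns nested lists shaped [batch][1][query][key], holding 0.0 where
    attention is allowed and -inf where it is blocked.
    """
    batch = []
    for row in attention_mask:
        seq_len = len(row)
        grid = []
        for q in range(seq_len):
            grid.append(
                [
                    # A key is visible if it is a real token and, in the
                    # causal case, does not lie in the query's future.
                    0.0 if row[k] == 1 and (not causal or k <= q) else NEG_INF
                    for k in range(seq_len)
                ]
            )
        batch.append([grid])  # singleton dimension that broadcasts over heads
    return batch


mask_4d = expand_mask_to_4d([[1, 1, 0]])  # one sequence, right-padded
```

The result is added to the attention scores before the softmax, so blocked positions receive zero weight.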

Translate `pipeline_tutorial.md` to chinese (#26954) · f09a081d

jiaqiw09 authored Oct 23, 2023



* update translation of pipeline_tutorial and preprocessing(Version1.0)

* update translation of pipeline_tutorial and preprocessing(Version2.0)

* update translation docs

* update to fix problems mentioned in review

---------
Co-authored-by: jiaqiw <wangjiaqi50@huawei.com>

f09a081d

Remove token_type_ids from default TF GPT-2 signature (#26962) · f7354a3b
Matt authored Oct 23, 2023
```
Remove token_type_ids from default GPT-2 signature
```
f7354a3b
small typos found (#26988) · c0b5ad94
Rafael Padilla authored Oct 23, 2023
```
just very small typos found
```
c0b5ad94
[`SeamlessM4T`] fix copies with NLLB MoE int8 (#27018) · f9f27b0f
Arthur authored Oct 23, 2023
```
fix copies on newly merged model
```
f9f27b0f
[`NLLB-MoE`] Fix NLLB MoE 4bit inference (#27012) · 244a53e0
Younes Belkada authored Oct 23, 2023
```
fix NLLB MoE 4bit
```
244a53e0

Add Seamless M4T model (#25693) · cb45f71c

Yoach Lacombe authored Oct 23, 2023



* first raw commit

* still POC

* tentative convert script

* almost working speech encoder conversion scripts

* intermediate code for encoder/decoders

* add modeling code

* first version of speech encoder

* make style

* add new adapter layer architecture

* add adapter block

* add first tentative config

* add working speech encoder conversion

* base model convert works now

* make style

* remove unnecessary classes

* remove unnecessary functions

* add modeling code speech encoder

* rework logics

* forward pass of sub components work

* add modeling codes

* some config modifs and modeling code modifs

* save WIP

* new edits

* same output speech encoder

* correct attention mask

* correct attention mask

* fix generation

* new generation logics

* erase comments

* make style

* fix typo

* add some descriptions

* new state

* clean imports

* add tests

* make style

* make beam search and num_return_sequences>1 work

* correct edge case issue

* correct SeamlessM4TConformerSamePadLayer copied from

* replace ACT2FN relu by nn.relu

* remove unnecessary return variable

* move back a class

* change name conformer_attention_mask ->conv_attention_mask

* better nit code

* add some Copied from statements

* small nits

* small nit in dict.get

* rename t2u model -> conditionalgeneration

* ongoing refactoring of structure

* update models architecture

* remove SeamlessM4TMultiModal classes

* add tests

* adapt tests

* some non-working code for vocoder

* add seamlessM4T vocoder

* remove buggy line

* fix some hifigan related bugs

* remove hifigan specific config

* change

* add WIP tokenization

* add seamlessM4T working tokenizer

* update tokenization

* add tentative feature extractor

* Update converting script

* update working FE

* refactor input_values -> input_features

* update FE

* changes in generation, tokenizer and modeling

* make style and add t2u_decoder_input_ids

* add intermediate outputs for ToSpeech models

* add vocoder to speech models

* update valueerror

* update FE with languages

* add vocoder convert

* update config docstrings and names

* update generation code and configuration

* remove todos and update config.pad_token_id to generation_config.pad_token_id

* move block vocoder

* remove unnecessary code and uniformize tospeech code

* add feature extractor import

* make style and fix some copies from

* correct consistency + make fix-copies

* add processor code

* remove comments

* add fast tokenizer support

* correct pad_token_id in M4TModel

* correct config

* update tests and codes + make style

* make some suggested corrections - correct comments and change naming

* rename some attributes

* rename some attributes

* remove unnecessary sequential

* remove option to use dur predictor

* nit

* refactor hifigan

* replace normalize_mean and normalize_var with do_normalize + save lang ids to generation config

* add tests

* change tgt_lang logic

* update generation ToSpeech

* add support import SeamlessM4TProcessor

* fix generate

* make tests

* update integration tests, add option to only return text and update tokenizer fast

* fix wrong function call

* update import and convert script

* update integration tests + update repo id

* correct paths and add first test

* update how new attention masks are computed

* update tests

* take first care of batching in vocoder code

* add batching with the vocoder

* add waveform lengths to model outputs

* make style

* add generate kwargs + forward kwargs of M4TModel

* add docstrings forward methods

* reformat docstrings

* add docstrings t2u model

* add another round of modeling docstrings + reformat speaker_id -> spkr_id

* make style

* fix check_repo

* make style

* add seamlessm4t to toctree

* correct check_config_attributes

* write config docstrings + some modifs

* make style

* add docstrings tokenizer

* add docstrings to processor, fe and tokenizers

* make style

* write first version of model docs

* fix FE + correct FE test

* fix tokenizer + add correct integration tests

* fix most tokenization tests

* make style

* correct most processor test

* add generation tests and fix num_return_sequences > 1

* correct integration tests -still one left

* make style

* correct position embedding

* change numbeams to 1

* refactor some modeling code and correct one test

* make style

* correct typo

* refactor intermediate fnn

* refactor feedforward conformer

* make style

* remove comments

* make style

* fix tokenizer tests

* make style

* correct processor tests

* make style

* correct S2TT integration

* Apply suggestions from Sanchit code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* correct typo

* replace torch.nn->nn + make style

* change Output naming (waveforms -> waveform) and ordering

* nit renaming and formatting

* remove return None when not necessary

* refactor SeamlessM4TConformerFeedForward

* nit typo

* remove almost copied from comments

* add a copied from comment and remove an unnecessary dropout

* remove inputs_embeds from speechencoder

* remove backward compatibility function

* reformat class docstrings for a few components

* remove unnecessary methods

* split over 2 lines smthg hard to read

* make style

* replace two steps offset by one step as suggested

* nice typo

* move warnings

* remove useless lines from processor

* make generation non-standard test more robust

* remove torch.inference_mode from tests

* split integration tests

* enrich md

* rename control_symbol_vocoder_offset->vocoder_offset

* clean convert file

* remove tgt_lang and src_lang from FE

* change generate docstring of ToText models

* update generate docstring of tospeech models

* unify how to deal with text_decoder_input_ids

* add default spkr_id

* unify tgt_lang for t2u_model

* simplify tgt_lang verification

* remove a todo

* change config docstring

* make style

* simplify t2u_tgt_lang_id

* make style

* enrich/correct comments

* enrich .md

* correct typo in docstrings

* add torchaudio dependency

* update tokenizer

* make style and fix copies

* modify SeamlessM4TConverter with new tokenizer behaviour

* make style

* correct small typo docs

* fix import

* update docs and add requirement to tests

* add convert_fairseq2_to_hf in utils/not_doctested.txt

* update FE

* fix imports and make style

* remove torchaudio in FE test

* add seamless_m4t.md to utils/not_doctested.txt

* nits and change the way docstring dataset is loaded

* move checkpoints from ylacombe/ to facebook/ orga

* refactor warning/error to be in the 119 line width limit

* round overly precise floats

* add stereo audio behaviour

* refactor .md and make style

* enrich docs with more precise architecture description

* readd undocumented models

* make fix-copies

* apply some suggestions

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* correct bug from previous commit

* refactor a parameter allowing to clean the code + some small nits

* clean tokenizer

* make style and fix

* make style

* clean tokenizers arguments

* add precisions for some tests

* move docs from not_tested to slow

* modify tokenizer according to last comments

* add copied from statements in tests

* correct convert script

* correct parameter docstring style

* correct tokenization

* correct multi gpus

* make style

* clean modeling code

* make style

* add copied from statements

* add copied statements

* add support with ASR pipeline

* remove file added inadvertently

* fix docstrings seamlessM4TModel

* add seamlessM4TConfig to OBJECTS_TO_IGNORE because of unconventional markdown

* add seamlessm4t to assisted generation ignored models

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

cb45f71c

Change default `max_shard_size` to smaller value (#26942) · 50d0cf4f
Younes Belkada authored Oct 23, 2023
```
* Update modeling_utils.py

* fixup

* let's change it to 5GB

* fix
```
50d0cf4f
Nits in Llama2 docstring (#26996) · d33d3131
Omar Sanseviero authored Oct 23, 2023
```
Update llama2.md
```
d33d3131
skip two tests (#27013) · ef978d0a
Arthur authored Oct 23, 2023
```
* skip two tests

* skip torch as well

* fixup
```
ef978d0a
python falcon doc-string example typo (#26995) · 45425660
Gema Parreño authored Oct 23, 2023
```
git python falcon typo
```
45425660
Limit to inferior fsspec version (#27010) · 70032949
Lysandre Debut authored Oct 23, 2023
```
Pin fsspec
```
70032949
fix logit-to-multi-hot conversion in example (#26936) · f71c9ccf
YQ authored Oct 23, 2023
```
* fix logit to multi-hot conversion

* add comments

* typo
```
f71c9ccf

20 Oct, 2023 5 commits

Added Telugu [te] translations (#26828) · 093848d3

Akhil authored Oct 21, 2023



* Create index.md

* Create _toctree.yml

* Updated index.md in telugu

* Update _toctree.yml

* Create quicktour.md

* Update quicktour.md

* Create index.md

* Update quicktour.md

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Delete docs/source/hi/index.md

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/te/quicktour.md

Co-authored-by:...

093848d3

Update README_hd.md (#26872) · 224794b0

Biswa Baibhab Subudhi authored Oct 21, 2023



* Update README_hd.md

- Fixed broken links
I hope this small contribution adds value to this project.

* Update README_hd.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

224794b0

Fix Fuyu image scaling bug (#26918) · c030fc89

Pedro Cuenca authored Oct 20, 2023

* Fix Fuyu image scaling bug

It could produce negative padding and hence inference errors for certain
image sizes.

* Fix aspect ratio scaling test

c030fc89
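
The message above notes that scaling could produce negative padding for certain image sizes. The sketch below is not the actual fix. It only illustrates one way to keep padding non-negative: shrink the image by whichever side is more limiting, then pad the remainder. The function name `fit_and_pad` and the 1080x1920 target used in the example are assumptions.

```python
def fit_and_pad(height, width, target_height, target_width):
    """Return (new_height, new_width, pad_bottom, pad_right).

    The image is shrunk (never enlarged) to fit inside the target while
    keeping its aspect ratio, so both padding amounts are always >= 0.
    """
    if height <= target_height and width <= target_width:
        # Already fits: no resize, only padding.
        new_height, new_width = height, width
    elif target_height * width <= target_width * height:
        # Height is the limiting side (integer cross-multiplication avoids
        # floating-point rounding).
        new_height = target_height
        new_width = max(1, width * target_height // height)
    else:
        # Width is the limiting side.
        new_width = target_width
        new_height = max(1, height * target_width // width)

    pad_bottom = target_height - new_height
    pad_right = target_width - new_width
    return new_height, new_width, pad_bottom, pad_right


# A tall, narrow image: checking only one dimension would overshoot here.
tall = fit_and_pad(2000, 500, 1080, 1920)
```

Choosing the limiting side by comparing both ratios is what guarantees that neither `pad_bottom` nor `pad_right` can go below zero.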

fix set_transform link docs (#26856) · 9b197669

Diego Machado authored Oct 20, 2023



* fix set_transform link

* Update docs/source/en/preprocessing.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* use doc-builder syntax

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

9b197669

[docstring] Fix docstring for speech-to-text config (#26883) · 929134bf

Adam Ross authored Oct 20, 2023

* Fix docstring for speech-to-text config

* Refactor doc line len <= 119 char

* Remove Speech2TextConfig from OBJECTS_TO_IGNORE

* Fix Speech2TextConfig doc str

* Fix Speech2TextConfig doc using doc-builder

* Refactor Speech2TextConfig doc

929134bf

19 Oct, 2023 5 commits
- Corrected modalities description in README_ru.md (#26913) · 08a2edfc
  letohx authored Oct 19, 2023
```
Update README_ru.md

Corrected modalities description in README
```
  08a2edfc
- Generate: update basic llm tutorial (#26937) · ae4fb846
  Joao Gante authored Oct 19, 2023
  
  ae4fb846
- [`FA-2` / `Mistral`] Supprot fa-2 + right padding + forward (#26912) · bc4bbd9f
  Younes Belkada authored Oct 19, 2023
```
supprot fa-2 + right padding + forward
```
  bc4bbd9f
- Pin Keras for now (#26904) · cbd278f0
  Matt authored Oct 19, 2023
```
* Pin Keras for now out of paranoia

* Add the keras pin to _tests_requirements.txt too

* Make sure the Keras version matches the TF one

* make fixup
```
  cbd278f0
- Fix license (#26931) · 73dc23f7
  Mohamed Aymane Farhi authored Oct 19, 2023
  
  73dc23f7