Commits · fc142bd775ae4639f80a8b0085a5df33bd2853ce · chenpangpang / transformers

24 Oct, 2023 3 commits

Add `default_to_square_for_size` to `CLIPImageProcessor` (#26965) · fc142bd7
Yih-Dar authored Oct 24, 2023
```
* fix

* fix

* fix

* fix

* fix

---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
fc142bd7

Xuehai Pan authored Oct 24, 2023

* Register ModelOutput as supported torch pytree nodes

* Test ModelOutput as supported torch pytree nodes

* Update type hints for pytree unflatten functions

cc7803c0

Fix key dtype in GPTJ and CodeGen (#26836) · ede051f1
fxmarty authored Oct 24, 2023
```
* fix key dtype in gptj and codegen

* delay the key cast to a later point

* fix
```
ede051f1

23 Oct, 2023 18 commits

🌐 [i18n-ZH] Translate create_a_model.md into Chinese (#27026) · 32f799db
Yeyang authored Oct 23, 2023
```
docs(zh): translate create_a_model.md
```
32f799db
Fix little typo (#27028) · 25c022d7
Mert Yanık authored Oct 24, 2023

25c022d7

Bugfix device map detr model (#26849) · f370bebd

Pedro Gabriel Gengo Lourenço authored Oct 23, 2023



* Fixed replace_batch_norm when on meta device

* lint fix

* Adding coauthor
Co-authored-by: Pi Esposito <piero.skywalker@gmail.com>

* Removed tests

* Remove unused deps

* Try to fix copy issue

* try fix copy one more time

* Reverted import changes

---------
Co-authored-by: Pi Esposito <piero.skywalker@gmail.com>

f370bebd

translate `preprocessing.md` to Chinese (#26955) · b0d1d7f7

jiaqiw09 authored Oct 24, 2023



* translate preprocessing.md to Chinese

* update files fixing problems mentioned in review

* update files fixing problems mentioned in review

---------
Co-authored-by: jiaqiw <wangjiaqi50@huawei.com>

b0d1d7f7

🌐

[i18n-ZH] Translate multilingual into Chinese (#26935) · 19ae0505

Yeyang authored Oct 23, 2023



translate multilingual into Chinese
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

19ae0505

Remove ambiguous `padding_mask` and instead use a 2D->4D Attn Mask Mapper (#26792) · 33f98cfd

Patrick von Platen authored Oct 23, 2023



* [Attn Mask Converter] refactor attn mask

* up

* Apply suggestions from code review
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

* improve

* rename

* better cache

* renaming

* improve more

* improve

* fix bug

* finalize

* make style & make fix-copies

* correct more

* start moving attention_mask

* fix llama

* improve falcon

* up

* improve more

* improve more

* Update src/transformers/models/owlv2/modeling_owlv2.py

* make style

* make style

* rename to converter

* Apply suggestions from code review

---------
Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com>

33f98cfd

Translate `pipeline_tutorial.md` to chinese (#26954) · f09a081d

jiaqiw09 authored Oct 23, 2023



* update translation of pipeline_tutorial and preprocessing(Version1.0)

* update translation of pipeline_tutorial and preprocessing(Version2.0)

* update translation docs

* update to fix problems mentioned in review

---------
Co-authored-by: jiaqiw <wangjiaqi50@huawei.com>

f09a081d

Remove token_type_ids from default TF GPT-2 signature (#26962) · f7354a3b
Matt authored Oct 23, 2023
```
Remove token_type_ids from default GPT-2 signature
```
f7354a3b
small typos found (#26988) · c0b5ad94
Rafael Padilla authored Oct 23, 2023
```
just very small typos found
```
c0b5ad94
[`SeamlessM4T`] fix copies with NLLB MoE int8 (#27018) · f9f27b0f
Arthur authored Oct 23, 2023
```
fix copies on newly merged model
```
f9f27b0f
[`NLLB-MoE`] Fix NLLB MoE 4bit inference (#27012) · 244a53e0
Younes Belkada authored Oct 23, 2023
```
fix NLLB MoE 4bit
```
244a53e0

Add Seamless M4T model (#25693) · cb45f71c

Yoach Lacombe authored Oct 23, 2023



* first raw commit

* still POC

* tentative convert script

* almost working speech encoder conversion scripts

* intermediate code for encoder/decoders

* add modeling code

* first version of speech encoder

* make style

* add new adapter layer architecture

* add adapter block

* add first tentative config

* add working speech encoder conversion

* base model convert works now

* make style

* remove unnecessary classes

* remove unecessary functions

* add modeling code speech encoder

* rework logics

* forward pass of sub components work

* add modeling codes

* some config modifs and modeling code modifs

* save WIP

* new edits

* same output speech encoder

* correct attention mask

* correct attention mask

* fix generation

* new generation logics

* erase comments

* make style

* fix typo

* add some descriptions

* new state

* clean imports

* add tests

* make style

* make beam search and num_return_sequences>1 works

* correct edge case issue

* correct SeamlessM4TConformerSamePadLayer copied from

* replace ACT2FN relu by nn.relu

* remove unecessary return variable

* move back a class

* change name conformer_attention_mask ->conv_attention_mask

* better nit code

* add some Copied from statements

* small nits

* small nit in dict.get

* rename t2u model -> conditionalgeneration

* ongoing refactoring of structure

* update models architecture

* remove SeamlessM4TMultiModal classes

* add tests

* adapt tests

* some non-working code for vocoder

* add seamlessM4T vocoder

* remove buggy line

* fix some hifigan related bugs

* remove hifigan specifc config

* change

* add WIP tokenization

* add seamlessM4T working tokenzier

* update tokenization

* add tentative feature extractor

* Update converting script

* update working FE

* refactor input_values -> input_features

* update FE

* changes in generation, tokenizer and modeling

* make style and add t2u_decoder_input_ids

* add intermediate outputs for ToSpeech models

* add vocoder to speech models

* update valueerror

* update FE with languages

* add vocoder convert

* update config docstrings and names

* update generation code and configuration

* remove todos and update config.pad_token_id to generation_config.pad_token_id

* move block vocoder

* remove unecessary code and uniformize tospeech code

* add feature extractor import

* make style and fix some copies from

* correct consistency + make fix-copies

* add processor code

* remove comments

* add fast tokenizer support

* correct pad_token_id in M4TModel

* correct config

* update tests and codes  + make style

* make some suggested correstion - correct comments and change naming

* rename some attributes

* rename some attributes

* remove unecessary sequential

* remove option to use dur predictor

* nit

* refactor hifigan

* replace normalize_mean and normalize_var with do_normalize + save lang ids to generation config

* add tests

* change tgt_lang logic

* update generation ToSpeech

* add support import SeamlessM4TProcessor

* fix generate

* make tests

* update integration tests, add option to only return text and update tokenizer fast

* fix wrong function call

* update import and convert script

* update integration tests + update repo id

* correct paths and add first test

* update how new attention masks are computed

* update tests

* take first care of batching in vocoder code

* add batching with the vocoder

* add waveform lengths to model outputs

* make style

* add generate kwargs + forward kwargs of M4TModel

* add docstrings forward methods

* reformate docstrings

* add docstrings t2u model

* add another round of modeling docstrings + reformate speaker_id -> spkr_id

* make style

* fix check_repo

* make style

* add seamlessm4t to toctree

* correct check_config_attributes

* write config docstrings + some modifs

* make style

* add docstrings tokenizer

* add docstrings to processor, fe and tokenizers

* make style

* write first version of model docs

* fix FE + correct FE test

* fix tokenizer + add correct integration tests

* fix most tokenization tests

* make style

* correct most processor test

* add generation tests and fix num_return_sequences > 1

* correct integration tests -still one left

* make style

* correct position embedding

* change numbeams to 1

* refactor some modeling code and correct one test

* make style

* correct typo

* refactor intermediate fnn

* refactor feedforward conformer

* make style

* remove comments

* make style

* fix tokenizer tests

* make style

* correct processor tests

* make style

* correct S2TT integration

* Apply suggestions from Sanchit code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>

* correct typo

* replace torch.nn->nn + make style

* change Output naming (waveforms -> waveform) and ordering

* nit renaming and formating

* remove return None when not necessary

* refactor SeamlessM4TConformerFeedForward

* nit typo

* remove almost copied from comments

* add a copied from comment and remove an unecessary dropout

* remove inputs_embeds from speechencoder

* remove backward compatibiliy function

* reformate class docstrings for a few components

* remove unecessary methods

* split over 2 lines smthg hard to read

* make style

* replace two steps offset by one step as suggested

* nice typo

* move warnings

* remove useless lines from processor

* make generation non-standard test more robusts

* remove torch.inference_mode from tests

* split integration tests

* enrich md

* rename control_symbol_vocoder_offset->vocoder_offset

* clean convert file

* remove tgt_lang and src_lang from FE

* change generate docstring of ToText models

* update generate docstring of tospeech models

* unify how to deal withtext_decoder_input_ids

* add default spkr_id

* unify tgt_lang for t2u_model

* simplify tgt_lang verification

* remove a todo

* change config docstring

* make style

* simplify t2u_tgt_lang_id

* make style

* enrich/correct comments

* enrich .md

* correct typo in docstrings

* add torchaudio dependency

* update tokenizer

* make style and fix copies

* modify SeamlessM4TConverter with new tokenizer behaviour

* make style

* correct small typo docs

* fix import

* update docs and add requirement to tests

* add convert_fairseq2_to_hf in utils/not_doctested.txt

* update FE

* fix imports and make style

* remove torchaudio in FE test

* add seamless_m4t.md to utils/not_doctested.txt

* nits and change the way docstring dataset is loaded

* move checkpoints from ylacombe/ to facebook/ orga

* refactor warning/error to be in the 119 line width limit

* round overly precised floats

* add stereo audio behaviour

* refactor .md and make style

* enrich docs with more precised architecture description

* readd undocumented models

* make fix-copies

* apply some suggestions

* Apply suggestions from code review
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* correct bug from previous commit

* refactor a parameter allowing to clean the code + some small nits

* clean tokenizer

* make style and fix

* make style

* clean tokenizers arguments

* add precisions for some tests

* move docs from not_tested to slow

* modify tokenizer according to last comments

* add copied from statements in tests

* correct convert script

* correct parameter docstring style

* correct tokenization

* correct multi gpus

* make style

* clean modeling code

* make style

* add copied from statements

* add copied statements

* add support with ASR pipeline

* remove file added inadvertently

* fix docstrings seamlessM4TModel

* add seamlessM4TConfig to OBJECTS_TO_IGNORE due of unconventional markdown

* add seamlessm4t to assisted generation ignored models

---------
Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

cb45f71c

Change default `max_shard_size` to smaller value (#26942) · 50d0cf4f
Younes Belkada authored Oct 23, 2023
```
* Update modeling_utils.py

* fixup

* let's change it to 5GB

* fix
```
50d0cf4f
Nits in Llama2 docstring (#26996) · d33d3131
Omar Sanseviero authored Oct 23, 2023
```
Update llama2.md
```
d33d3131
skip two tests (#27013) · ef978d0a
Arthur authored Oct 23, 2023
```
* skip two tests

* skip torch as well

* fixup
```
ef978d0a
python falcon doc-string example typo (#26995) · 45425660
Gema Parreño authored Oct 23, 2023
```
git python falcon typo
```
45425660
Limit to inferior fsspec version (#27010) · 70032949
Lysandre Debut authored Oct 23, 2023
```
Pin fsspec
```
70032949
fix logit-to-multi-hot conversion in example (#26936) · f71c9ccf
YQ authored Oct 23, 2023
```
* fix logit to multi-hot converstion

* add comments

* typo
```
f71c9ccf

20 Oct, 2023 5 commits

Added Telugu [te] translations (#26828) · 093848d3

Akhil authored Oct 21, 2023



* Create index.md

* Create _toctree.yml

* Updated index.md in telugu

* Update _toctree.yml

* Create quicktour.md

* Update quicktour.md

* Create index.md

* Update quicktour.md

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Delete docs/source/hi/index.md

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/te/quicktour.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/te/quicktour.md

Co-authored-by:...

093848d3

Update README_hd.md (#26872) · 224794b0

Biswa Baibhab Subudhi authored Oct 21, 2023



* Update README_hd.md

- Fixed broken links
I hope this small contribution adds value to this project.

* Update README_hd.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

224794b0

Fix Fuyu image scaling bug (#26918) · c030fc89

Pedro Cuenca authored Oct 20, 2023

* Fix Fuyu image scaling bug

It could produce negative padding and hence inference errors for certain
image sizes.

* Fix aspect ratio scaling test

c030fc89

fix set_transform link docs (#26856) · 9b197669

Diego Machado authored Oct 20, 2023



* fix set_transform link

* Update docs/source/en/preprocessing.md
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* use doc-builder sintax

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

9b197669

[docstring] Fix docstring for speech-to-text config (#26883) · 929134bf

Adam Ross authored Oct 20, 2023

* Fix docstring for speech-to-text config

* Refactor doc line len <= 119 char

* Remove Speech2TextConfig from OBJECTS_TO_IGNORE

* Fix Speech2TextConfig doc str

* Fix Speech2TextConfig doc using doc-builder

* Refactor Speech2TextConfig doc

929134bf

19 Oct, 2023 9 commits

Corrected modalities description in README_ru.md (#26913) · 08a2edfc
letohx authored Oct 19, 2023
```
Update README_ru.md

Corrected modalities description in README
```
08a2edfc
Generate: update basic llm tutorial (#26937) · ae4fb846
Joao Gante authored Oct 19, 2023

ae4fb846
[`FA-2` / `Mistral`] Supprot fa-2 + right padding + forward (#26912) · bc4bbd9f
Younes Belkada authored Oct 19, 2023
```
supprot fa-2 + right padding + forward
```
bc4bbd9f

Pin Keras for now (#26904) · cbd278f0

Matt authored Oct 19, 2023

* Pin Keras for now out of paranoia

* Add the keras pin to _tests_requirements.txt too

* Make sure the Keras version matches the TF one

* make fixup

cbd278f0

Fix license (#26931) · 73dc23f7
Mohamed Aymane Farhi authored Oct 19, 2023

73dc23f7

[docstring] Fix docstrings for `CodeGen` (#26821) · ad08137e

Daniil authored Oct 19, 2023



* remove docstrings CodeGen from objects_to_ignore

* autofix codegen docstrings

* fill in the missing types and docstrings

* fixup

* change descriptions to be in a separate line

* apply docstring suggestions from code review
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

* update n_ctx description in CodeGenConfig

---------
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>

ad08137e

Fix and re-enable ConversationalPipeline tests (#26907) · bdbcd5d4

Matt authored Oct 19, 2023

* Fix and re-enable conversationalpipeline tests

* Fix the batch test so the change only applies to conversational pipeline

bdbcd5d4

[Docs] Make sure important decode and generate method are nicely displayed in Whisper docs (#26927) · 734dd96e
Patrick von Platen authored Oct 19, 2023
```
better docstrings whisper
```
734dd96e

[docstring] Fix docstring for `ChineseCLIP` (#26880) · 816c2237

Sparty authored Oct 19, 2023



* Remove ChineseCLIPImageProcessor, ChineseCLIPTextConfig, ChineseCLIPVisionConfig from check_docstrings

* Run fix_and_overwrite for ChineseCLIPImageProcessor, ChineseCLIPTextConfig, ChineseCLIPVisionConfig

* Replace <fill_type> and <fill_docstring> in configuration_chinese_clip.py, image_processing_chinese_clip.py with type and docstring values

---------
Co-authored-by: vignesh-raghunathan <vignesh_raghunathan@intuit.com>

816c2237

18 Oct, 2023 5 commits

[`FA-2`] Revert suggestion that broke FA2 fine-tuning with quantized models (#26916) · 574a5384
Younes Belkada authored Oct 19, 2023
```
revert
```
574a5384

Add fuyu model (#26911) · caa0ff0b

Pablo Montalvo authored Oct 19, 2023



* initial commit

* add processor, add fuyu naming

* add draft processor

* fix processor

* remove dropout to fix loading of weights

* add image processing fixes from Pedro

* fix

* fix processor

* add basic processing fuyu test

* add documentation and TODO

* address comments, add tests, add doc

* replace assert with torch asserts

* add Mixins and fix tests

* clean imports

* add model tester, clean imports

* fix embedding test

* add updated tests from pre-release model

* Processor: return input_ids used for inference

* separate processing and model tests

* relax test tolerance for embeddings

* add test for logit comparison

* make sure fuyu image processor is imported in the init

* fix formattingh

* more formatting issues

* and more

* fixups

* remove some stuff

* nits

* update init

* remove the fuyu file

* Update integration test with release model

* Update conversion script.

The projection is not used, as confirmed by the authors.

* improve geenration

* Remove duplicate function

* Trickle down patches to model call

* processing fuyu updates

* remove things

* fix prepare_inputs_for_generation to fix generate()

* remove model_input

* update

* add generation tests

* nits

* draft leverage automodel and autoconfig

* nits

* fix dtype patch

* address comments, update READMEs and doc, include tests

* add working processing test, remove refs to subsequences

* add tests, remove Sequence classification

* processing

* update

* update the conversion script

* more processing cleanup

* safe import

* take out ModelTesterMixin for early release

* more cl;eanup

* more cleanup

* more cleanup

* and more

* register a buffer

* nits

* add postprocessing of generate output

* nits

* updates

* add one working test

* fix test

* make fixup works

* fixup

* Arthur's updates

* nits

* update

* update

* fix processor

* update tests

* passe more fixups

* fix

* nits

* don't import torch

* skip fuyu config for now

* fixup done

* fixup

* update

* oups

* nits

* Use input embeddings

* no buffer

* update

* styling processing fuyu

* fix test

* update licence

* protect torch import

* fixup and update not doctested

* kwargs should be passed

* udpates

* update the impofixuprts in the test

* protect import

* protecting imports

* protect imports in type checking

* add testing decorators

* protect top level import structure

* fix typo

* fix check init

* move requires_backend to functions

* Imports

* Protect types

---------
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: ArthurZucker <arthur.zucker@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
Co-authored-by: Lysandre <lysandre@huggingface.co>

caa0ff0b

[`FA-2`] Final fix for FA2 dtype (#26846) · 5a73316b

Younes Belkada authored Oct 18, 2023



* final fix for FA2 dtype

* try

* oops

* Update src/transformers/models/falcon/modeling_falcon.py
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* apply fix everywhere

---------
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

5a73316b

[i18n-ZH] Translated fast_tokenizers.md to Chinese (#26910) · 732d2a8a
Yeyang authored Oct 18, 2023
```
docs: translate fast_tokenizers into Chinese
```
732d2a8a
Refactor code part in documentation translated to japanese (#26900) · eec5a3a8
Rockerz authored Oct 18, 2023
```
Refactor code in documentation
```
eec5a3a8