Commits · 9b5180cb5f00799ec47b778533db9dcbf83ceda4 · renzhc / diffusers_dcu

07 Aug, 2024 5 commits

Flux fp16 inference fix (#9097) · 9b5180cb

latentCall145 authored Aug 07, 2024



* clipping for fp16

* fix typo

* added fp16 inference to docs

* fix docs typo

* include link for fp16 investigation

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


[core] FreeNoise (#8948) · 16a93f1a

Aryan authored Aug 07, 2024



* initial work draft for freenoise; needs massive cleanup

* fix freeinit bug

* add animatediff controlnet implementation

* revert attention changes

* add freenoise

* remove old helper functions

* add decode batch size param to all pipelines

* make style

* fix copied from comments

* make fix-copies

* make style

* copy animatediff controlnet implementation from #8972

* add experimental support for num_frames not perfectly fitting context length, context stride

* make unet motion model lora work again based on #8995

* copy load video utils from #8972

* copied from AnimateDiff::prepare_latents

* address the case where last batch of frames does not match length of indices in prepare latents

* decode_batch_size->vae_batch_size; batch vae encode support in animatediff vid2vid

* revert sparsectrl and sdxl freenoise changes

* revert pia

* add freenoise tests

* make fix-copies

* improve docstrings

* add freenoise tests to animatediff controlnet

* update tests

* Update src/diffusers/models/unets/unet_motion_model.py

* add freenoise to animatediff pag

* address review comments

* make style

* update tests

* make fix-copies

* fix error message

* remove copied from comment

* fix imports in tests

* update

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>


fix train_dreambooth_lora_sd3.py loading hook (#9107) · 2d753b6f
Sayak Paul authored Aug 07, 2024


[Kolors] Add PAG (#8934) · 39e1f7ea

Álvaro Somoza authored Aug 06, 2024



* txt2img pag added

* autopipe added, fixed case

* style

* apply suggestions

* added fast tests, added todo tests

* revert dummy objects for kolors

* fix pag dummies

* fix test imports

* update pag tests

* add kolor pag to docs

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


[Single File] Add single file support for Flux Transformer (#9083) · e1b603dc
Dhruv Nair authored Aug 07, 2024
```
* update

* update

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```

06 Aug, 2024 7 commits

Fix loading sharded checkpoints when we have variants (#9061) · e4325606

Marc Sun authored Aug 07, 2024



* Fix loading sharded checkpoint when we have variant

* add test

* remove print

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


add PAG support for Stable Diffusion 3 (#8861) · 926daa30

Ahn Donghoon (안동훈 / suno) authored Aug 07, 2024



add pag sd3


---------
Co-authored-by: HyoungwonCho <jhw9811@korea.ac.kr>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: crepejung00 <jaewoojung00@naver.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>


[Docs] Add community projects section to docs (#9013) · 325a5de3
Dhruv Nair authored Aug 06, 2024
```
* update

* update

* update
```
update · 4c6152c2
Dhruv Nair authored Aug 06, 2024


[Tests] Improve transformers model test suite coverage - Hunyuan DiT (#8916) · 87e50a2f

Vinh H. Pham authored Aug 06, 2024



* add hunyuan model test

* apply suggestions

* reduce dims further

* reduce dims further

* run make style

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


[bug] remove unreachable norm_type=ada_norm_continuous from norm3 initialization conditions (#9006) · a57a7af4
Aryan authored Aug 06, 2024
```
remove ada_norm_continuous from norm3 list
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```

[Core] add QKV fusion to AuraFlow and PixArt Sigma (#8952) · 52f1378e

Sayak Paul authored Aug 06, 2024

* add fusion support to pixart

* add to auraflow.

* add tests

* apply review feedback.

* add back args and kwargs

* style


05 Aug, 2024 7 commits

Update `CLIPFeatureExtractor` to `CLIPImageProcessor` and... · 3dc97bd1

Tolga Cangöz authored Aug 05, 2024


Update `CLIPFeatureExtractor` to `CLIPImageProcessor` and `DPTFeatureExtractor` to `DPTImageProcessor` (#9002)

* fix: update `CLIPFeatureExtractor` to `CLIPImageProcessor` in codebase

* `make style && make quality`

* Update `DPTFeatureExtractor` to `DPTImageProcessor` in codebase

* `make style`

---------
Co-authored-by: Aryan <aryan@huggingface.co>


Fix typos (#9077) · 6d32b292
omahs authored Aug 05, 2024
```
* fix typo
```

add sentencepiece as a soft dependency (#9065) · bc3c73ad

YiYi Xu authored Aug 05, 2024



* add sentencepiece as a soft dependency for kolors

* up

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


[Docs] add stable cascade unet doc. (#9066) · 5934873b
Sayak Paul authored Aug 05, 2024
```
* add stable cascade unet doc.

* fix path
```

PAG variant for HunyuanDiT, PAG refactor (#8936) · b7058d14

Aryan authored Aug 05, 2024



* copy hunyuandit pipeline

* pag variant of hunyuan dit

* add tests

* update docs

* make style

* make fix-copies

* Update src/diffusers/pipelines/pag/pag_utils.py

* remove incorrect copied from

* remove pag hunyuan attn procs to resolve conflicts

* add pag attn procs again

* new implementation for pag_utils

* revert pag changes

* add pag refactor back; update pixart sigma

* update pixart pag tests

* apply suggestions from review

Co-Authored-By: yixu310@gmail.com

* make style

* update docs, fix tests

* fix tests

* fix test_components_function since list not accepted as valid __init__ param

* apply patch to fix broken tests
Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com>

* make style

* fix hunyuan tests

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


[Tests] Improve transformers model test suite coverage - Latte (#8919) · e1d508ae

Vinh H. Pham authored Aug 05, 2024



* add LatteTransformer3DModel model test

* change patch_size to 1

* reduce req len

* reduce channel dims

* increase num_layers

* reduce dims further

* run make style

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>


[FLUX] support LoRA (#9057) · fc6a91e3

Sayak Paul authored Aug 05, 2024

* feat: lora support for Flux.

add tests

fix imports

major fixes.

* fix

fixes

final fixes?

* fix

* remove is_peft_available.


04 Aug, 2024 4 commits

[refactor] apply qk norm in attention processors (#9071) · 2b760996

Aryan authored Aug 04, 2024

* apply qk norm in attention processors

* revert attention processor

* qk-norm in only attention proc 2.0 and fused variant


type `get_attention_scores` as optional in `get_attention_scores` (#9075) · 4f0d01d3
psychedelicious authored Aug 04, 2024
```
`None` is valid for `get_attention_scores`, should be typed as such
```

Update TensorRT txt2img and inpaint community pipelines (#9037) · 3dc10a53

asfiyab-nvidia authored Aug 04, 2024



* Update TensorRT txt2img and inpaint community pipelines
Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>

* update tensorrt install instructions
Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>

---------
Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


[Flux] minor documentation fixes for flux. (#9048) · c370b90f
Sayak Paul authored Aug 04, 2024
```
* minor documentation fixes for flux.

* clipskip

* add gist
```

03 Aug, 2024 4 commits

Fix grammar mistake. (#9072) · ebf3ab14
Philip Rideout authored Aug 03, 2024


[refactor] create modeling blocks specific to AnimateDiff (#8979) · fbe29c62

Aryan authored Aug 03, 2024



* animatediff specific transformer model

* make style

* make fix-copies

* move blocks to unet motion model

* make style

* remove dummy object

* fix incorrectly passed param causing test failures

* rename model and output class

* fix sparsectrl imports

* remove todo comments

* remove temporal double self attn param from controlnet sparsectrl

* add deprecated versions of blocks

* apply suggestions from review

* update

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>


Errata: Fix typos & `\s+$` (#9008) · 7071b746

Tolga Cangöz authored Aug 03, 2024



* Fix typos

* chore: Fix typos

* chore: Update README.md for promptdiffusion example

* Trim trailing white spaces

* Fix a typo

* update number

* chore: update number

* Trim trailing white space

* Update README.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update README.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>


Update transformer_flux.py (#9060) · a054c784
Frank (Haofan) Wang authored Aug 03, 2024


02 Aug, 2024 3 commits

Fix Nightly Deps (#9036) · b1f43d71
Dhruv Nair authored Aug 02, 2024
```
update
```

[Flux] allow tests to run (#9050) · 0e460675

Sayak Paul authored Aug 02, 2024

* fix tests

* fix

* float64 skip

* remove sample_size.

* remove

* remove more

* default_sample_size.

* credit black forest for flux model.

* skip

* fix: tests

* remove OriginalModelMixin

* add transformer model test

* add: transformer model tests


[Core] Add PAG support for PixArtSigma (#8921) · 7b98c4cc

Sayak Paul authored Aug 02, 2024

* feat: add pixart sigma pag.

* inits.

* fixes

* fix

* remove print.

* copy paste methods to the pixart pag mixin

* fix-copies

* add documentation.

* add tests.

* remove correction file.

* remove pag_applied_layers

* empty


01 Aug, 2024 5 commits

Flux pipeline (#9043) · 27637a54

Sayak Paul authored Aug 02, 2024



add flux!
Signed-off-by: Adrien <adrien@huggingface.co>
Co-authored-by: Adrien <adrien.69740@gmail.com>
Co-authored-by: Anatoly Belikov <abelikov@singularitynet.io>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>


[docs] fix pia example (#9015) · 2ea22e1c
Aryan authored Aug 02, 2024
```
fix pia example docstring
```
fix load sharded checkpoint from a subfolder (local path) (#8913) · 95a78328
YiYi Xu authored Aug 01, 2024
```
fix
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
Updates deps for pipeline test fetcher (#9033) · c646fbc1
Dhruv Nair authored Aug 01, 2024
```
update
```

PAG variant for AnimateDiff (#8789) · 05b706c0

Aryan authored Aug 01, 2024

* add animatediff pag pipeline

* remove unnecessary print

* make fix-copies

* fix ip-adapter bug

* update docs

* add fast tests and fix bugs

* update

* update

* address review comments

* update ip adapter single test expected slice

* implement test_from_pipe_consistent_config; fix expected slice values

* LoraLoaderMixin->StableDiffusionLoraLoaderMixin; add latest freeinit test


30 Jul, 2024 5 commits

Fix Stable Audio repository id (#9016) · ea1b4ea7
Yoach Lacombe authored Jul 30, 2024
```
Fix Stable Audio repo id
```

[core] Move community AnimateDiff ControlNet to core (#8972) · e5b94b4c

Aryan authored Jul 30, 2024



* add animatediff controlnet to core

* make style; remove unused method

* fix copied from comment

* add tests

* changes to make tests work

* add utility function to load videos

* update docs

* update pipeline example

* make style

* update docs with example

* address review comments

* add latest freeinit test from #8969

* LoraLoaderMixin -> StableDiffusionLoraLoaderMixin

* fix docs

* Update src/diffusers/utils/loading_utils.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* fix: variable out of scope

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>


Stable Audio integration (#8716) · 69e72b1d

Yoach Lacombe authored Jul 30, 2024



* WIP modeling code and pipeline

* add custom attention processor + custom activation + add to init

* correct ProjectionModel forward

* add stable audio to __init__

* add autoencoder and update pipeline and modeling code

* add half Rope

* add partial rotary v2

* add temporary modifications to scheduler

* add EDM DPM Solver

* remove TODOs

* clean GLU

* remove att.group_norm to attn processor

* revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py

* refactor GLU -> SwiGLU

* remove redundant args

* add channel multiples in autoencoder docstrings

* changes in docstrings and copyright headers

* clean pipeline

* further cleaning

* remove peft and lora and fromoriginalmodel

* Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace

* make style

* dummy models

* fix copied from

* add fast oobleck tests

* add brownian tree

* oobleck autoencoder slow tests

* remove TODO

* fast stable audio pipeline tests

* add slow tests

* make style

* add first version of docs

* wrap is_torchsde_available to the scheduler

* fix slow test

* test with input waveform

* add input waveform

* remove some todos

* create stableaudio gaussian projection + make style

* add pipeline to toctree

* fix copied from

* make quality

* refactor timestep_features->time_proj

* refactor joint_attention_kwargs->cross_attention_kwargs

* remove forward_chunk

* move StableAudioDitModel to transformers folder

* correct convert + remove partial rotary embed

* apply suggestions from yiyixuxu -> removing attn.kv_heads

* remove temb

* remove cross_attention_kwargs

* further removal of cross_attention_kwargs

* remove text encoder autocast to fp16

* continue removing autocast

* make style

* refactor how text and audio are embedded

* add paper

* update example code

* make style

* unify projection model forward + fix device placement

* make style

* remove fuse qkv

* apply suggestions from review

* Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* make style

* smaller models in fast tests

* pass sequential offloading fast tests

* add docs for vae and autoencoder

* make style and update example

* remove useless import

* add cosine scheduler

* dummy classes

* cosine scheduler docs

* better description of scheduler

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>


[LoRA] fix: animate diff lora stuff. (#8995) · 8c4856cd
Sayak Paul authored Jul 30, 2024
```
* fix: animate diff lora stuff.

* fix scaling function for UNetMotionModel

* empty
```

handle lora scale and clip skip in lpw sd and sdxl community pipelines (#8988) · f240a936

Anatoly Belikov authored Jul 30, 2024



* handle lora scale and clip skip in lpw sd and sdxl

* use StableDiffusionLoraLoaderMixin

* use StableDiffusionXLLoraLoaderMixin

* style

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
