Commits · 325f6c53edf10a7b3f4804d4b38e89f95873d3c2 · renzhc / diffusers_dcu

21 Dec, 2023 7 commits
- [Refactor] move attend and excite out of `stable_diffusion`. (#6261) · 325f6c53
  Sayak Paul authored Dec 21, 2023
```
* move attend and excite out.

* fix: import

* fix diffedit
```
  325f6c53
- [Refactor] move sag out of `stable_diffusion` (#6264) · 9ea6ac1b
  Sayak Paul authored Dec 21, 2023
```
move sag out of .
```
  9ea6ac1b
- [Refactor] move gligen out of stable diffusion. (#6265) · 2c34c7d6
  Sayak Paul authored Dec 21, 2023
```
* move gligen out of stable diffusion.

* fix: import

* fix import module
```
  2c34c7d6
- [Refactor] move k diffusion out of stable_diffusion (#6267) · bffadde1
  Sayak Paul authored Dec 21, 2023
```
move k diffusion out of stable_diffusion
```
  bffadde1
- Revert "move attend and excite out of stable_diffusion" · c5ff469d
  sayakpaul authored Dec 21, 2023
```
This reverts commit bcecfbc8.
```
  c5ff469d
- move attend and excite out of stable_diffusion · bcecfbc8
  sayakpaul authored Dec 21, 2023
  
  bcecfbc8
- [Refactor] move diffedit out of stable_diffusion (#6260) · 6269045c
  Sayak Paul authored Dec 21, 2023
```
* move diffedit out of stable_diffuson

* fix: import

* style

* fix: import
```
  6269045c
19 Dec, 2023 2 commits

[refactor embeddings]pixart-alpha (#6212) · 3e71a206
YiYi Xu authored Dec 19, 2023
```
pixart-alpha
Co-authored-by: yiyixuxu <yixu310@gmail,com>
```
3e71a206

offload the optional module `image_encoder` (#6151) · 57fde871

YiYi Xu authored Dec 18, 2023



* offload image_encoder

* add test

---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

57fde871

18 Dec, 2023 7 commits
- Slow Test for Pipelines minor fixes (#6221) · 781775ea
  Dhruv Nair authored Dec 19, 2023
```
update
```
  781775ea
- [SVD] Fix guidance scale (#6002) · fa3c86be
  Patrick von Platen authored Dec 18, 2023
```
* [SVD] Fix guidance scale

* make style
```
  fa3c86be
- Deprecate Pipelines (#6169) · a0c54828
  Dhruv Nair authored Dec 18, 2023
```
* deprecate pipe

* make style

* update

* add deprecation message

* format

* remove tests for deprecated pipelines

* remove deprecation message

* make style

* fix copies

* clean up

* clean

* clean

* clean

* clean up

* clean up

* clean up toctree

* clean up

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  a0c54828
- [Torch Compile] Fix torch compile for svd vae (#6217) · 8d891e6e
  Patrick von Platen authored Dec 18, 2023
  
  8d891e6e
- [Text-to-Video] Clean up pipeline (#6213) · cce1fe2d
  Patrick von Platen authored Dec 18, 2023
```
* make style

* make style

* make style

* make style
```
  cce1fe2d
- Fix SDXL Inpainting from single file with Refiner Model (#6147) · fcbed3fa
  Dhruv Nair authored Dec 18, 2023
```
* update

* update

* update
```
  fcbed3fa
- [Refactor autoencoders] feat: introduce autoencoders module (#6129) · 56b3b216
  Sayak Paul authored Dec 18, 2023
```
* feat: introduce autoencoders module

* more changes for styling and copy fixing

* path changes in the docs.

* fix: import structure in init.

* fix controlnetxs import
```
  56b3b216
16 Dec, 2023 1 commit
- [Core] feat: enable fused attention projections for other SD and SDXL pipelines (#6179) · 2d94c783
  Sayak Paul authored Dec 16, 2023
```
* feat: enable fused attention projections for other SD and SDXL pipelines

* add: test for SD fused projections.
```
  2d94c783
14 Dec, 2023 1 commit

Add missing subclass docs, Fix broken example in SD_safe (#6116) · 56806cdb

Aryan V S authored Dec 14, 2023



* fix broken example in pipeline_stable_diffusion_safe

* fix typo in pipeline_stable_diffusion_pix2pix_zero

* add missing docs

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

56806cdb

11 Dec, 2023 1 commit
- [`Docs`] Fix typos (#6122) · 0a401b95
  M. Tolga Cangöz authored Dec 11, 2023
```
Fix typos and trim trailing whitespaces
```
  0a401b95
10 Dec, 2023 1 commit

IP adapter support for most pipelines (#5900) · 88bdd97c

Aryan V S authored Dec 10, 2023



* support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_upscale.py

* support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_attend_and_excite.py

* support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_instruct_pix2pix.py

* update tests

* support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_panorama.py

* support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_sag.py

* support ip-adapter in src/diffusers/pipelines/stable_diffusion_safe/pipeline_stable_diffusion_safe.py

* support ip-adapter in src/diffusers/pipelines/latent_consistency_models/pipeline_latent_consistency_text2img.py

* support ip-adapter in src/diffusers/pipelines/latent_consistency_models/pipeline_latent_consistency_img2img.py

* support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_ldm3d.py

* revert changes to sd_attend_and_excite and sd_upscale

* make style

* fix broken tests

* update ip-adapter implementation to latest

* apply suggestions from review

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

88bdd97c

09 Dec, 2023 1 commit

IP-Adapter for StableDiffusionControlNetImg2ImgPipeline (#5901) · 08b453e3

Charchit Sharma authored Dec 09, 2023



* adapter for StableDiffusionControlNetImg2ImgPipeline

* fix-copies

* fix-copies

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

08b453e3

06 Dec, 2023 4 commits

Add ControlNet-XS support (#5827) · e192ae08

UmerHA authored Dec 06, 2023



* Check in 23-10-05

* check-in 23-10-06

* check-in 23-10-07 2pm

* check-in 23-10-08

* check-in 231009T1200

* check-in 230109

* checkin 231010

* init + forward run

* checkin

* checkin

* ControlNetXSModel is now saveable+loadable

* Forward works

* checkin

* Pipeline works with `no_control=True`

* checkin

* debug: save intermediate outputs of resnet

* checkin

* Understood time error + fixed connection error

* checkin

* checkin 231106T1600

* turned off detailled debug prints

* time debug logs

* small fix

* Separated control_scale for connections/time

* simplified debug logging

* Full denoising works with control scale = 0

* aligned logs

* Added control_attention_head_dim param

* Passing n_heads instead of dim_head into ctrl unet

* Fixed ctrl midblock bug

* Cleanup

* Fixed time dtype bug

* checkin

* 1. from_unet, 2. base passed, 3. all unet params

* checkin

* Finished docstrings

* cleanup

* make style

* checkin

* more tests pass

* Fixed tests

* removed debug logs

* make style + quality

* make fix-copies

* fixed documentation

* added cnxs to doc toc

* added control start/end param

* Update controlnetxs_sdxl.md

* tried to fix copies..

* Fixed norm_num_groups in from_unet

* added sdxl-depth test

* created SD2.1 controlnet-xs pipeline

* re-added debug logs

* Adjusting group norm ; readded logs

* Added debug log statements

* removed debug logs ; started tests for sd2.1

* updated sd21 tests

* fixed tests

* fixed tests

* slightly increased error tolerance for 1 test

* make style & quality

* Added docs for CNXS-SD

* make fix-copies

* Fixed sd compile test ; fixed gradient ckpointing

* vae downs = cnxs conditioning downs; removed guess

* make style & quality

* Fixed tests

* fixed test

* Incorporated review feedback

* simplified control model surgery

* fixed tests & make style / quality

* Updated docs; deleted pip & cursor files

* Rolled back minimal change to resnet

* Update resnet.py

* Update resnet.py

* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Incorporated review feedback

* Update docs/source/en/api/pipelines/controlnetxs_sdxl.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/pipelines/controlnet_xs/pipeline_controlnet_xs.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/pipelines/controlnet_xs/pipeline_controlnet_xs_sd_xl.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Incorporated doc feedback

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

e192ae08

Harmonize HF environment variables + deprecate use_auth_token (#6066) · 75ada250
Lucain authored Dec 06, 2023
```
* Harmonize HF environment variables + deprecate use_auth_token

* fix import

* fix
```
75ada250
fix · f90a5139
Dhruv Nair authored Dec 06, 2023

f90a5139

[feat] allow SDXL pipeline to run with fused QKV projections (#6030) · a2bc2e14

Sayak Paul authored Dec 06, 2023



* debug

* from step

* print

* turn sigma a list

* make str

* init_noise_sigma

* comment

* remove prints

* feat: introduce fused projections

* change to a better name

* no grad

* device.

* device

* dtype

* okay

* print

* more print

* fix: unbind -> split

* fix: qkv >-> k

* enable disable

* apply attention processor within the method

* attn processors

* _enable_fused_qkv_projections

* remove print

* add fused projection to vae

* add todos.

* add: documentation and cleanups.

* add: test for qkv projection fusion.

* relax assertions.

* relax further

* fix: docs

* fix-copies

* correct error message.

* Empty-Commit

* better conditioning on disable_fused_qkv_projections

* check

* check processor

* bfloat16 computation.

* check latent dtype

* style

* remove copy temporarily

* cast latent to bfloat16

* fix: vae -> self.vae

* remove print.

* add _change_to_group_norm_32

* comment out stuff that didn't work

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* reflect patrick's suggestions.

* fix imports

* fix: disable call.

* fix more

* fix device and dtype

* fix conditions.

* fix more

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a2bc2e14

05 Dec, 2023 2 commits
- Ldm unet convert fix (#6038) · 4c05f785
  Dhruv Nair authored Dec 05, 2023
```
* fix

* fix ldm conversion

* fix linting
```
  4c05f785
- Move kandinsky convert script (#6047) · f9487783
  Dhruv Nair authored Dec 05, 2023
```
move kandinsky convert script
```
  f9487783
04 Dec, 2023 2 commits

[docs] Add Kandinsky 3 (#5988) · b64f835e
Steven Liu authored Dec 04, 2023
```
* add

* fix api docs

* edits
```
b64f835e

[Feature] Support IP-Adapter Plus (#5915) · 0a08d419

takuoko authored Dec 04, 2023



* Support IP-Adapter Plus

* fix format

* restore before black format

* restore before black format

* generic

* Refactor PerceiverAttention

* format

* fix test and refactor PerceiverAttention

* generic encode_image

* keep attention implementation

* merge tests

* encode_image backward compatible

* code quality

* fix controlnet inpaint pipeline

* refactor FFN

* refactor FFN

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

0a08d419

02 Dec, 2023 1 commit

adapt PixArtAlphaPipeline for pixart-lcm model (#5974) · 4520e122

Junsong Chen authored Dec 02, 2023



* adapt PixArtAlphaPipeline for pixart-lcm model

* remove original_inference_steps from __call__

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

4520e122

01 Dec, 2023 5 commits
- Post Release: v0.24.0 (#5985) · dadd55fb
  Patrick von Platen authored Dec 01, 2023
```
* Post Release: v0.24.0

* post pone deprecation

* post pone deprecation

* Add model_index.json
```
  dadd55fb
- [Kandinsky 3.0] Follow-up TODOs (#5944) · b41f809a
  YiYi Xu authored Dec 01, 2023
```
clean-up kendinsky 3.0
```
  b41f809a
- added attention_head_dim, attention_type, resolution_idx (#6011) · 5058d27f
  Charchit Sharma authored Dec 01, 2023
  
  5058d27f
- [From Single File] Allow Text Encoder to be passed (#6020) · bc1d28c8
  Patrick von Platen authored Dec 01, 2023
```
Allow text encoder to be passed
```
  bc1d28c8
- Remove a duplicated line? (#6010) · 7d4a257c
  Jongho Choi authored Dec 01, 2023
```
Update __init__.py
```
  7d4a257c
29 Nov, 2023 4 commits

Add SVD (#5895) · 63f767ef

Suraj Patil authored Nov 29, 2023



* begin model

* finish blocks

* add_embedding

* addition_time_embed_dim

* use TimestepEmbedding

* fix temporal res block

* fix time_pos_embed

* fix add_embedding

* add conversion script

* fix model

* up

* add new resnet blocks

* make forward work

* return sample in original shape

* fix temb shape in TemporalResnetBlock

* add spatio temporal transformers

* add vae blocks

* fix blocks

* update

* update

* fix shapes in Alphablender and add time activation in res blcok

* use new blocks

* style

* fix temb shape

* fix SpatioTemporalResBlock

* reuse TemporalBasicTransformerBlock

* fix TemporalBasicTransformerBlock

* use TransformerSpatioTemporalModel

* fix TransformerSpatioTemporalModel

* fix time_context dim

* clean up

* make temb optional

* add blocks

* rename model

* update conversion script

* remove UNetMidBlockSpatioTemporal

* add in init

* remove unused arg

* remove unused arg

* remove more unsed args

* up

* up

* check for None

* update vae

* update up/mid blocks for decoder

* begin pipeline

* adapt scheduler

* add guidance scalings

* fix norm eps in temporal transformers

* add temporal autoencoder

* make pipeline run

* fix frame decodig

* decode in float32

* decode n frames at a time

* pass decoding_t to decode_latents

* fix decode_latents

* vae encode/decode in fp32

* fix dtype in TransformerSpatioTemporalModel

* type image_latents same as image_embeddings

* allow using differnt eps in temporal block for video decoder

* fix default values in vae

* pass num frames in decode

* switch spatial to temporal for mixing in VAE

* fix num frames during split decoding

* cast alpha to sample dtype

* fix attention in MidBlockTemporalDecoder

* fix typo

* fix guidance_scales dtype

* fix missing activation in TemporalDecoder

* skip_post_quant_conv

* add vae conversion

* style

* take guidance scale as input

* up

* allow passing PIL to export_video

* accept fps as arg

* add pipeline and vae in init

* remove hack

* use AutoencoderKLTemporalDecoder

* don't scale image latents

* add unet tests

* clean up unet

* clean TransformerSpatioTemporalModel

* add slow svd test

* clean up

* make temb optional in Decoder mid block

* fix norm eps in TransformerSpatioTemporalModel

* clean up temp decoder

* clean up

* clean up

* use c_noise values for timesteps

* use math for log

* update

* fix copies

* doc

* upcast vae

* update forward pass for gradient checkpointing

* make added_time_ids is tensor

* up

* fix upcasting

* remove post quant conv

* add _resize_with_antialiasing

* fix _compute_padding

* cleanup model

* more cleanup

* more cleanup

* more cleanup

* remove freeu

* remove attn slice

* small clean

* up

* up

* remove extra step kwargs

* remove eta

* remove dropout

* remove callback

* remove merge factor args

* clean

* clean up

* move to dedicated folder

* remove attention_head_dim

* docstr and small fix

* update unet doc strings

* rename decoding_t

* correct linting

* store c_skip and c_out

* cleanup

* clean TemporalResnetBlock

* more cleanup

* clean up vae

* clean up

* begin doc

* more cleanup

* up

* up

* doc

* Improve

* better naming

* better naming

* better naming

* better naming

* better naming

* better naming

* better naming

* better naming

* Apply suggestions from code review

* Default chunk size to None

* add example

* Better

* Apply suggestions from code review

* update doc

* Update src/diffusers/pipelines/stable_diffusion_video/pipeline_stable_diffusion_video.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* style

* Get torch compile working

* up

* rename

* fix doc

* add chunking

* torch compile

* torch compile

* add modelling outputs

* torch compile

* Improve chunking

* Apply suggestions from code review

* Update docs/source/en/using-diffusers/svd.md

* Close diff tag

* remove slicing

* resnet docstr

* add docstr in resnet

* rename

* Apply suggestions from code review

* update tests

* Fix output type latents

* fix more

* fix more

* Update docs/source/en/using-diffusers/svd.md

* fix more

* add pipeline tests

* remove unused arg

* clean  up

* make sure get_scaling receives tensors

* fix euler scheduler

* fix get_scalings

* simply euler for now

* remove old test file

* use randn_tensor to create noise

* fix device for rand tensor

* increase expected_max_difference

* fix test_inference_batch_single_identical

* actually fix test_inference_batch_single_identical

* disable test_save_load_float16

* skip test_float16_inference

* skip test_inference_batch_single_identical

* fix test_xformers_attention_forwardGenerator_pass

* Apply suggestions from code review

* update StableVideoDiffusionPipelineSlowTests

* update image

* add diffusers example

* fix more

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: apolinário <joaopaulo.passos@gmail.com>

63f767ef

Fixed custom module importing on Windows (#5891) · d1b2a1a9

PENGUINLIONG authored Nov 29, 2023



* Fixed custom module importing on Windows

Windows use back slash and `os.path.join()` follows that convention.

* Apply suggestions from code review
Co-authored-by: Lucain <lucainp@gmail.com>

* Update pipeline_utils.py

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Lucain <lucainp@gmail.com>

d1b2a1a9

[Pipeline] Add TextToVideoZeroSDXLPipeline (#4695) · d63a498c

vahramtadevosyan authored Nov 29, 2023



* integrated sdxl for the text2video-zero pipeline

* make fix-copies

* fixed CI issues

* make fix-copies

* added docs and `copied from` statements

* added fast tests

* made a small change in docs

* quality+style check fix

* updated docs. added controlnet inference with sdxl

* added device compatibility for fast tests

* fixed docstrings

* changing vae upcasting

* remove torch.empty_cache to speed up inference
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* made fast tests to run on dummy models only, fixed copied from statements

* fixed testing utils imports

* Added bullet points for SDXL support

* fixed formatting & quality

* Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fixed minor error for merging

* fixed updates of sdxl

* made fast tests inherit from `PipelineTesterMixin` and run in 3-4secs on CPU

* make style && make quality

* reimplemented fast tests w/o default attn processor

* make style & make quality

* make fix-copies

* make fix-copies

* fixed docs

* make style & make quality & make fix-copies

* bug fix in cross attention

* make style && make quality

* make fix-copies

* fix gpu issues

* make fix-copies

* updated pipeline signature

---------
Co-authored-by: Vahram <vahram.tadevosyan@lambda-loginnode02.cm.cluster>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

d63a498c

Support of ip-adapter to the StableDiffusionControlNetInpaintPipeline (#5887) · 9f7b2cf2

JuanCarlosPi authored Nov 29, 2023



* Change pipeline_controlnet_inpaint.py to add ip-adapter support. Changes are similar to those in pipeline_controlnet

* Change tests for the StableDiffusionControlNetInpaintPipeline by adding image_encoder: None

* Update src/diffusers/pipelines/controlnet/pipeline_controlnet_inpaint.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

9f7b2cf2

28 Nov, 2023 1 commit
- fix: minor typo in docstring (#5961) · 21bc59ab
  Soumik Rakshit authored Nov 28, 2023
  
  21bc59ab