Commits · f9487783228cd500a21555da3346db40e8f05992 · renzhc / diffusers_dcu

05 Dec, 2023 1 commit
- Move kandinsky convert script (#6047) · f9487783
  Dhruv Nair authored Dec 05, 2023
```
move kandinsky convert script
```
  f9487783
04 Dec, 2023 4 commits

[docs] Add Kandinsky 3 (#5988) · b64f835e
Steven Liu authored Dec 04, 2023
```
* add

* fix api docs

* edits
```
b64f835e

[Feature] Support IP-Adapter Plus (#5915) · 0a08d419

takuoko authored Dec 04, 2023



* Support IP-Adapter Plus

* fix format

* restore before black format

* restore before black format

* generic

* Refactor PerceiverAttention

* format

* fix test and refactor PerceiverAttention

* generic encode_image

* keep attention implementation

* merge tests

* encode_image backward compatible

* code quality

* fix controlnet inpaint pipeline

* refactor FFN

* refactor FFN

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

0a08d419

Update Tests Fetcher (#5950) · b2172922

Dhruv Nair authored Dec 04, 2023



* update setup and deps table

* update

* update

* update

* up

* up

* update

* up

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* quality fix

* fix failure reporting

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

b2172922

Update value_guided_sampling.py (#6027) · 8a812e4e

Parth38 authored Dec 03, 2023



* Update value_guided_sampling.py

Changed the scheduler step function as predict_epsilon parameter is not there in latest  DDPM Scheduler

* Update value_guided_sampling.md

Updated a link to a working notebook

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

8a812e4e

02 Dec, 2023 2 commits

[LoRA serialization] fix: duplicate unet prefix problem. (#5991) · d486f0e8

Sayak Paul authored Dec 02, 2023



* fix: duplicate unet prefix problem.

* Update src/diffusers/loaders/lora.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d486f0e8

adapt PixArtAlphaPipeline for pixart-lcm model (#5974) · 4520e122

Junsong Chen authored Dec 02, 2023



* adapt PixArtAlphaPipeline for pixart-lcm model

* remove original_inference_steps from __call__

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

4520e122

01 Dec, 2023 7 commits
- Post Release: v0.24.0 (#5985) · dadd55fb
  Patrick von Platen authored Dec 01, 2023
```
* Post Release: v0.24.0

* post pone deprecation

* post pone deprecation

* Add model_index.json
```
  dadd55fb
- [schedulers] create `self.sigmas` during __init__ (#6006) · 1b6c7ea7
  YiYi Xu authored Dec 01, 2023
```
* fix dpm
* all scheulers
```
  1b6c7ea7
- [Kandinsky 3.0] Follow-up TODOs (#5944) · b41f809a
  YiYi Xu authored Dec 01, 2023
```
clean-up kendinsky 3.0
```
  b41f809a
- added attention_head_dim, attention_type, resolution_idx (#6011) · 5058d27f
  Charchit Sharma authored Dec 01, 2023
  
  5058d27f
- [`logging`] Fix assertion bug (#6012) · 52350703
  M. Tolga Cangöz authored Dec 01, 2023
```
Fix assertion bug
```
  52350703
- [From Single File] Allow Text Encoder to be passed (#6020) · bc1d28c8
  Patrick von Platen authored Dec 01, 2023
```
Allow text encoder to be passed
```
  bc1d28c8
- Remove a duplicated line? (#6010) · 7d4a257c
  Jongho Choi authored Dec 01, 2023
```
Update __init__.py
```
  7d4a257c
29 Nov, 2023 6 commits

Add SVD (#5895) · 63f767ef

Suraj Patil authored Nov 29, 2023



* begin model

* finish blocks

* add_embedding

* addition_time_embed_dim

* use TimestepEmbedding

* fix temporal res block

* fix time_pos_embed

* fix add_embedding

* add conversion script

* fix model

* up

* add new resnet blocks

* make forward work

* return sample in original shape

* fix temb shape in TemporalResnetBlock

* add spatio temporal transformers

* add vae blocks

* fix blocks

* update

* update

* fix shapes in Alphablender and add time activation in res blcok

* use new blocks

* style

* fix temb shape

* fix SpatioTemporalResBlock

* reuse TemporalBasicTransformerBlock

* fix TemporalBasicTransformerBlock

* use TransformerSpatioTemporalModel

* fix TransformerSpatioTemporalModel

* fix time_context dim

* clean up

* make temb optional

* add blocks

* rename model

* update conversion script

* remove UNetMidBlockSpatioTemporal

* add in init

* remove unused arg

* remove unused arg

* remove more unsed args

* up

* up

* check for None

* update vae

* update up/mid blocks for decoder

* begin pipeline

* adapt scheduler

* add guidance scalings

* fix norm eps in temporal transformers

* add temporal autoencoder

* make pipeline run

* fix frame decodig

* decode in float32

* decode n frames at a time

* pass decoding_t to decode_latents

* fix decode_latents

* vae encode/decode in fp32

* fix dtype in TransformerSpatioTemporalModel

* type image_latents same as image_embeddings

* allow using differnt eps in temporal block for video decoder

* fix default values in vae

* pass num frames in decode

* switch spatial to temporal for mixing in VAE

* fix num frames during split decoding

* cast alpha to sample dtype

* fix attention in MidBlockTemporalDecoder

* fix typo

* fix guidance_scales dtype

* fix missing activation in TemporalDecoder

* skip_post_quant_conv

* add vae conversion

* style

* take guidance scale as input

* up

* allow passing PIL to export_video

* accept fps as arg

* add pipeline and vae in init

* remove hack

* use AutoencoderKLTemporalDecoder

* don't scale image latents

* add unet tests

* clean up unet

* clean TransformerSpatioTemporalModel

* add slow svd test

* clean up

* make temb optional in Decoder mid block

* fix norm eps in TransformerSpatioTemporalModel

* clean up temp decoder

* clean up

* clean up

* use c_noise values for timesteps

* use math for log

* update

* fix copies

* doc

* upcast vae

* update forward pass for gradient checkpointing

* make added_time_ids is tensor

* up

* fix upcasting

* remove post quant conv

* add _resize_with_antialiasing

* fix _compute_padding

* cleanup model

* more cleanup

* more cleanup

* more cleanup

* remove freeu

* remove attn slice

* small clean

* up

* up

* remove extra step kwargs

* remove eta

* remove dropout

* remove callback

* remove merge factor args

* clean

* clean up

* move to dedicated folder

* remove attention_head_dim

* docstr and small fix

* update unet doc strings

* rename decoding_t

* correct linting

* store c_skip and c_out

* cleanup

* clean TemporalResnetBlock

* more cleanup

* clean up vae

* clean up

* begin doc

* more cleanup

* up

* up

* doc

* Improve

* better naming

* better naming

* better naming

* better naming

* better naming

* better naming

* better naming

* better naming

* Apply suggestions from code review

* Default chunk size to None

* add example

* Better

* Apply suggestions from code review

* update doc

* Update src/diffusers/pipelines/stable_diffusion_video/pipeline_stable_diffusion_video.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* style

* Get torch compile working

* up

* rename

* fix doc

* add chunking

* torch compile

* torch compile

* add modelling outputs

* torch compile

* Improve chunking

* Apply suggestions from code review

* Update docs/source/en/using-diffusers/svd.md

* Close diff tag

* remove slicing

* resnet docstr

* add docstr in resnet

* rename

* Apply suggestions from code review

* update tests

* Fix output type latents

* fix more

* fix more

* Update docs/source/en/using-diffusers/svd.md

* fix more

* add pipeline tests

* remove unused arg

* clean  up

* make sure get_scaling receives tensors

* fix euler scheduler

* fix get_scalings

* simply euler for now

* remove old test file

* use randn_tensor to create noise

* fix device for rand tensor

* increase expected_max_difference

* fix test_inference_batch_single_identical

* actually fix test_inference_batch_single_identical

* disable test_save_load_float16

* skip test_float16_inference

* skip test_inference_batch_single_identical

* fix test_xformers_attention_forwardGenerator_pass

* Apply suggestions from code review

* update StableVideoDiffusionPipelineSlowTests

* update image

* add diffusers example

* fix more

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: apolinário <joaopaulo.passos@gmail.com>

63f767ef

Fixed custom module importing on Windows (#5891) · d1b2a1a9

PENGUINLIONG authored Nov 29, 2023



* Fixed custom module importing on Windows

Windows use back slash and `os.path.join()` follows that convention.

* Apply suggestions from code review
Co-authored-by: Lucain <lucainp@gmail.com>

* Update pipeline_utils.py

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Lucain <lucainp@gmail.com>

d1b2a1a9

[Pipeline] Add TextToVideoZeroSDXLPipeline (#4695) · d63a498c

vahramtadevosyan authored Nov 29, 2023



* integrated sdxl for the text2video-zero pipeline

* make fix-copies

* fixed CI issues

* make fix-copies

* added docs and `copied from` statements

* added fast tests

* made a small change in docs

* quality+style check fix

* updated docs. added controlnet inference with sdxl

* added device compatibility for fast tests

* fixed docstrings

* changing vae upcasting

* remove torch.empty_cache to speed up inference
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* made fast tests to run on dummy models only, fixed copied from statements

* fixed testing utils imports

* Added bullet points for SDXL support

* fixed formatting & quality

* Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fixed minor error for merging

* fixed updates of sdxl

* made fast tests inherit from `PipelineTesterMixin` and run in 3-4secs on CPU

* make style && make quality

* reimplemented fast tests w/o default attn processor

* make style & make quality

* make fix-copies

* make fix-copies

* fixed docs

* make style & make quality & make fix-copies

* bug fix in cross attention

* make style && make quality

* make fix-copies

* fix gpu issues

* make fix-copies

* updated pipeline signature

---------
Co-authored-by: Vahram <vahram.tadevosyan@lambda-loginnode02.cm.cluster>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

d63a498c

Controlnet ssd 1b support (#5779) · 6a4aad43

Marko Kostiv authored Nov 29, 2023



* Add SSD-1B support for controlnet model

* Add conditioning_channels into ControlNet init from unet

* Fix black formatting

* Isort fixes

* Adds SSD-1B controlnet pipeline test with UNetMidBlock2D as mid block

* Overrides failing ssd-1b tests

* Fixes tests after main branch update

* Fixes code quality checks

---------
Co-authored-by: Marko Kostiv <marko@linearity.io>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

6a4aad43

Support of ip-adapter to the StableDiffusionControlNetInpaintPipeline (#5887) · 9f7b2cf2

JuanCarlosPi authored Nov 29, 2023



* Change pipeline_controlnet_inpaint.py to add ip-adapter support. Changes are similar to those in pipeline_controlnet

* Change tests for the StableDiffusionControlNetInpaintPipeline by adding image_encoder: None

* Update src/diffusers/pipelines/controlnet/pipeline_controlnet_inpaint.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

9f7b2cf2

[LoRA refactor] move several state dict conversion utils out of lora.py (#5955) · 895c4b70

Sayak Paul authored Nov 29, 2023



* move several state dict conversion utils out of lora.py

* check

* check

* check

* check

* check

* check

* check

* revert back

* check

* check

* again check

* maybe fix?

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

895c4b70

28 Nov, 2023 2 commits

[ldm3d] Ldm3d upscaler to community pipeline (#5870) · 5ae3c3a5

estelleafl authored Nov 28, 2023





---------
Co-authored-by: Aflalo <estellea@isl-gpu27.rr.intel.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

5ae3c3a5

fix: minor typo in docstring (#5961) · 21bc59ab
Soumik Rakshit authored Nov 28, 2023

21bc59ab

27 Nov, 2023 9 commits

[load_textual_inversion]: allow multiple tokens (#5837) · d9075be4
YiYi Xu authored Nov 27, 2023
```
Co-authored-by: yiyixuxu <yixu310@gmail,com>
```
d9075be4
[From_pretrained] Fix warning (#5948) · b135b6e9
Patrick von Platen authored Nov 27, 2023

b135b6e9

[Vae] Make sure all vae's work with latent diffusion models (#5880) · e550163b

Patrick von Platen authored Nov 27, 2023

* add comments to explain the code better

* add comments to explain the code better

* add comments to explain the code better

* add comments to explain the code better

* add comments to explain the code better

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

e550163b

Replace multiple variables with one variable. (#5715) · d72a24b7

Chi authored Nov 27, 2023



* I added a new doc string to the class. This is more flexible to understanding other developers what are doing and where it's using.

* Update src/diffusers/models/unet_2d_blocks.py

This changes suggest by maintener.
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update src/diffusers/models/unet_2d_blocks.py

Add suggested text
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update unet_2d_blocks.py

I changed the Parameter to Args text.

* Update unet_2d_blocks.py

proper indentation set in this file.

* Update unet_2d_blocks.py

a little bit of change in the act_fun argument line.

* I run the black command to reformat style in the code

* Update unet_2d_blocks.py

similar doc-string add to have in the original diffusion repository.

* I enhanced the code by replacing multiple redundant variables with a single variable, as they all served the same purpose. Additionally, I utilized the get_activation function for improved flexibility in choosing activation functions.

* Using as black package to reformated my file

* reverte some changes

* Remove conv_out_padding variables and using as conv_in_padding

* conv_out_padding create and add them into the code.

* run black command to solving styling problem

* add little bit space between comment and import statement

* I am utilizing the ruff library to address the style issues in my Makefile.

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d72a24b7

Avoid computing min() that is expensive when do_normalize is False in the image processor (#5896) · c079cae3
Iván de Prado authored Nov 27, 2023
```
Avoid computing min() that is expensive when do_normalize is False

Avoid extra computing when do_normalize is False
```
c079cae3

Add Custom Timesteps Support to LCMScheduler and Supported Pipelines (#5874) · 67d07074

dg845 authored Nov 27, 2023

* Add custom timesteps support to LCMScheduler.

* Add custom timesteps support to StableDiffusionPipeline.

* Add custom timesteps support to StableDiffusionXLPipeline.

* Add custom timesteps support to remaining Stable Diffusion pipelines which support LCMScheduler (img2img, inpaint).

* Add custom timesteps support to remaining Stable Diffusion XL pipelines which support LCMScheduler (img2img, inpaint).

* Add custom timesteps support to StableDiffusionControlNetPipeline.

* Add custom timesteps support to T21 Stable Diffusion (XL) Adapters.

* Clean up Stable Diffusion inpaint tests.

* Manually add support for custom timesteps to AltDiffusion pipelines since make fix-copies doesn't appear to work correctly (it deletes the whole pipeline).

* make style

* Refactor pipeline timestep handling into the retrieve_timesteps function.

67d07074

Deprecate KarrasVeScheduler and ScoreSdeVpScheduler (#5269) · 9c357bda

Aryan V S authored Nov 27, 2023



* deprecated: KarrasVeScheduler, ScoreSdeVpScheduler

* delete tests relevant to deprecated schedulers

* chore: run make style

* fix: import error caused due to incorrect _import_structure after deprecation

* fix: ScoreSdeVpScheduler was not importable from diffusers

* remove import added by assumption

* Update src/diffusers/schedulers/__init__.py as suggested by @patrickvonplaten
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* make it a part deprecated

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Fix

* fix

* fix doc

* fix doc....again.......

* remove karras_ve test folder
Co-Authored-By: YiYi Xu <yixu310@gmail.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail,com>

9c357bda

[Core] add support for gradient checkpointing in transformer_2d (#5943) · 3f7c3511
Sayak Paul authored Nov 27, 2023
```
add support for gradient checkpointing in transformer_2d
```
3f7c3511

[Fix: pixart-alpha] random 512px resolution bug (#5842) · 7d6f30e8

Junsong Chen authored Nov 27, 2023



* [Fix: pixart-alpha]
add ASPECT_RATIO_512_BIN in use_resolution_binning for random 512px image generation.

* add slow test file for 512px generation without resolution binning

* fix: slow tests for resolution binning.

---------
Co-authored-by: jschen <chenjunsong4@h-partners.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

7d6f30e8

24 Nov, 2023 2 commits

correct num inference steps · 2a7f43a7
Patrick von Platen authored Nov 24, 2023

2a7f43a7

[@cene555][Kandinsky 3.0] Add Kandinsky 3.0 (#5913) · b978334d

Patrick von Platen authored Nov 24, 2023

* finalize

* finalize

* finalize

* add slow test

* add slow test

* add slow test

* Fix more

* add slow test

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

* Better

* Fix more

* Fix more

* add slow test

* Add auto pipelines

* add slow test

* Add all

* add slow test

* add slow test

* add slow test

* add slow test

* add slow test

* Apply suggestions from code review

* add slow test

* add slow test

b978334d

21 Nov, 2023 4 commits

[Lora] Seperate logic (#5809) · 13d73d93

Patrick von Platen authored Nov 21, 2023

* [Lora] Seperate logic

* [Lora] Seperate logic

* [Lora] Seperate logic

* add comments to explain the code better

* add comments to explain the code better

13d73d93

[feat] IP Adapters (author @okotaku ) (#5713) · ba352aea

YiYi Xu authored Nov 21, 2023



* add ip-adapter


---------
Co-authored-by: okotaku <to78314910@gmail.com>
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail,com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

ba352aea

[docs] MusicLDM (#5854) · 1093f9d6
Steven Liu authored Nov 21, 2023
```
* fix

* feedback
```
1093f9d6

Addition of new callbacks to controlnets (#5812) · 81780882

Aryan V S authored Nov 21, 2023



* add new callbacks to src/diffusers/pipelines/controlnet/pipeline_controlnet.py

* update callbacks

* fix repeated kwarg

* update

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

81780882

20 Nov, 2023 3 commits

[docs] Loader APIs (#5813) · 7457aa67

Steven Liu authored Nov 20, 2023

* first draft

* remove old loader doc

* start adding lora code examples

* finish

* add link to loralinearlayer

* feedback

* fix

7457aa67

Update LCMScheduler Inference Timesteps to be More Evenly Spaced (#5836) · dc21498b

dg845 authored Nov 20, 2023



* Change LCMScheduler.set_timesteps to pick more evenly spaced inference timesteps.

* Change inference_indices implementation to better match previous behavior.

* Add num_inference_steps=26 test case to test_inference_steps.

* run CI

---------
Co-authored-by: patil-suraj <surajp815@gmail.com>

dc21498b

[JAX] Replace uses of jax.devices("cpu") with jax.local_devices(backend="cpu") (#5864) · 2695ba8e

Roy Hvaara authored Nov 20, 2023

An upcoming change to JAX will include non-local (addressable) CPU devices in jax.devices() when JAX is used multicontroller-style, where there are multiple Python processes.

This change preserves the current behavior by replacing uses of jax.devices("cpu"), which previously only returned local devices, with jax.local_devices("cpu"), which will return local devices both now and in the future.

This change is always safe (i.e., it should always preserve the previous behavior), but it may sometimes be unnecessary if code is never used in a multicontroller setting.
Co-authored-by: Peter Hawkins <phawkins@google.com>

2695ba8e