Commits · 63f767ef15fa59704272ac7320ec23b8c15de246 · renzhc / diffusers_dcu

29 Nov, 2023 13 commits

Suraj Patil authored Nov 29, 2023



* begin model

* finish blocks

* add_embedding

* addition_time_embed_dim

* use TimestepEmbedding

* fix temporal res block

* fix time_pos_embed

* fix add_embedding

* add conversion script

* fix model

* up

* add new resnet blocks

* make forward work

* return sample in original shape

* fix temb shape in TemporalResnetBlock

* add spatio temporal transformers

* add vae blocks

* fix blocks

* update

* update

* fix shapes in Alphablender and add time activation in res blcok

* use new blocks

* style

* fix temb shape

* fix SpatioTemporalResBlock

* reuse TemporalBasicTransformerBlock

* fix TemporalBasicTransformerBlock

* use TransformerSpatioTemporalModel

* fix TransformerSpatioTemporalModel

* fix time_context dim

* clean up

* make temb optional

* add blocks

* rename model

* update conversion script

* remove UNetMidBlockSpatioTemporal

* add in init

* remove unused arg

* remove unused arg

* remove more unsed args

* up

* up

* check for None

* update vae

* update up/mid blocks for decoder

* begin pipeline

* adapt scheduler

* add guidance scalings

* fix norm eps in temporal transformers

* add temporal autoencoder

* make pipeline run

* fix frame decodig

* decode in float32

* decode n frames at a time

* pass decoding_t to decode_latents

* fix decode_latents

* vae encode/decode in fp32

* fix dtype in TransformerSpatioTemporalModel

* type image_latents same as image_embeddings

* allow using differnt eps in temporal block for video decoder

* fix default values in vae

* pass num frames in decode

* switch spatial to temporal for mixing in VAE

* fix num frames during split decoding

* cast alpha to sample dtype

* fix attention in MidBlockTemporalDecoder

* fix typo

* fix guidance_scales dtype

* fix missing activation in TemporalDecoder

* skip_post_quant_conv

* add vae conversion

* style

* take guidance scale as input

* up

* allow passing PIL to export_video

* accept fps as arg

* add pipeline and vae in init

* remove hack

* use AutoencoderKLTemporalDecoder

* don't scale image latents

* add unet tests

* clean up unet

* clean TransformerSpatioTemporalModel

* add slow svd test

* clean up

* make temb optional in Decoder mid block

* fix norm eps in TransformerSpatioTemporalModel

* clean up temp decoder

* clean up

* clean up

* use c_noise values for timesteps

* use math for log

* update

* fix copies

* doc

* upcast vae

* update forward pass for gradient checkpointing

* make added_time_ids is tensor

* up

* fix upcasting

* remove post quant conv

* add _resize_with_antialiasing

* fix _compute_padding

* cleanup model

* more cleanup

* more cleanup

* more cleanup

* remove freeu

* remove attn slice

* small clean

* up

* up

* remove extra step kwargs

* remove eta

* remove dropout

* remove callback

* remove merge factor args

* clean

* clean up

* move to dedicated folder

* remove attention_head_dim

* docstr and small fix

* update unet doc strings

* rename decoding_t

* correct linting

* store c_skip and c_out

* cleanup

* clean TemporalResnetBlock

* more cleanup

* clean up vae

* clean up

* begin doc

* more cleanup

* up

* up

* doc

* Improve

* better naming

* better naming

* better naming

* better naming

* better naming

* better naming

* better naming

* better naming

* Apply suggestions from code review

* Default chunk size to None

* add example

* Better

* Apply suggestions from code review

* update doc

* Update src/diffusers/pipelines/stable_diffusion_video/pipeline_stable_diffusion_video.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* style

* Get torch compile working

* up

* rename

* fix doc

* add chunking

* torch compile

* torch compile

* add modelling outputs

* torch compile

* Improve chunking

* Apply suggestions from code review

* Update docs/source/en/using-diffusers/svd.md

* Close diff tag

* remove slicing

* resnet docstr

* add docstr in resnet

* rename

* Apply suggestions from code review

* update tests

* Fix output type latents

* fix more

* fix more

* Update docs/source/en/using-diffusers/svd.md

* fix more

* add pipeline tests

* remove unused arg

* clean  up

* make sure get_scaling receives tensors

* fix euler scheduler

* fix get_scalings

* simply euler for now

* remove old test file

* use randn_tensor to create noise

* fix device for rand tensor

* increase expected_max_difference

* fix test_inference_batch_single_identical

* actually fix test_inference_batch_single_identical

* disable test_save_load_float16

* skip test_float16_inference

* skip test_inference_batch_single_identical

* fix test_xformers_attention_forwardGenerator_pass

* Apply suggestions from code review

* update StableVideoDiffusionPipelineSlowTests

* update image

* add diffusers example

* fix more

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: apolinário <joaopaulo.passos@gmail.com>

63f767ef

Fixed custom module importing on Windows (#5891) · d1b2a1a9

PENGUINLIONG authored Nov 29, 2023



* Fixed custom module importing on Windows

Windows use back slash and `os.path.join()` follows that convention.

* Apply suggestions from code review
Co-authored-by: Lucain <lucainp@gmail.com>

* Update pipeline_utils.py

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Lucain <lucainp@gmail.com>

d1b2a1a9

[Wuerstchen] Adapt lora training example scripts to use PEFT (#5959) · 01782c22
Kashif Rasul authored Nov 29, 2023
```
* Adapt lora example scripts to use PEFT

* add to_out.0
```
01782c22

[Pipeline] Add TextToVideoZeroSDXLPipeline (#4695) · d63a498c

vahramtadevosyan authored Nov 29, 2023



* integrated sdxl for the text2video-zero pipeline

* make fix-copies

* fixed CI issues

* make fix-copies

* added docs and `copied from` statements

* added fast tests

* made a small change in docs

* quality+style check fix

* updated docs. added controlnet inference with sdxl

* added device compatibility for fast tests

* fixed docstrings

* changing vae upcasting

* remove torch.empty_cache to speed up inference
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* made fast tests to run on dummy models only, fixed copied from statements

* fixed testing utils imports

* Added bullet points for SDXL support

* fixed formatting & quality

* Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fixed minor error for merging

* fixed updates of sdxl

* made fast tests inherit from `PipelineTesterMixin` and run in 3-4secs on CPU

* make style && make quality

* reimplemented fast tests w/o default attn processor

* make style & make quality

* make fix-copies

* make fix-copies

* fixed docs

* make style & make quality & make fix-copies

* bug fix in cross attention

* make style && make quality

* make fix-copies

* fix gpu issues

* make fix-copies

* updated pipeline signature

---------
Co-authored-by: Vahram <vahram.tadevosyan@lambda-loginnode02.cm.cluster>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

d63a498c

Controlnet ssd 1b support (#5779) · 6a4aad43

Marko Kostiv authored Nov 29, 2023



* Add SSD-1B support for controlnet model

* Add conditioning_channels into ControlNet init from unet

* Fix black formatting

* Isort fixes

* Adds SSD-1B controlnet pipeline test with UNetMidBlock2D as mid block

* Overrides failing ssd-1b tests

* Fixes tests after main branch update

* Fixes code quality checks

---------
Co-authored-by: Marko Kostiv <marko@linearity.io>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

6a4aad43

[docs] LCM training (#5796) · ddd8bd53
Steven Liu authored Nov 29, 2023
```
* first draft

* feedback
```
ddd8bd53

Support of ip-adapter to the StableDiffusionControlNetInpaintPipeline (#5887) · 9f7b2cf2

JuanCarlosPi authored Nov 29, 2023



* Change pipeline_controlnet_inpaint.py to add ip-adapter support. Changes are similar to those in pipeline_controlnet

* Change tests for the StableDiffusionControlNetInpaintPipeline by adding image_encoder: None

* Update src/diffusers/pipelines/controlnet/pipeline_controlnet_inpaint.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

9f7b2cf2

[LoRA refactor] move several state dict conversion utils out of lora.py (#5955) · 895c4b70

Sayak Paul authored Nov 29, 2023



* move several state dict conversion utils out of lora.py

* check

* check

* check

* check

* check

* check

* check

* revert back

* check

* check

* again check

* maybe fix?

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

895c4b70

Rename output_dir argument (#5916) · 636feba5
Linh Nguyen authored Nov 29, 2023
```
Fix typo in output_dir argument: "text-inversion-model" → "dreambooth-model"
```
636feba5

[bug fix] Inpainting for MultiAdapter (#5922) · 79dc7df0

Andrés Romero authored Nov 29, 2023



* bug in MultiAdapter for Inpainting

* adapter_input is a list for MultiAdapter

---------
Co-authored-by: andres <andres@hax.ai>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

79dc7df0

added doc for Kandinsky3.0 (#5937) · 6031ecbd

Charchit Sharma authored Nov 29, 2023



* added en doc for Kandinsky3.0

* required changes

* Update docs/source/en/api/pipelines/kandinsky3.md

* Update docs/source/en/api/pipelines/kandinsky3.md

* Update docs/source/en/api/pipelines/kandinsky3.md

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

6031ecbd

[Tests] Refactor `test_examples.py` for better readability (#5946) · fdd003d8

Sayak Paul authored Nov 29, 2023

* control and custom diffusion

* dreambooth

* instructpix2pix and dreambooth ckpting

* t2i adapters.

* text to image ft

* textual inversion

* unconditional

* workflows

* import fix

* fix import

fdd003d8

[docs] Update pipeline list (#5952) · 172acc98
Steven Liu authored Nov 29, 2023
```
add to list
```
172acc98

28 Nov, 2023 2 commits

[ldm3d] Ldm3d upscaler to community pipeline (#5870) · 5ae3c3a5

estelleafl authored Nov 28, 2023





---------
Co-authored-by: Aflalo <estellea@isl-gpu27.rr.intel.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

5ae3c3a5

fix: minor typo in docstring (#5961) · 21bc59ab
Soumik Rakshit authored Nov 28, 2023

21bc59ab

27 Nov, 2023 17 commits

[docs] Fix space (#5898) · 50a749e9
Steven Liu authored Nov 27, 2023
```
* fix

* minor edits
```
50a749e9
[load_textual_inversion]: allow multiple tokens (#5837) · d9075be4
YiYi Xu authored Nov 27, 2023
```
Co-authored-by: yiyixuxu <yixu310@gmail,com>
```
d9075be4
[From_pretrained] Fix warning (#5948) · b135b6e9
Patrick von Platen authored Nov 27, 2023

b135b6e9
[Community Pipeline] Diffusion Posterior Sampling for General Noisy Inverse Problems (#5939) · 14a0d21d
T. Xu authored Nov 27, 2023
```
* [community pipeline] dps impl

* add type checking

* pass ruff check

* ruff formatter
```
14a0d21d

[Tests] Make sure that we don't run tests multiple times (#5949) · ebf581e8

Patrick von Platen authored Nov 27, 2023

* [Tests] Make sure that we don't run tests mulitple times

* [Tests] Make sure that we don't run tests mulitple times

* [Tests] Make sure that we don't run tests mulitple times

ebf581e8

[Vae] Make sure all vae's work with latent diffusion models (#5880) · e550163b

Patrick von Platen authored Nov 27, 2023

* add comments to explain the code better

* add comments to explain the code better

* add comments to explain the code better

* add comments to explain the code better

* add comments to explain the code better

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

e550163b

fix: error on device for `lpw_stable_diffusion_xl` pipeline if... · 20f0cbc8

Viktor Grygorchuk authored Nov 27, 2023

fix: error on device for `lpw_stable_diffusion_xl` pipeline if `pipe.enable_sequential_cpu_offload()` enabled (#5885)

fix: set device for pipe.enable_sequential_cpu_offload()

20f0cbc8

Replace multiple variables with one variable. (#5715) · d72a24b7

Chi authored Nov 27, 2023



* I added a new doc string to the class. This is more flexible to understanding other developers what are doing and where it's using.

* Update src/diffusers/models/unet_2d_blocks.py

This changes suggest by maintener.
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update src/diffusers/models/unet_2d_blocks.py

Add suggested text
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update unet_2d_blocks.py

I changed the Parameter to Args text.

* Update unet_2d_blocks.py

proper indentation set in this file.

* Update unet_2d_blocks.py

a little bit of change in the act_fun argument line.

* I run the black command to reformat style in the code

* Update unet_2d_blocks.py

similar doc-string add to have in the original diffusion repository.

* I enhanced the code by replacing multiple redundant variables with a single variable, as they all served the same purpose. Additionally, I utilized the get_activation function for improved flexibility in choosing activation functions.

* Using as black package to reformated my file

* reverte some changes

* Remove conv_out_padding variables and using as conv_in_padding

* conv_out_padding create and add them into the code.

* run black command to solving styling problem

* add little bit space between comment and import statement

* I am utilizing the ruff library to address the style issues in my Makefile.

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d72a24b7

add LoRA weights load and fuse support for IPEX pipeline (#5920) · d3cda804
ginjia authored Nov 27, 2023
```
add IPEX pipeline LoRA weights loading support
```
d3cda804
Fix LCM Stable Diffusion distillation bug related to parsing unet_time_cond_proj_dim (#5893) · 07eac4d6
dg845 authored Nov 27, 2023
```
* Fix bug related to parsing unet_time_cond_proj_dim.

* Fix analogous bug in the SD-XL LCM distillation script.
```
07eac4d6
Avoid computing min() that is expensive when do_normalize is False in the image processor (#5896) · c079cae3
Iván de Prado authored Nov 27, 2023
```
Avoid computing min() that is expensive when do_normalize is False

Avoid extra computing when do_normalize is False
```
c079cae3
set the model to train state before accelerator prepare (#5099) · c7bfb8b2
Wang, Yi authored Nov 27, 2023
```
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
```
c7bfb8b2

Add Custom Timesteps Support to LCMScheduler and Supported Pipelines (#5874) · 67d07074

dg845 authored Nov 27, 2023

* Add custom timesteps support to LCMScheduler.

* Add custom timesteps support to StableDiffusionPipeline.

* Add custom timesteps support to StableDiffusionXLPipeline.

* Add custom timesteps support to remaining Stable Diffusion pipelines which support LCMScheduler (img2img, inpaint).

* Add custom timesteps support to remaining Stable Diffusion XL pipelines which support LCMScheduler (img2img, inpaint).

* Add custom timesteps support to StableDiffusionControlNetPipeline.

* Add custom timesteps support to T21 Stable Diffusion (XL) Adapters.

* Clean up Stable Diffusion inpaint tests.

* Manually add support for custom timesteps to AltDiffusion pipelines since make fix-copies doesn't appear to work correctly (it deletes the whole pipeline).

* make style

* Refactor pipeline timestep handling into the retrieve_timesteps function.

67d07074

Deprecate KarrasVeScheduler and ScoreSdeVpScheduler (#5269) · 9c357bda

Aryan V S authored Nov 27, 2023



* deprecated: KarrasVeScheduler, ScoreSdeVpScheduler

* delete tests relevant to deprecated schedulers

* chore: run make style

* fix: import error caused due to incorrect _import_structure after deprecation

* fix: ScoreSdeVpScheduler was not importable from diffusers

* remove import added by assumption

* Update src/diffusers/schedulers/__init__.py as suggested by @patrickvonplaten
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* make it a part deprecated

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Fix

* fix

* fix doc

* fix doc....again.......

* remove karras_ve test folder
Co-Authored-By: YiYi Xu <yixu310@gmail.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail,com>

9c357bda

[Core] add support for gradient checkpointing in transformer_2d (#5943) · 3f7c3511
Sayak Paul authored Nov 27, 2023
```
add support for gradient checkpointing in transformer_2d
```
3f7c3511

[Fix: pixart-alpha] random 512px resolution bug (#5842) · 7d6f30e8

Junsong Chen authored Nov 27, 2023



* [Fix: pixart-alpha]
add ASPECT_RATIO_512_BIN in use_resolution_binning for random 512px image generation.

* add slow test file for 512px generation without resolution binning

* fix: slow tests for resolution binning.

---------
Co-authored-by: jschen <chenjunsong4@h-partners.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

7d6f30e8

[Examples] Allow downloading variant model files (#5531) · 6d2e19f7

Patrick von Platen authored Nov 27, 2023



* add variant

* add variant

* Apply suggestions from code review

* reformat

* fix: textual_inversion.py

* fix: variant in model_info

---------
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>

6d2e19f7

24 Nov, 2023 3 commits

correct num inference steps · 2a7f43a7
Patrick von Platen authored Nov 24, 2023

2a7f43a7

[@cene555][Kandinsky 3.0] Add Kandinsky 3.0 (#5913) · b978334d

Patrick von Platen authored Nov 24, 2023

* finalize

* finalize

* finalize

* add slow test

* add slow test

* add slow test

* Fix more

* add slow test

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

* fix more

* Better

* Fix more

* Fix more

* add slow test

* Add auto pipelines

* add slow test

* Add all

* add slow test

* add slow test

* add slow test

* add slow test

* add slow test

* Apply suggestions from code review

* add slow test

* add slow test

b978334d

[Docs] add: 8bit inference with pixart alpha (#5814) · e5f232f7

Sayak Paul authored Nov 24, 2023



* add: 8bit inference with pixart alpha

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* add: note on 4bit.

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* address comment

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

e5f232f7

23 Nov, 2023 2 commits
- [bug fix] fix small bug in readme template of sdxl lora training script (#5914) · 3003ff49
  Linoy Tsaban authored Nov 23, 2023
```
readme improvement and metadata fix
```
  3003ff49
- [bug fix] fix small bug in readme template of sdxl lora training script (#5906) · 5ffa6032
  Linoy Tsaban authored Nov 23, 2023
```
* readme bug fix

* style fix

---------
Co-authored-by: Linoy Tsaban <linoy@huggingface.co>
```
  5ffa6032
22 Nov, 2023 1 commit

Adds an advanced version of the SD-XL DreamBooth LoRA training script... · 0eeee618

Linoy Tsaban authored Nov 22, 2023


Adds an advanced version of the SD-XL DreamBooth LoRA training script supporting pivotal tuning  (#5883)

* sdxl dreambooth lora training script with pivotal tuning

* bug fix - args missing from parse_args

* code quality fixes

* comment unnecessary code from TokenEmbedding handler class

* fixup

---------
Co-authored-by: Linoy Tsaban <linoy@huggingface.co>

0eeee618

21 Nov, 2023 2 commits

ControlNet+Adapter pipeline, and ControlNet+Adapter+Inpaint pipeline (#5869) · 93f1a14c
Andrés Romero authored Nov 21, 2023
```
* ControlNet+Adapter pipeline, and +Inpaint pipeline


---------
Co-authored-by: andres <andres@hax.ai>
```
93f1a14c

[Lora] Seperate logic (#5809) · 13d73d93

Patrick von Platen authored Nov 21, 2023

* [Lora] Seperate logic

* [Lora] Seperate logic

* [Lora] Seperate logic

* add comments to explain the code better

* add comments to explain the code better

13d73d93