1. 17 Apr, 2023 1 commit
  2. 16 Apr, 2023 1 commit
  3. 14 Apr, 2023 2 commits
  4. 13 Apr, 2023 2 commits
  5. 12 Apr, 2023 4 commits
  6. 11 Apr, 2023 4 commits
    • Attention processor cross attention norm group norm (#3021) · 98c5e5da
      Will Berman authored
      add group norm type to attention processor cross attention norm
      
      This lets the cross attention norm be either a group norm block or a
      layer norm block.
      
      The group norm operates along the channels dimension and requires an input
      shape of (batch size, channels, *), whereas a layer norm with a single
      `normalized_shape` dimension only operates over the last dimension,
      i.e. (*, channels).
      
      The channels we want to normalize correspond to the hidden dimension of the encoder hidden states.
      
      By convention, the encoder hidden states are always passed as (batch size, sequence
      length, hidden states).
      
      This means the layer norm can operate on the tensor without modification, but the group
      norm requires transposing the last two dimensions so that it operates on (batch size, hidden states, sequence length).
      
      All existing attention processors need the same logic, so we consolidate it
      in a helper function `prepare_encoder_hidden_states` (see the sketch after this message).
      
      prepare_encoder_hidden_states -> norm_encoder_hidden_states re: @patrickvonplaten
      
      move norm_cross defined check to outside norm_encoder_hidden_states
      
      add missing attn.norm_cross check
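      
      A minimal sketch of the renamed helper, assuming the norm module is either an
      `nn.LayerNorm` or an `nn.GroupNorm`; the standalone function signature here is
      illustrative only, not the exact code of the attention processor:
      
      ```python
      import torch
      from torch import nn
      
      def norm_encoder_hidden_states(norm_cross: nn.Module, encoder_hidden_states: torch.Tensor) -> torch.Tensor:
          # encoder_hidden_states arrives as (batch size, sequence length, hidden states)
          if isinstance(norm_cross, nn.LayerNorm):
              # LayerNorm normalizes the last dimension, so the tensor is used as-is
              return norm_cross(encoder_hidden_states)
          if isinstance(norm_cross, nn.GroupNorm):
              # GroupNorm expects (batch size, channels, *): move the hidden dimension
              # into the channels position, normalize, then move it back
              encoder_hidden_states = encoder_hidden_states.transpose(1, 2)
              encoder_hidden_states = norm_cross(encoder_hidden_states)
              return encoder_hidden_states.transpose(1, 2)
          raise ValueError("norm_cross must be nn.LayerNorm or nn.GroupNorm")
      ```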
    • Fix scheduler type mismatch (#3041) · 526827c3
      Pedro Cuenca authored
      The mismatch occurred when doing generation manually and using guidance_scale
      as a static argument.
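      
      For context, a hedged sketch of that scenario (a manual denoising loop where
      `guidance_scale` stays a plain Python float); the model id, shapes, and random
      placeholder embeddings are illustrative only, and this does not reproduce the
      exact mismatch that was fixed:
      
      ```python
      import torch
      from diffusers import StableDiffusionPipeline
      
      pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")
      unet, scheduler = pipe.unet, pipe.scheduler
      
      guidance_scale = 7.5  # static Python float, passed once and reused every step
      prompt_embeds = torch.randn(2, 77, 768)  # placeholder for [uncond, cond] text embeddings
      
      scheduler.set_timesteps(50)
      latents = torch.randn(1, unet.config.in_channels, 64, 64) * scheduler.init_noise_sigma
      
      with torch.no_grad():
          for t in scheduler.timesteps:
              latent_model_input = scheduler.scale_model_input(torch.cat([latents] * 2), t)
              noise_pred = unet(latent_model_input, t, encoder_hidden_states=prompt_embeds).sample
              uncond, cond = noise_pred.chunk(2)
              noise_pred = uncond + guidance_scale * (cond - uncond)
              latents = scheduler.step(noise_pred, t, latents).prev_sample
      ```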
    • config fixes (#3060) · 80bc0c0c
      Will Berman authored
    • Fix config prints and save, load of pipelines (#2849) · 8b451eb6
      Patrick von Platen authored
      * [Config] Fix config prints and save, load
      
      * Only use potential nn.Modules for dtype and device
      
      * Correct vae image processor
      
      * make sure in_channels is not accessed directly
      
      * make sure in_channels is only accessed via config (see the sketch after this list)
      
      * Make sure schedulers only access config attributes
      
      * Make sure to access config in SAG
      
      * Fix vae processor and make style
      
      * add tests
      
      * up
      
      * make style
      
      * Fix more naming issues
      
      * Final fix with vae config
      
      * change more
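      
      A brief illustration of the convention these commits enforce (the model id is
      just an example): attributes registered to the config are read via `.config`
      rather than directly from the module:
      
      ```python
      from diffusers import StableDiffusionPipeline
      
      pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")
      
      # Registered attributes are read through .config rather than off the module itself
      in_channels = pipe.unet.config.in_channels    # instead of pipe.unet.in_channels
      sample_size = pipe.unet.config.sample_size
      
      # Scheduler hyperparameters are likewise config attributes
      num_train_timesteps = pipe.scheduler.config.num_train_timesteps
      ```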
  7. 06 Apr, 2023 1 commit
  8. 04 Apr, 2023 1 commit
  9. 31 Mar, 2023 4 commits
  10. 30 Mar, 2023 1 commit
  11. 28 Mar, 2023 5 commits
  12. 27 Mar, 2023 2 commits
  13. 24 Mar, 2023 3 commits
  14. 23 Mar, 2023 2 commits
  15. 22 Mar, 2023 1 commit
    • [MS Text To Video] Add first text to video (#2738) · ca1a2229
      Patrick von Platen authored
      
      
      * [MS Text To Video] Add first text to video
      
      * upload
      
      * make first model example
      
      * match unet3d params
      
      * make sure weights are correctly converted
      
      * improve
      
      * forward pass works, but diff result
      
      * make forward work
      
      * fix more
      
      * finish
      
      * refactor video output class.
      
      * feat: add support for a video export utility.
      
      * fix: opencv availability check.
      
      * run make fix-copies.
      
      * add: docs for the model components.
      
      * add: standalone pipeline doc.
      
      * edit docstring of the pipeline.
      
      * add: right path to TransformerTempModel
      
      * add: first set of tests.
      
      * complete fast tests for text to video.
      
      * fix bug
      
      * up
      
      * three fast tests failing.
      
      * add: note on slow tests
      
      * make work with all schedulers
      
      * apply styling.
      
      * add slow tests
      
      * change file name
      
      * update
      
      * more correction
      
      * more fixes
      
      * finish
      
      * up
      
      * Apply suggestions from code review
      
      * up
      
      * finish
      
      * make copies
      
      * fix pipeline tests
      
      * fix more tests
      
      * Apply suggestions from code review
      Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
      
      * apply suggestions
      
      * up
      
      * revert
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
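      
      A hedged usage sketch of the new pipeline together with the video export
      utility; the checkpoint name and prompt are assumptions based on the
      ModelScope (MS) text-to-video model:
      
      ```python
      import torch
      from diffusers import DiffusionPipeline
      from diffusers.utils import export_to_video
      
      # Assumed checkpoint for the ModelScope text-to-video model
      pipe = DiffusionPipeline.from_pretrained(
          "damo-vilab/text-to-video-ms-1.7b", torch_dtype=torch.float16
      ).to("cuda")
      
      video_frames = pipe("an astronaut riding a horse", num_inference_steps=25).frames
      video_path = export_to_video(video_frames)  # the video export utility (requires opencv)
      ```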
  16. 21 Mar, 2023 1 commit
  17. 15 Mar, 2023 3 commits
  18. 14 Mar, 2023 1 commit
    • Add support for different model prediction types in DDIMInverseScheduler (#2619) · ee71d9d0
      clarencechen authored
      
      
      * Add support for different model prediction types in DDIMInverseScheduler
      Resolve alpha_prod_t_prev index issue for final step of inversion
      
      * Fix old bug introduced when prediction type is "sample"
      
      * Add support for sample clipping for numerical stability and deprecate old kwarg
      
      * Detach sample, alphas, betas
      
      Derive predicted noise from model output before dist. regularization
      
      Style cleanup
      
      * Log loss for debugging
      
      * Revert "Log loss for debugging"
      
      This reverts commit 76ea9c856f99f4c8eca45a0b1801593bb982584b.
      
      * Add comments
      
      * Add inversion equivalence test
      
      * Add expected data for Pix2PixZero pipeline tests with SD 2
      
      * Update tests/pipelines/stable_diffusion/test_stable_diffusion_pix2pix_zero.py
      
      * Remove cruft and add more explanatory comments
      
      ---------
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
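      
      For reference, a minimal sketch of the standard DDIM algebra behind the three
      prediction types (illustrative only, not the scheduler's literal code):
      
      ```python
      def predict_original_sample(model_output, sample, alpha_prod_t, prediction_type="epsilon"):
          """Recover the predicted x_0 from the model output under each prediction type."""
          beta_prod_t = 1 - alpha_prod_t
          if prediction_type == "epsilon":
              # the model predicts the added noise
              return (sample - beta_prod_t ** 0.5 * model_output) / alpha_prod_t ** 0.5
          if prediction_type == "sample":
              # the model predicts x_0 directly
              return model_output
          if prediction_type == "v_prediction":
              # the model predicts v = sqrt(alpha_prod_t) * noise - sqrt(1 - alpha_prod_t) * x_0
              return alpha_prod_t ** 0.5 * sample - beta_prod_t ** 0.5 * model_output
          raise ValueError(f"unsupported prediction_type: {prediction_type}")
      ```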
  19. 13 Mar, 2023 1 commit
    • Add support for Multi-ControlNet to StableDiffusionControlNetPipeline (#2627) · d9b8adc4
      Takuma Mori authored
      
      
      * support for List[ControlNetModel] on init()
      
      * Add to support for multiple ControlNetCondition
      
      * rename conditioning_scale to scale
      
      * scaling bugfix
      
      * Manually merge `MultiControlNet` #2621
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_controlnet.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_controlnet.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_controlnet.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_controlnet.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_controlnet.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_controlnet.py
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * cleanups
      - don't expose ControlNetCondition
      - move scaling to ControlNetModel
      
      * correct `make style` error
      
      * remove ControlNetCondition to reduce code diff
      
      * refactoring image/cond_scale
      
      * add explanation for `images`
      
      * Add docstrings
      
      * all fast-test passed
      
      * Add a slow test
      
      * nit
      
      * Apply suggestions from code review
      
      * small precision fix
      
      * nits
      
      MultiControlNet -> MultiControlNetModel - matches the existing naming a bit
      closer.
      
      MultiControlNetModel inherits from the model utils class - we don't have to
      re-write the fp16 test.
      
      Skip tests that save the multi-controlnet pipeline - clearer than changing the
      test body.
      
      Don't auto-batch the number of input images to the number of controlnets.
      We generally prefer to require the user to pass the expected number of
      inputs. This simplifies the processing code a bit more.
      
      Use the existing image pre-processing code a bit more. We can rely on the
      existing image pre-processing code and keep the inference loop a bit
      simpler.
      
      ---------
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: William Berman <WLBberman@gmail.com>
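      
      A hedged usage sketch of the resulting API; the checkpoint names are examples,
      and the blank PIL images are placeholders standing in for a real Canny edge map
      and pose rendering:
      
      ```python
      import torch
      from PIL import Image
      from diffusers import ControlNetModel, StableDiffusionControlNetPipeline
      
      # Example checkpoints; any set of compatible ControlNets works
      controlnets = [
          ControlNetModel.from_pretrained("lllyasviel/sd-controlnet-canny", torch_dtype=torch.float16),
          ControlNetModel.from_pretrained("lllyasviel/sd-controlnet-openpose", torch_dtype=torch.float16),
      ]
      
      # Passing a List[ControlNetModel] wraps them in a MultiControlNetModel internally
      pipe = StableDiffusionControlNetPipeline.from_pretrained(
          "runwayml/stable-diffusion-v1-5", controlnet=controlnets, torch_dtype=torch.float16
      ).to("cuda")
      
      # One conditioning image per controlnet; they are not auto-batched
      canny_image = Image.new("RGB", (512, 512))  # placeholder for a Canny edge map
      pose_image = Image.new("RGB", (512, 512))   # placeholder for an OpenPose rendering
      
      image = pipe(
          "a man standing in a park",
          image=[canny_image, pose_image],
          controlnet_conditioning_scale=[1.0, 0.8],  # optional per-controlnet scales
      ).images[0]
      ```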