Commits · 3980858ad40d46d0d0b52b09d9667344b91ab783 · renzhc / diffusers_dcu

28 Mar, 2023 20 commits

Update overview.mdx (#2864) · 3980858a
M. Tolga Cangöz authored Mar 28, 2023
```
Fix typos
```
3980858a
Update evaluation.mdx (#2862) · 37c82480
M. Tolga Cangöz authored Mar 28, 2023
```
Fix typos
```
37c82480

[Tests] Adds a test to check if `image_embeds` None case is handled properly... · 13845462

Sayak Paul authored Mar 28, 2023

[Tests] Adds a test to check if `image_embeds` None case is handled properly in `StableUnCLIPImg2ImgPipeline` (#2861)

* improve stable unclip doc.

* add: test to check if image_emebds None case is handled.

* apply formatting/

13845462

[2761]: Add documentation for extra_in_channels UNet1DModel (#2817) · 53377ef8
Nipun Jindal authored Mar 28, 2023
```
Co-authored-by: njindal <njindal@adobe.com>
```
53377ef8
[WIP] Check UNet shapes in StableDiffusionInpaintPipeline __init__ (#2853) · 4d0f412d
dg845 authored Mar 28, 2023
```
Add warning in __init__ if user loads a checkpoint with pipeline.unet.config.in_channels other than 9.
```
4d0f412d

Add `last_epoch` argument to `optimization.get_scheduler` (#2850) · 25d927aa

Felix Blanke authored Mar 28, 2023

Add last_epoch arg to optimization.get_scheduler.

Allows the specification of the index of the last epoch when
resuming training.

25d927aa

[WIP][Docs] Use DiffusionPipeline Instead of Child Classes when Loading Pipeline (#2809) · 663c6545

dg845 authored Mar 28, 2023



* Change the docs to use the parent DiffusionPipeline class when loading a checkpoint using from_pretrained() instead of a child class (e.g. StableDiffusionPipeline) where possible.

* Run make style to fix style issues.

* Change more docs to use DiffusionPipeline rather than a subclass.

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

663c6545

Fix link to LoRA training guide in DreamBooth training guide (#2836) · 920a15cf
John HU authored Mar 28, 2023
```
Fix link to LoRA training guide
```
920a15cf

Update the legacy inpainting SD pipeline, to allow calling it with only... · 7d756813

cmdr2 authored Mar 28, 2023

Update the legacy inpainting SD pipeline, to allow calling it with only prompt_embeds (instead of always requiring a prompt) (#2842)

Fix error 'required positional argument: prompt' when Legacy Inpaint is called only with prompt_embeds

7d756813

Remove duplicate sentence in docstrings (#2834) · 159a0bff
Li-Huai (Allan) Lin authored Mar 28, 2023
```
* Remove duplicate sentence

* format
```
159a0bff
Remove suggestion to use cuDNN benchmark in docs (#2793) · b76d9fde
Sandeep authored Mar 28, 2023
```
* Remove suggestion to use cuDNN benchmark in docs

* removing the wrong line
```
b76d9fde
StableDiffusionLongPromptWeightingPipeline: Do not hardcode pad token (#2832) · 0f14335a
Aki Sakurai authored Mar 28, 2023

0f14335a
fix KarrasVePipeline bug (#2828) · 8bdf4236
junhsss authored Mar 28, 2023

8bdf4236

[Stable Diffusion] Allow users to disable Safety checker if loading model from checkpoint (#2768) · 585f621a

Stax124 authored Mar 28, 2023



* Allow user to disable SafetyChecker and enable dtypes if loading models from .ckpt or .safetensors

* Fix Import sorting (Ruff error)

* Get rid of the dtype convert method as it was implemented all along

* Fix the docstring

* Fix ruff formatting

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

585f621a

updated onnx pndm test (#2811) · c0afca2d
Kashif Rasul authored Mar 28, 2023

c0afca2d
[Init] Make sure shape mismatches are caught early (#2847) · 42d95017
Patrick von Platen authored Mar 28, 2023
```
Improve init
```
42d95017

Make dynamo wrapped modules work with save_pretrained (#2726) · 81125d84

Pedro Cuenca authored Mar 28, 2023



* Workaround for saving dynamo-wrapped models.

* Accept suggestion from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Apply workaround when overriding pipeline components.

* Ensure the correct config.json is saved to disk.

Instead of the dynamo class.

* Save correct module (not compiled one)

* Add test

* style

* fix docstrings

* Go back to using string comparisons.

PyTorch CPU does not have _dynamo.

* Simple test for save_pretrained of compiled models.

* Helper function to test whether module is compiled.

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

81125d84

[WIP]Flax training script for controlnet (#2818) · d4f846fa

YiYi Xu authored Mar 27, 2023



* add train_controlnet_flax

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d4f846fa

add: better warning messages when handling multiple conditionings. (#2804) · 58fc8244
Sayak Paul authored Mar 28, 2023
```
* add: better warning messages when handling multiple conditioning.

* fix: handling of controlnet_conditioning_scale
```
58fc8244
improve stable unclip doc. (#2823) · fab4f3d6
Sayak Paul authored Mar 28, 2023

fab4f3d6

27 Mar, 2023 5 commits
- Helper function to disable custom attention processors (#2791) · b10f5275
  Pedro Cuenca authored Mar 27, 2023
```
* Helper function to disable custom attention processors.

* Restore code deleted by mistake.

* Format

* Fix modeling_text_unet copy.
```
  b10f5275
- Fix StableUnCLIPImg2ImgPipeline handling of explicitly passed image embeddings (#2845) · 7bc2fff1
  Eugene Lyapustin authored Mar 27, 2023
  
  7bc2fff1
- [Tests] Fix slow tests (#2846) · 4c26cb9c
  Patrick von Platen authored Mar 27, 2023
  
  4c26cb9c
- Ruff: apply same rules as in transformers (#2827) · 1d7b4b60
  Pedro Cuenca authored Mar 27, 2023
```
* Apply same ruff settings as in transformers

See https://github.com/huggingface/transformers/blob/main/pyproject.toml

Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>

* Apply new style rules

* Style
Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>

* style

* remove list, ruff wouldn't auto fix.

---------
Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>
```
  1d7b4b60
- Update `examples` README.md to include the latest examples (#2839) · abb22b4e
  Sayak Paul authored Mar 27, 2023
  
  abb22b4e
24 Mar, 2023 7 commits

StableDiffusionModelEditingPipeline documentation (#2810) · 9fb02175
Bahjat Kawar authored Mar 24, 2023
```
* comment update

* comment update
```
9fb02175

[Docs] update docs (Stable unCLIP) to reflect the updated ckpts. (#2815) · 5883d8d4

Sayak Paul authored Mar 24, 2023



* update docs to reflect the updated ckpts.

* update: point about prompt.

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* emove image resizing.

* Apply suggestions from code review

* Apply suggestions from code review

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5883d8d4

[Stable UnCLIP] Finish Stable UnCLIP (#2814) · dbcb15c2
Patrick von Platen authored Mar 24, 2023
```
* up

* fix more 7

* up

* finish
```
dbcb15c2

Update onnxruntime package candidates (#2666) · c4892f18

PeixuanZuo authored Mar 24, 2023

* update import onnxruntime package, enable onnxruntime-rocm and onnxruntime-training

* add ort_nightly_gpu

c4892f18

Relax DiT test (#2808) · f6feb699

Kashif Rasul authored Mar 24, 2023

* Relax DiT test

* relax 2 more tests

* fix style

* skip test on mac due to older protobuf

f6feb699

Add ModelEditing pipeline (#2721) · 37a44bb2

Bahjat Kawar authored Mar 24, 2023



* TIME first commit

* styling.

* styling 2.

* fixes; tests

* apply styling and doc fix.

* remove sups.

* fixes

* remove temp file

* move augmentations to const

* added doc entry

* code quality

* customize augmentations

* quality

* quality

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

37a44bb2

Update train_text_to_image_lora.py (#2795) · 4a98d6e0
Haofan Wang authored Mar 24, 2023

4a98d6e0

23 Mar, 2023 8 commits

Add AudioLDM (#2232) · b94880e5

Sanchit Gandhi authored Mar 23, 2023



* Add AudioLDM

* up

* add vocoder

* start unet

* unconditional unet

* clap, vocoder and vae

* clean-up: conversion scripts

* fix: conversion script token_type_ids

* clean-up: pipeline docstring

* tests: from SD

* clean-up: cpu offload vocoder instead of safety checker

* feat: adapt tests to audioldm

* feat: add docs

* clean-up: amend pipeline docstrings

* clean-up: make style

* clean-up: make fix-copies

* fix: add doc path to toctree

* clean-up: args for conversion script

* clean-up: paths to checkpoints

* fix: use conditional unet

* clean-up: make style

* fix: type hints for UNet

* clean-up: docstring for UNet

* clean-up: make style

* clean-up: remove duplicate in docstring

* clean-up: make style

* clean-up: make fix-copies

* clean-up: move imports to start in code snippet

* fix: pass cross_attention_dim as a list/tuple to unet

* clean-up: make fix-copies

* fix: update checkpoint path

* fix: unet cross_attention_dim in tests

* film embeddings -> class embeddings

* Apply suggestions from code review
Co-authored-by: Will Berman <wlbberman@gmail.com>

* fix: unet film embed to use existing args

* fix: unet tests to use existing args

* fix: make style

* fix: transformers import and version in init

* clean-up: make style

* Revert "clean-up: make style"

This reverts commit 5d6d1f8b324f5583e7805dc01e2c86e493660d66.

* clean-up: make style

* clean-up: use pipeline tester mixin tests where poss

* clean-up: skip attn slicing test

* fix: add torch dtype to docs

* fix: remove conversion script out of src

* fix: remove .detach from 1d waveform

* fix: reduce default num inf steps

* fix: swap height/width -> audio_length_in_s

* clean-up: make style

* fix: remove nightly tests

* fix: imports in conversion script

* clean-up: slim-down to two slow tests

* clean-up: slim-down fast tests

* fix: batch consistent tests

* clean-up: make style

* clean-up: remove vae slicing fast test

* clean-up: propagate changes to doc

* fix: increase test tol to 1e-2

* clean-up: finish docs

* clean-up: make style

* feat: vocoder / VAE compatibility check

* feat: possibly expand / cut audio waveform

* fix: pipeline call signature test

* fix: slow tests output len

* clean-up: make style

* make style

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: William Berman <WLBberman@gmail.com>

b94880e5

[docs] Add Colab notebooks and Spaces (#2713) · 1870fb05
Steven Liu authored Mar 23, 2023
```
* add colab notebook and spaces

* fix image link
```
1870fb05

Flax controlnet (#2727) · df91c447

YiYi Xu authored Mar 23, 2023



* add contronet flax

---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>

df91c447

Skip `mps` in text-to-video tests (#2792) · aa0531fa
Pedro Cuenca authored Mar 23, 2023
```
* Skip mps in text-to-video tests.

* style

* Skip UNet3D mps tests.
```
aa0531fa

Update train_text_to_image_lora.py (#2767) · dc5b4e23

Haofan Wang authored Mar 23, 2023

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* format

dc5b4e23

[Docs] small fixes to the text to video doc. (#2787) · 0d7aac3e
Sayak Paul authored Mar 23, 2023
```
* small fixes to the text to video doc.

* add: Spaces link.

* add: warning on research-only model.
```
0d7aac3e

[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipeline (#2779) · 055c90f5

Nipun Jindal authored Mar 23, 2023



[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipelines
Co-authored-by: njindal <njindal@adobe.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

055c90f5

Music Spectrogram diffusion pipeline (#1044) · 2ef9bdd7

Kashif Rasul authored Mar 23, 2023

* initial TokenEncoder and ContinuousEncoder

* initial modules

* added ContinuousContextTransformer

* fix copy paste error

* use numpy for get_sequence_length

* initial terminal relative positional encodings

* fix weights keys

* fix assert

* cross attend style: concat encodings

* make style

* concat once

* fix formatting

* Initial SpectrogramPipeline

* fix input_tokens

* make style

* added mel output

* ignore weights for config

* move mel to numpy

* import pipeline

* fix class names and import

* moved models to models folder

* import ContinuousContextTransformer and SpectrogramDiffusionPipeline

* initial spec diffusion converstion script

* renamed config to t5config

* added weight loading

* use arguments instead of t5config

* broadcast noise time to batch dim

* fix call

* added scale_to_features

* fix weights

* transpose laynorm weight

* scale is a vector

* scale the...

2ef9bdd7