Commits · b76d9fde8de381a50d64c401b5d12864a28c5556 · renzhc / diffusers_dcu

28 Mar, 2023 10 commits

Remove suggestion to use cuDNN benchmark in docs (#2793) · b76d9fde
Sandeep authored Mar 28, 2023
```
* Remove suggestion to use cuDNN benchmark in docs

* removing the wrong line
```
b76d9fde
StableDiffusionLongPromptWeightingPipeline: Do not hardcode pad token (#2832) · 0f14335a
Aki Sakurai authored Mar 28, 2023

0f14335a
fix KarrasVePipeline bug (#2828) · 8bdf4236
junhsss authored Mar 28, 2023

8bdf4236

[Stable Diffusion] Allow users to disable Safety checker if loading model from checkpoint (#2768) · 585f621a

Stax124 authored Mar 28, 2023



* Allow user to disable SafetyChecker and enable dtypes if loading models from .ckpt or .safetensors

* Fix Import sorting (Ruff error)

* Get rid of the dtype convert method as it was implemented all along

* Fix the docstring

* Fix ruff formatting

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

585f621a

updated onnx pndm test (#2811) · c0afca2d
Kashif Rasul authored Mar 28, 2023

c0afca2d
[Init] Make sure shape mismatches are caught early (#2847) · 42d95017
Patrick von Platen authored Mar 28, 2023
```
Improve init
```
42d95017

Make dynamo wrapped modules work with save_pretrained (#2726) · 81125d84

Pedro Cuenca authored Mar 28, 2023



* Workaround for saving dynamo-wrapped models.

* Accept suggestion from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Apply workaround when overriding pipeline components.

* Ensure the correct config.json is saved to disk.

Instead of the dynamo class.

* Save correct module (not compiled one)

* Add test

* style

* fix docstrings

* Go back to using string comparisons.

PyTorch CPU does not have _dynamo.

* Simple test for save_pretrained of compiled models.

* Helper function to test whether module is compiled.

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

81125d84
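
The "string comparisons" and "helper function" bullets in the commit above can be sketched roughly as follows. This is an illustration only, not the code from the commit: the function names `is_compiled_module` and `unwrap_compiled` are hypothetical, and the sketch assumes the PyTorch 2.0 wrapper class is named `OptimizedModule`, lives in a `torch._dynamo` module, and keeps the original module in `_orig_mod`.

```python
def is_compiled_module(module) -> bool:
    """Return True if `module` looks like a torch.compile (dynamo) wrapper.

    Hypothetical helper: it compares type names as strings, so the check
    never imports `torch._dynamo` (absent on PyTorch CPU, per the commit).
    """
    cls = type(module)
    return cls.__name__ == "OptimizedModule" and "_dynamo" in cls.__module__


def unwrap_compiled(module):
    """Return the original module behind a dynamo wrapper, else `module`."""
    # Assumption: the wrapper stores the uncompiled module in `_orig_mod`.
    return module._orig_mod if is_compiled_module(module) else module
```

Saving the unwrapped module rather than the wrapper is what would keep the original class name, not the dynamo class, in the serialized config.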

[WIP]Flax training script for controlnet (#2818) · d4f846fa

YiYi Xu authored Mar 27, 2023



* add train_controlnet_flax

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

d4f846fa

add: better warning messages when handling multiple conditionings. (#2804) · 58fc8244
Sayak Paul authored Mar 28, 2023
```
* add: better warning messages when handling multiple conditioning.

* fix: handling of controlnet_conditioning_scale
```
58fc8244
improve stable unclip doc. (#2823) · fab4f3d6
Sayak Paul authored Mar 28, 2023

fab4f3d6

27 Mar, 2023 5 commits
- Helper function to disable custom attention processors (#2791) · b10f5275
  Pedro Cuenca authored Mar 27, 2023
```
* Helper function to disable custom attention processors.

* Restore code deleted by mistake.

* Format

* Fix modeling_text_unet copy.
```
  b10f5275
- Fix StableUnCLIPImg2ImgPipeline handling of explicitly passed image embeddings (#2845) · 7bc2fff1
  Eugene Lyapustin authored Mar 27, 2023
  
  7bc2fff1
- [Tests] Fix slow tests (#2846) · 4c26cb9c
  Patrick von Platen authored Mar 27, 2023
  
  4c26cb9c
- Ruff: apply same rules as in transformers (#2827) · 1d7b4b60
  Pedro Cuenca authored Mar 27, 2023
```
* Apply same ruff settings as in transformers

See https://github.com/huggingface/transformers/blob/main/pyproject.toml

Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>

* Apply new style rules

* Style
Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>

* style

* remove list, ruff wouldn't auto fix.

---------
Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>
```
  1d7b4b60
- Update `examples` README.md to include the latest examples (#2839) · abb22b4e
  Sayak Paul authored Mar 27, 2023
  
  abb22b4e
24 Mar, 2023 7 commits

StableDiffusionModelEditingPipeline documentation (#2810) · 9fb02175
Bahjat Kawar authored Mar 24, 2023
```
* comment update

* comment update
```
9fb02175

[Docs] update docs (Stable unCLIP) to reflect the updated ckpts. (#2815) · 5883d8d4

Sayak Paul authored Mar 24, 2023



* update docs to reflect the updated ckpts.

* update: point about prompt.

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* remove image resizing.

* Apply suggestions from code review

* Apply suggestions from code review

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5883d8d4

[Stable UnCLIP] Finish Stable UnCLIP (#2814) · dbcb15c2
Patrick von Platen authored Mar 24, 2023
```
* up

* fix more 7

* up

* finish
```
dbcb15c2

Update onnxruntime package candidates (#2666) · c4892f18

PeixuanZuo authored Mar 24, 2023

* update import onnxruntime package, enable onnxruntime-rocm and onnxruntime-training

* add ort_nightly_gpu

c4892f18
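
A minimal sketch of what probing a list of package candidates might look like, assuming the check is done through installed-distribution metadata. The candidate tuple contains only the names mentioned in the commit above; `ORT_CANDIDATES` and `find_installed` are hypothetical names, not identifiers from the repository.

```python
from importlib import metadata

# Distribution names taken from the commit message; the real list may differ.
ORT_CANDIDATES = (
    "onnxruntime",
    "onnxruntime-rocm",
    "onnxruntime-training",
    "ort_nightly_gpu",
)


def find_installed(candidates=ORT_CANDIDATES):
    """Return (name, version) of the first installed candidate, or None."""
    for name in candidates:
        try:
            return name, metadata.version(name)
        except metadata.PackageNotFoundError:
            # Not installed under this name; try the next candidate.
            continue
    return None
```

Checking distribution metadata instead of importing the module lets several differently named distributions that provide the same import name be detected uniformly.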

Relax DiT test (#2808) · f6feb699

Kashif Rasul authored Mar 24, 2023

* Relax DiT test

* relax 2 more tests

* fix style

* skip test on mac due to older protobuf

f6feb699

Add ModelEditing pipeline (#2721) · 37a44bb2

Bahjat Kawar authored Mar 24, 2023



* TIME first commit

* styling.

* styling 2.

* fixes; tests

* apply styling and doc fix.

* remove sups.

* fixes

* remove temp file

* move augmentations to const

* added doc entry

* code quality

* customize augmentations

* quality

* quality

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

37a44bb2

Update train_text_to_image_lora.py (#2795) · 4a98d6e0
Haofan Wang authored Mar 24, 2023

4a98d6e0

23 Mar, 2023 15 commits

Add AudioLDM (#2232) · b94880e5

Sanchit Gandhi authored Mar 23, 2023



* Add AudioLDM

* up

* add vocoder

* start unet

* unconditional unet

* clap, vocoder and vae

* clean-up: conversion scripts

* fix: conversion script token_type_ids

* clean-up: pipeline docstring

* tests: from SD

* clean-up: cpu offload vocoder instead of safety checker

* feat: adapt tests to audioldm

* feat: add docs

* clean-up: amend pipeline docstrings

* clean-up: make style

* clean-up: make fix-copies

* fix: add doc path to toctree

* clean-up: args for conversion script

* clean-up: paths to checkpoints

* fix: use conditional unet

* clean-up: make style

* fix: type hints for UNet

* clean-up: docstring for UNet

* clean-up: make style

* clean-up: remove duplicate in docstring

* clean-up: make style

* clean-up: make fix-copies

* clean-up: move imports to start in code snippet

* fix: pass cross_attention_dim as a list/tuple to unet

* clean-up: make fix-copies

* fix: update checkpoint path

* fix: unet cross_attention_dim in tests

* film embeddings -> class embeddings

* Apply suggestions from code review
Co-authored-by: Will Berman <wlbberman@gmail.com>

* fix: unet film embed to use existing args

* fix: unet tests to use existing args

* fix: make style

* fix: transformers import and version in init

* clean-up: make style

* Revert "clean-up: make style"

This reverts commit 5d6d1f8b324f5583e7805dc01e2c86e493660d66.

* clean-up: make style

* clean-up: use pipeline tester mixin tests where poss

* clean-up: skip attn slicing test

* fix: add torch dtype to docs

* fix: remove conversion script out of src

* fix: remove .detach from 1d waveform

* fix: reduce default num inf steps

* fix: swap height/width -> audio_length_in_s

* clean-up: make style

* fix: remove nightly tests

* fix: imports in conversion script

* clean-up: slim-down to two slow tests

* clean-up: slim-down fast tests

* fix: batch consistent tests

* clean-up: make style

* clean-up: remove vae slicing fast test

* clean-up: propagate changes to doc

* fix: increase test tol to 1e-2

* clean-up: finish docs

* clean-up: make style

* feat: vocoder / VAE compatibility check

* feat: possibly expand / cut audio waveform

* fix: pipeline call signature test

* fix: slow tests output len

* clean-up: make style

* make style

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: William Berman <WLBberman@gmail.com>

b94880e5

[docs] Add Colab notebooks and Spaces (#2713) · 1870fb05
Steven Liu authored Mar 23, 2023
```
* add colab notebook and spaces

* fix image link
```
1870fb05

Flax controlnet (#2727) · df91c447

YiYi Xu authored Mar 23, 2023



* add controlnet flax

---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>

df91c447

Skip `mps` in text-to-video tests (#2792) · aa0531fa
Pedro Cuenca authored Mar 23, 2023
```
* Skip mps in text-to-video tests.

* style

* Skip UNet3D mps tests.
```
aa0531fa

Update train_text_to_image_lora.py (#2767) · dc5b4e23

Haofan Wang authored Mar 23, 2023

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* format

dc5b4e23

[Docs] small fixes to the text to video doc. (#2787) · 0d7aac3e
Sayak Paul authored Mar 23, 2023
```
* small fixes to the text to video doc.

* add: Spaces link.

* add: warning on research-only model.
```
0d7aac3e

[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipeline (#2779) · 055c90f5

Nipun Jindal authored Mar 23, 2023



[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipelines
Co-authored-by: njindal <njindal@adobe.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

055c90f5

Music Spectrogram diffusion pipeline (#1044) · 2ef9bdd7

Kashif Rasul authored Mar 23, 2023

* initial TokenEncoder and ContinuousEncoder

* initial modules

* added ContinuousContextTransformer

* fix copy paste error

* use numpy for get_sequence_length

* initial terminal relative positional encodings

* fix weights keys

* fix assert

* cross attend style: concat encodings

* make style

* concat once

* fix formatting

* Initial SpectrogramPipeline

* fix input_tokens

* make style

* added mel output

* ignore weights for config

* move mel to numpy

* import pipeline

* fix class names and import

* moved models to models folder

* import ContinuousContextTransformer and SpectrogramDiffusionPipeline

* initial spec diffusion conversion script

* renamed config to t5config

* added weight loading

* use arguments instead of t5config

* broadcast noise time to batch dim

* fix call

* added scale_to_features

* fix weights

* transpose layernorm weight

* scale is a vector

* scale the...

2ef9bdd7

Rename 'CLIPFeatureExtractor' class to 'CLIPImageProcessor' (#2732) · 14e3a28c

Naoki Ainoya authored Mar 23, 2023

The 'CLIPFeatureExtractor' class has been renamed to 'CLIPImageProcessor' ahead of the old name's deprecation. This commit updates the affected files accordingly.

14e3a28c

[doc wip] literalinclude (#2718) · 8e35ef01
Mishig authored Mar 23, 2023

8e35ef01
[UNet3DModel] Fix with attn processor (#2790) · a8315ce1
Patrick von Platen authored Mar 23, 2023
```
* [UNet3DModel] Fix attn processor

* make style
```
a8315ce1
deduplicate training section in the docs. (#2788) · 0d633a42
Sayak Paul authored Mar 23, 2023

0d633a42

[Examples] InstructPix2Pix instruct training script (#2478) · 9dc84448

Sayak Paul authored Mar 23, 2023



* add: initial implementation of the pix2pix instruct training script.

* shorten cli arg.

* fix: main process check.

* fix: dataset column names.

* simplify tokenization.

* proper placement of null conditions.

* apply styling.

* remove debugging message for conditioning do.

* complete license.

* add: requirements.txt

* wandb column name order.

* fix: augmentation.

* change: dataset_id.

* fix: convert_to_np() call.

* fix: reshaping.

* fix: final ema copy.

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* address PR comments.

* add: readme details.

* config fix.

* downgrade version.

* reduce image width in the readme.

* note on hyperparameters during generation.

* add: output images.

* update readme.

* minor edits to readme.

* debugging statement.

* explicit placement of the pipeline.

* bump minimum diffusers version.

* fix: device attribute error.

* weight dtype.

* debugging.

* add dtype inform.

* add separate te and vae.

* add: explicit casting.

* remove casting.

* up.

* up 2.

* up 3.

* autocast.

* disable mixed-precision in the final inference.

* debugging information.

* autocasting.

* add: instructpix2pix training section to the docs.

* Empty-Commit

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

9dc84448

add: section on multiple controlnets. (#2762) · c681ad1a

Sayak Paul authored Mar 23, 2023



* add: section on multiple controlnets.
Co-authored-by: William Berman <WLBberman@gmail.com>

* fix: docs.

* fix: docs.

---------
Co-authored-by: William Berman <WLBberman@gmail.com>

c681ad1a

Support for Offset Noise in examples (#2753) · e0d8c9ef
Haofan Wang authored Mar 23, 2023
```
* add noise offset

* make style
```
e0d8c9ef

22 Mar, 2023 3 commits

`mps`: remove warmup passes (#2771) · 92e1164e

Pedro Cuenca authored Mar 22, 2023

* Remove warmup passes in mps tests.

* Update mps docs: no warmup pass in PyTorch 2

* Update imports.

92e1164e

[MS Text To Video] Add first text to video (#2738) · ca1a2229

Patrick von Platen authored Mar 22, 2023

* [MS Text To Video] Add first text to video

* upload

* make first model example

* match unet3d params

* make sure weights are correctly converted

* improve

* forward pass works, but diff result

* make forward work

* fix more

* finish

* refactor video output class.

* feat: add support for a video export utility.

* fix: opencv availability check.

* run make fix-copies.

* add: docs for the model components.

* add: standalone pipeline doc.

* edit docstring of the pipeline.

* add: right path to TransformerTempModel

* add: first set of tests.

* complete fast tests for text to video.

* fix bug

* up

* three fast tests failing.

* add: note on slow tests

* make work with all schedulers

* apply styling.

* add slow tests

* change file name

* update

* more correction

* more fixes

* finish

* up

* Apply suggestions from code review

* up

* finish

* make copies
...

ca1a2229

[docs] Clarify purpose of reproducibility docs (#2756) · 7fe88613
Steven Liu authored Mar 21, 2023
```
* clarify purpose of repro docs

* apply feedback
```
7fe88613