Commits · 7b39f43c06b73b6ac740104cd17a722be9cc81cf · renzhc / diffusers_dcu

"docker/vscode:/vscode.git/clone" did not exist on "5b279fbd53976731504db075ff7fabde43137d17"

15 Sep, 2023 1 commit
- [Wuerstchen] fix typos in docs (#5051) · 427feb53
  Kashif Rasul authored Sep 15, 2023
```
* fix typos in docs

* fix for issue  #5023
```
  427feb53
13 Sep, 2023 3 commits

Fix broken link in docs (#5015) · b954c22a
Lucain authored Sep 13, 2023
```
fix broken link
```
b954c22a

[Wuerstchen] fix compel usage (#4999) · 77373c5e

Kashif Rasul authored Sep 13, 2023



* fix compel usage

* minor changes in documentation

* fix tests

* fix more

* fix more

* typos

* fix tests

* formatting

---------
Co-authored-by: Dominic Rampas <d6582533@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

77373c5e

[SDXL] Add LoRA to all pipelines (#4896) · 324aef6d

Patrick von Platen authored Sep 13, 2023

* [SDXL] Add LoRA to all pipelines

* fix all

* fix all

* fix all

* fix more docs

* make style

324aef6d

11 Sep, 2023 3 commits

Wuerstchen fixes (#4942) · 16a056a7

Kashif Rasul authored Sep 11, 2023



* fix arguments and make example code work

* change arguments in combined test

* Add default timesteps

* style

* fixed test

* fix broken test

* formatting

* fix docstrings

* fix  num_images_per_prompt

* fix doc styles

* please dont change this

* fix tests

* rename to DEFAULT_STAGE_C_TIMESTEPS

---------
Co-authored-by: Dominic Rampas <d6582533@gmail.com>

16a056a7

Lazy Import for Diffusers (#4829) · b6e0b016

Dhruv Nair authored Sep 11, 2023



* initial commit

* move modules to import struct

* add dummy objects and _LazyModule

* add lazy import to schedulers

* clean up unused imports

* lazy import on models module

* lazy import for schedulers module

* add lazy import to pipelines module

* lazy import altdiffusion

* lazy import audio diffusion

* lazy import audioldm

* lazy import consistency model

* lazy import controlnet

* lazy import dance diffusion ddim ddpm

* lazy import deepfloyd

* lazy import kandinksy

* lazy imports

* lazy import semantic diffusion

* lazy imports

* lazy import stable diffusion

* move sd output to its own module

* clean up

* lazy import t2iadapter

* lazy import unclip

* lazy import versatile and vq diffsuion

* lazy import vq diffusion

* helper to fetch objects from modules

* lazy import sdxl

* lazy import txt2vid

* lazy import stochastic karras

* fix model imports

* fix bug

* lazy import

* clean up

* clean up

* fixes for tests

* fixes for tests

* clean up

* remove import of torch_utils from utils module

* clean up

* clean up

* fix mistake import statement

* dedicated modules for exporting and loading

* remove testing utils from utils module

* fixes from  merge conflicts

* Update src/diffusers/pipelines/kandinsky2_2/__init__.py

* fix docs

* fix alt diffusion copied from

* fix check dummies

* fix more docs

* remove accelerate import from utils module

* add type checking

* make style

* fix check dummies

* remove torch import from xformers check

* clean up error message

* fixes after upstream merges

* dummy objects fix

* fix tests

* remove unused module import

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

b6e0b016

[Docs] fix: minor formatting in the Würstchen docs (#4965) · 88735249
Sayak Paul authored Sep 11, 2023
```
fix: minor formatting in the docs
```
88735249

07 Sep, 2023 1 commit

[InstructPix2Pix] Fix pipeline implementation and add docs (#4844) · 9800cc5e

Sayak Paul authored Sep 07, 2023

* initial evident fixes.

* instructpix2pix fixes.

* add: entry to doc.

* address PR feedback.

* make fix-copies

9800cc5e

06 Sep, 2023 1 commit

Würstchen model (#3849) · 541bb6ee

Kashif Rasul authored Sep 06, 2023



* initial

* initial

* added initial convert script for paella vqmodel

* initial wuerstchen pipeline

* add LayerNorm2d

* added modules

* fix typo

* use model_v2

* embed clip caption amd negative_caption

* fixed name of var

* initial modules in one place

* WuerstchenPriorPipeline

* inital shape

* initial denoising prior loop

* fix output

* add WuerstchenPriorPipeline to __init__.py

* use the noise ratio in the Prior

* try to save pipeline

* save_pretrained working

* Few additions

* add _execution_device

* shape is int

* fix batch size

* fix shape of ratio

* fix shape of ratio

* fix output dataclass

* tests folder

* fix formatting

* fix float16 + started with generator

* Update pipeline_wuerstchen.py

* removed vqgan code

* add WuerstchenGeneratorPipeline

* fix WuerstchenGeneratorPipeline

* fix docstrings

* fix imports

* convert generator pipeline

* fix convert

* Work on Generator Pipeline. WIP

* Pipeline works with our diffuzz code

* apply scale factor

* removed vqgan.py

* use cosine schedule

* redo the denoising loop

* Update src/diffusers/models/resnet.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* use torch.lerp

* use warp-diffusion org

* clip_sample=False,

* some refactoring

* use model_v3_stage_c

* c_cond size

* use clip-bigG

* allow stage b clip to be None

* add dummy

* würstchen scheduler

* minor changes

* set clip=None in the pipeline

* fix attention mask

* add attention_masks to text_encoder

* make fix-copies

* add back clip

* add text_encoder

* gen_text_encoder and tokenizer

* fix import

* updated pipeline test

* undo changes to pipeline test

* nip

* fix typo

* fix output name

* set guidance_scale=0 and remove diffuze

* fix doc strings

* make style

* nip

* removed unused

* initial docs

* rename

* toc

* cleanup

* remvoe test script

* fix-copies

* fix multi images

* remove dup

* remove unused modules

* undo changes for debugging

* no  new line

* remove dup conversion script

* fix doc string

* cleanup

* pass default args

* dup permute

* fix some tests

* fix prepare_latents

* move Prior class to modules

* offload only the text encoder and vqgan

* fix resolution calculation for prior

* nip

* removed testing script

* fix shape

* fix argument to set_timesteps

* do not change .gitignore

* fix resolution calculations + readme

* resolution calculation fix + readme

* small fixes

* Add combined pipeline

* rename generator -> decoder

* Update .gitignore
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* removed efficient_net

* create combined WuerstchenPipeline

* make arguments consistent with VQ model

* fix var names

* no need to return text_encoder_hidden_states

* add latent_dim_scale to config

* split model into its own file

* add WuerschenPipeline to docs

* remove unused latent_size

* register latent_dim_scale

* update script

* update docstring

* use Attention preprocessor

* concat with normed input

* fix-copies

* add docs

* fix test

* fix style

* add to cpu_offloaded_model

* updated type

* remove 1-line func

* updated type

* initial decoder test

* formatting

* formatting

* fix autodoc link

* num_inference_steps is int

* remove comments

* fix example in docs

* Update src/diffusers/pipelines/wuerstchen/diffnext.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* rename layernorm to WuerstchenLayerNorm

* rename DiffNext to WuerstchenDiffNeXt

* added comment about MixingResidualBlock

* move paella vq-vae to pipelines' folder

* initial decoder test

* increased test_float16_inference expected diff

* self_attn is always true

* more passing decoder tests

* batch image_embeds

* fix failing tests

* set the correct dtype

* relax inference test

* update prior

* added combined pipeline test

* faster test

* faster test

* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix issues from review

* update wuerstchen.md + change generator name

* resolve issues

* fix copied from usage and add back batch_size

* fix API

* fix arguments

* fix combined test

* Added timesteps argument + fixes

* Update tests/pipelines/test_pipelines_common.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/wuerstchen/test_wuerstchen_prior.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py

* up

* Fix more

* failing tests

* up

* up

* correct naming

* correct docs

* correct docs

* fix test params

* correct docs

* fix classifier free guidance

* fix classifier free guidance

* fix more

* fix all

* make tests faster

---------
Co-authored-by: Dominic Rampas <d6582533@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Dominic Rampas <61938694+dome272@users.noreply.github.com>

541bb6ee

05 Sep, 2023 1 commit
- [docs] Add stronger warning for SDXL height/width (#4867) · 946bb53c
  Steven Liu authored Sep 05, 2023
```
* add size warning

* feedback
```
  946bb53c
02 Sep, 2023 1 commit
- [docs] Shap-E guide (#4700) · 2c45a53a
  Steven Liu authored Sep 01, 2023
```
* first draft

* fixes

* more fixes

* fix toctree
```
  2c45a53a
01 Sep, 2023 3 commits

[docs] DiffEdit guide (#4722) · 22ea35cf
Steven Liu authored Sep 01, 2023
```
* first draft

* minor edits
```
22ea35cf
Fix link from API to using-diffusers (#4856) · 60d259ad
Pedro Cuenca authored Sep 01, 2023
```
* Fix link from API to using-diffusers

* Fix link
```
60d259ad

Add GLIGEN Text Image implementation (#4777) · 38466c36

Nguyễn Công Tú Anh authored Sep 01, 2023

* Add GLIGEN Text Image implementation

* add style transfer from image

* fix check_repository_consistency

* add convert script GLIGEN model to Diffusers

* rename attention type

* fix style code

* remove PositionNetTextImage

* Revert "fix check_repository_consistency"

This reverts commit 15f098c96e00bb9e67b831161615b30a2d28d815.

* change attention type name

* update docs for GLIGEN

* change examples with hf-document-image

* fix style

* add CLIPImageProjection for GLIGEN

* Add new encode_prompt, load project matrix in pipe init

* move CLIPImageProjection to stable_diffusion

* add comment

38466c36

31 Aug, 2023 1 commit

[docs] ControlNet guide (#4640) · aedd7876

Steven Liu authored Aug 31, 2023

* first draft

* finish first draft

* feedback and remove sections from API pages

* clean docstrings

* add full code example

aedd7876

30 Aug, 2023 1 commit

[docs] SDXL (#4428) · a1fdfca3

Steven Liu authored Aug 30, 2023

* first draft

* reorg toctree

* note about minsdxl

* feedback

* fix

* micro-conditionings

* add tip

* fix section levels

* d'oh fix pipeline names

* feedback

* remove old section

a1fdfca3

29 Aug, 2023 1 commit

add models for T2I-Adapter-XL (#4696) · 12358b98

Chong Mou authored Aug 29, 2023



* T2I-Adapter-XL

* update

* update

* add pipeline

* modify pipeline

* modify pipeline

* modify pipeline

* modify pipeline

* modify pipeline

* modify modeling_text_unet

* fix styling.

* fix: copies.

* adapter settings

* new test case

* new test case

* debugging

* debugging

* debugging

* debugging

* debugging

* debugging

* debugging

* debugging

* revert prints.

* new test case

* remove print

* org test case

* add test_pipeline

* styling.

* fix copies.

* modify test parameter

* style.

* add adapter-xl doc

* double quotes in docs

* Fix potential type mismatch

* style.

---------
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>

12358b98

26 Aug, 2023 1 commit

[Core] Support negative conditions in SDXL (#4774) · 3be0ff90

Sayak Paul authored Aug 26, 2023

* add: support negative conditions.

* fix: key

* add: tests

* address PR feedback.

* add documentation

* add img2img support.

* add inpainting support.

* ad controlnet support

* Apply suggestions from code review

* modify wording in the doc.

3be0ff90

25 Aug, 2023 1 commit

Convert MusicLDM (#4579) · b1290d3f

Sanchit Gandhi authored Aug 25, 2023



* from audioldm

* fix vae

* move to new pipeline

* copied from audioldm

* remove redundant control flow

* iterate

* fix docstring

* finish pipeline

* tests: from audioldm2

* iterate

* finish fast tests

* finish slow integration tests

* add docs

* remove dtype test

* update toctree

* "copied from" in conversion (where possible)

* Update docs/source/en/api/pipelines/musicldm.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* fix docstring

* make nightly

* style

* fix dtype test

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

b1290d3f

24 Aug, 2023 1 commit

[AudioLDM2] Doc fixes (#4739) · 24c5e770

Sanchit Gandhi authored Aug 24, 2023



* [AudioLDM2] Doc fixes

* update docstrings

* fix unet docstring

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

24c5e770

23 Aug, 2023 1 commit
- [AudioLDM Docs] Fix docs for output (#4737) · 05b0ec63
  Sanchit Gandhi authored Aug 23, 2023
  
  05b0ec63
22 Aug, 2023 1 commit
- [docs] Add note in UniDiffusers Doc about PyTorch 1.X numerical stability issue (#4703) · f75b8aa9
  dg845 authored Aug 21, 2023
```
* Add note regarding UniDiffuser pipeline numerical stability issues on PyTorch 1.X

* Use the doc-builder warning tag.
```
  f75b8aa9
21 Aug, 2023 1 commit

Add AudioLDM 2 (#4549) · 7a24977c

Sanchit Gandhi authored Aug 21, 2023



* from audioldm

* unet down + mid

* vae, clap, flan-t5

* start sequence audio mae

* iterate on audioldm encoder

* finish encoder

* finish weight conversion

* text pre-processing

* gpt2 pre-processing

* fix projection model

* working

* unet equivalence

* finish in base

* add unet cond

* finish unet

* finish custom unet

* start clean-up

* revert base unet changes

* refactor pre-processing

* tests: from audioldm

* fix some tests

* more fixes

* iterate on tests

* make fix copies

* harden fast tests

* slow integration tests

* finish tests

* update checkpoint

* update copyright

* docs

* remove outdated method

* add docstring

* make style

* remove decode latents

* enable cpu offload

* (text_encoder_1, tokenizer_1) -> (text_encoder, tokenizer)

* more clean up

* more refactor

* build pr docs

* Update docs/source/en/api/pipelines/audioldm2.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* small clean

* tidy conversion

* update for large checkpoint

* generate -> generate_language_model

* full clap model

* shrink clap-audio in tests

* fix large integration test

* fix fast tests

* use generation config

* make style

* update docs

* finish docs

* finish doc

* update tests

* fix last test

* syntax

* finalise tests

* refactor projection model in prep for TTS

* fix fast tests

* style

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

7a24977c

17 Aug, 2023 2 commits
- make things clear in the controlnet sdxl doc. (#4644) · 5333f4c0
  Sayak Paul authored Aug 17, 2023
  
  5333f4c0
- [docs] MultiControlNet (#4635) · bdc4c326
  Steven Liu authored Aug 16, 2023
```
multicontrolnet docs
```
  bdc4c326
16 Aug, 2023 2 commits

[docs] PushToHubMixin (#4622) · 4ff7264d
Steven Liu authored Aug 16, 2023
```
* push to hub docs

* fix typo

* feedback

* make style
```
4ff7264d

Add GLIGEN implementation (#4441) · da5ab51d

nikhil-masterful authored Aug 15, 2023

* Add GLIGEN implementation

* GLIGEN: Fix code quality check failures

* GLIGEN: Fix Import block un-sorted or un-formatted failures

* GLIGEN: Fix check_repository_consistency failures

* GLIGEN: Add 'PositionNet' to versatile_diffusion/modeling_text_unet.py

* GLIGEN: check_repository_consistency: fix 'copy does not match' error

* GLIGEN: Fix review comments (1)

* GLIGEN: Fix E721 Do not compare types, use `isinstance()` failures

* GLIGEN : Ensure _encode_prompt() copy matches to StableDiffusionPipeline

* GLIGEN: Fix ruff E721 failure in unidiffuser/test_unidiffuser.py

* GLIGEN: doc_builder: restyle pipeline_stable_diffusion_gligen.py

* GIGLEN: reset files unrelated to gligen

* GLIGEN: Fix documentation comments (1)

* GLIGEN: Fix review comments (2)

* GLIGEN: Added FastTest

* GLIGEN: Fix review comments (3)

da5ab51d

15 Aug, 2023 2 commits

add: pushtohubmixin to pipelines and schedulers docs overview. (#4607) · a7508a76

Sayak Paul authored Aug 15, 2023



* add: pushtohubmixin to pipelines and schedulers docs overview.

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

a7508a76

[Pipeline utils] feat: implement push_to_hub for standalone models, schedulers... · 15782fd5

Sayak Paul authored Aug 15, 2023


[Pipeline utils] feat: implement push_to_hub for standalone models, schedulers as well as pipelines (#4128)

* feat: implement push_to_hub for standalone models.

* address PR feedback.

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* remove max_shard_size.

* add: support for scheduler push_to_hub

* enable push_to_hub support for flax schedulers.

* enable push_to_hub for pipelines.

* Apply suggestions from code review
Co-authored-by: Lucain <lucainp@gmail.com>

* reflect pr feedback.

* address another round of deedback.

* better handling of kwargs.

* add: tests

* Apply suggestions from code review
Co-authored-by: Lucain <lucainp@gmail.com>

* setting hub staging to False for now.

* incorporate staging test as a separate job.
Co-authored-by: ydshieh <2521628+ydshieh@users.noreply.github.com>

* fix: tokenizer loading.

* fix: json dumping.

* move is_staging_test to a better location.

* better treatment to tokens.

* define repo_id to better handle concurrency

* style

* explicitly set token

* Empty-Commit

* move SUER, TOKEN to test

* collate org_repo_id

* delete repo

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Lucain <lucainp@gmail.com>
Co-authored-by: ydshieh <2521628+ydshieh@users.noreply.github.com>

15782fd5

12 Aug, 2023 1 commit

[Utility] adds an image grid utility (#4576) · d67eba0f

Sayak Paul authored Aug 12, 2023

* add: utility for image grid.

* add: return type.

* change necessary places.

* add to utility page.

d67eba0f

10 Aug, 2023 3 commits
- [Doc] update sdxl-controlnet repo name (#4564) · 3df52ba8
  YiYi Xu authored Aug 10, 2023
```
* rename

* style

---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
```
  3df52ba8
- improve controlnet sdxl docs now that we have a good checkpoint. (#4556) · c697c5ab
  Sayak Paul authored Aug 10, 2023
  
  c697c5ab
- Revert "introduce minimalistic reimplementation of SDXL on the SDXL doc" (#4548) · 5cbcbe3c
  Patrick von Platen authored Aug 10, 2023
```
Revert "introduce minimalistic reimplementation of SDXL on the SDXL doc (#4532)"

This reverts commit e7e37494.
```
  5cbcbe3c
09 Aug, 2023 2 commits
- [docs] Clean scheduler api (#4204) · 16ad13b6
  Steven Liu authored Aug 09, 2023
```
* clean scheduler mixin

* up to dpmsolvermultistep

* finish cleaning

* first draft

* fix overview table

* apply feedback

* update reference code
```
  16ad13b6
- introduce minimalistic reimplementation of SDXL on the SDXL doc (#4532) · e7e37494
  Simo Ryu authored Aug 09, 2023
```
minsdxl
```
  e7e37494
02 Aug, 2023 2 commits

[Feat] add tiny Autoencoder for (almost) instant decoding (#4384) · 18fc40c1

Sayak Paul authored Aug 02, 2023



* add: model implementation of tiny autoencoder.

* add: inits.

* push the latest devs.

* add: conversion script and finish.

* add: scaling factor args.

* debugging

* fix denormalization.

* fix: positional argument.

* handle use_torch_2_0_or_xformers.

* handle post_quant_conv

* handle dtype

* fix: sdxl image processor for tiny ae.

* fix: sdxl image processor for tiny ae.

* unify upcasting logic.

* copied from madness.

* remove trailing whitespace.

* set is_tiny_vae = False

* address PR comments.

* change to AutoencoderTiny

* make act_fn an str throughout

* fix: apply_forward_hook decorator call

* get rid of the special is_tiny_vae flag.

* directly scale the output.

* fix dummies?

* fix: act_fn.

* get rid of the Clamp() layer.

* bring back copied from.

* movement of the blocks to appropriate modules.

* add: docstrings to AutoencoderTiny

* add: documentation.

* changes to the conversion script.

* add doc entry.

* settle tests.

* style

* add one slow test.

* fix

* fix 2

* fix 2

* fix: 4

* fix: 5

* finish integration tests

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* style

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

18fc40c1

[docs] AutoPipeline tutorial (#4273) · ae82a3eb

Steven Liu authored Aug 02, 2023



* first draft

* tidy api

* apply feedback

* mdx to md

* apply feedback

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

ae82a3eb

01 Aug, 2023 2 commits

[ldm3d] documentation fixing typos (#4284) · 05a1cb90

estelleafl authored Aug 01, 2023



* fixed typo

* updated doc to be consistent in naming

* make style/quality

* preprocessing for 4 channels and not 6

* make style

* test for 4c

* make style/quality

* fixed test on cpu

* fixed doc typo

* changed default ckpt to 4c

* Update pipeline_stable_diffusion_ldm3d.py

---------
Co-authored-by: Aflalo <estellea@isl-iam1.rr.intel.com>
Co-authored-by: Aflalo <estellea@isl-gpu33.rr.intel.com>
Co-authored-by: Aflalo <estellea@isl-gpu38.rr.intel.com>

05a1cb90

[AutoPipeline] Correct naming (#4420) · c69526a3
Patrick von Platen authored Aug 01, 2023

c69526a3

28 Jul, 2023 1 commit
- fix fp type in t2i adapter docs (#4350) · 2b178673
  Will Berman authored Jul 28, 2023
  
  2b178673