Commits · 73cc43109b62a744f49eb803fef4c6d4e5211b7e · renzhc / diffusers_dcu

28 Apr, 2023 1 commit
- Update logging.mdx (#2863) · 73cc4310
  M. Tolga Cangöz authored Apr 28, 2023
```
Fix typos
```
27 Apr, 2023 3 commits

Update IF name to XL (#3262) · eade4308
apolinário authored Apr 27, 2023
```
Co-authored-by: multimodalart <joaopaulo.passos+multimodal@gmail.com>
```
[docs] Update interface in repaint.mdx (#3119) · fa31da29
Ernie Chu authored Apr 27, 2023
```
Update repaint.mdx

accommodate #1701
```

[2064]: Add stochastic sampler (sample_dpmpp_sde) (#3020) · fd512d74

Nipun Jindal authored Apr 27, 2023



* [2064]: Add stochastic sampler

* [2064]: Add stochastic sampler

* [2064]: Add stochastic sampler

* [2064]: Add stochastic sampler

* [2064]: Add stochastic sampler

* [2064]: Add stochastic sampler

* [2064]: Add stochastic sampler

* Review comments

* [Review comment]: Add is_torchsde_available()

* [Review comment]: Test and docs

* [Review comment]

* [Review comment]

* [Review comment]

* [Review comment]

* [Review comment]

---------
Co-authored-by: njindal <njindal@adobe.com>


26 Apr, 2023 2 commits
- [docs] only mention one stage (#3246) · c399de39
  Pedro Cuenca authored Apr 26, 2023
```
* [docs] only mention one stage

* add blurb on auto accepting

---------
Co-authored-by: William Berman <WLBberman@gmail.com>
```
- [AudioLDM] Update docs to use updated ckpt (#3240) · 46ceba5b
  Sanchit Gandhi authored Apr 26, 2023
```
* [AudioLDM] Update docs to use updated ckpt

* make style
```
25 Apr, 2023 2 commits

add model (#3230) · e51f19ae

Patrick von Platen authored Apr 25, 2023



* add

* clean

* up

* clean up more

* fix more tests

* Improve docs further

* improve

* more fixes docs

* Improve docs more

* Update src/diffusers/models/unet_2d_condition.py

* fix

* up

* update doc links

* make fix-copies

* add safety checker and watermarker to stage 3 doc page code snippets

* speed optimizations docs

* memory optimization docs

* make style

* add watermarking snippets to doc string examples

* make style

* use pt_to_pil helper functions in doc strings

* skip mps tests

* Improve safety

* make style

* new logic

* fix

* fix bad onnx design

* make new stable diffusion upscale pipeline model arguments optional

* define has_nsfw_concept when non-pil output type

* lowercase linked to notebook name

---------
Co-authored-by: William Berman <WLBberman@gmail.com>


Add ControlNet v1.1 docs (#3226) · 131312ca
Patrick von Platen authored Apr 25, 2023
```
Add v1.1 docs
```

19 Apr, 2023 1 commit

add from_ckpt method as Mixin (#2318) · 86ecd4b7

1lint authored Apr 19, 2023



* add mixin class for pipeline from original sd ckpt

* Improve

* make style

* merge main into

* Improve more

* fix more

* up

* Apply suggestions from code review

* finish docs

* rename

* make style

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


14 Apr, 2023 1 commit

Add support for Guess Mode in StableDiffusionControlNetPipeline (#2998) · 5c9dd0af

Takuma Mori authored Apr 14, 2023

* add guess mode (WIP)

* fix uncond/cond order

* support guidance_scale=1.0 and batch != 1

* remove magic coeff

* add docstring

* add integration test

* add document to controlnet.mdx

* made the comments a bit more explanatory

* fix table


12 Apr, 2023 3 commits

[Docs] refactor text-to-video zero (#3049) · fa736e32
Sayak Paul authored Apr 12, 2023
```
* fix: norm group test for UNet3D.

* refactor text-to-video zero docs.
```
[Docs] update Self-Attention Guidance docs (#2952) · 0df47efe
Susung Hong authored Apr 12, 2023
```
* Update index.mdx

* Edit docs & add HF space link

* Only change equation numbers in comments
```

[LoRA] Enabling limited LoRA support for text encoder (#2918) · a89a14fa

Sayak Paul authored Apr 12, 2023

* add: first draft for a better LoRA enabler.

* make fix-copies.

* feat: backward compatibility.

* add: entry to the docs.

* add: tests.

* fix: docs.

* fix: norm group test for UNet3D.

* feat: add support for flat dicts.

* add deprecation message instead of warning.


10 Apr, 2023 1 commit

[Pipeline] Add TextToVideoZeroPipeline (#2954) · ba49272d

Andranik Movsisyan authored Apr 11, 2023



* add TextToVideoZeroPipeline and CrossFrameAttnProcessor

* add docs for text-to-video zero

* add teaser image for text-to-video zero docs

* Fix review changes. Add Documentation. Add test

* clean up the codes in pipeline_text_to_video.py. Add descriptive comments and docstrings

* make style && make quality

* make fix-copies

* make requested changes to docs. use huggingface server links for resources, delete res folder

* make style && make quality && make fix-copies

* make style && make quality

* Apply suggestions from code review

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


07 Apr, 2023 1 commit

docs: Link Navigation Path API Pipelines (#2976) · ce144d6d

Guspan Tanadi authored Apr 08, 2023

* docs: link navigation Safe Stable Diffusion

Link navigation API pipelines text2img and using diffusers Conditional Image Generation.

* docs: link navigation Versatile Diffusion

Removing exceeding path Stable Diffusion Overview.

* docs: Python extension Spectrogram Diffusion

Link navigation Spectrogram Diffusion Pipeline source code

* docs: Link navigation AltDiffusion Pipelines

Stable Diffusion Overview and Using Diffusers path.


04 Apr, 2023 7 commits
- Removing explicit markdown extension (#2944) · f3e72e9e
  Guspan Tanadi authored Apr 04, 2023
```
Trigger from previous PR. Build the page once again.
```
- Update ddpm.mdx (#2929) · 4fd7e97f
  M. Tolga Cangöz authored Apr 04, 2023
  
- Update ddim.mdx (#2926) · 4a1eae07
  M. Tolga Cangöz authored Apr 04, 2023
  
- Update score_sde_vp.mdx (#2938) · e329edff
  M. Tolga Cangöz authored Apr 04, 2023
  
- Update score_sde_ve.mdx (#2937) · 3e2d1af8
  M. Tolga Cangöz authored Apr 04, 2023
  
- Update unipc.mdx (#2936) · 715c25d3
  M. Tolga Cangöz authored Apr 04, 2023
  
- Update euler_ancestral.mdx (#2932) · 4274a3a9
  M. Tolga Cangöz authored Apr 04, 2023
  
31 Mar, 2023 3 commits
- Update controlnet.mdx (#2912) · c4335626
  M. Tolga Cangöz authored Mar 31, 2023
```
.
```
- Update image_variation.mdx (#2911) · 89b23d98
  M. Tolga Cangöz authored Mar 31, 2023
```
.
```
- Have fix current pipeline link (#2910) · 419660c9
  Guspan Tanadi authored Mar 31, 2023
```
Also capitalize notebook provider name
```
30 Mar, 2023 1 commit

[Docs] add an example use for `StableUnCLIPPipeline` in the pipeline docs (#2897) · b2021273

Sayak Paul authored Mar 30, 2023



* improve stable unclip doc.

* add: entry of StableUnCLIPPipeline to the docs

* Apply suggestions from code review
Co-authored-by: apolinario <joaopaulo.passos@gmail.com>

---------
Co-authored-by: apolinario <joaopaulo.passos@gmail.com>


28 Mar, 2023 5 commits
- Update stable_diffusion_safe.mdx (#2870) · 628fefb2
  M. Tolga Cangöz authored Mar 28, 2023
```
Fix typos
```
- Update paint_by_example.mdx (#2869) · 03fe36f1
  M. Tolga Cangöz authored Mar 28, 2023
```
.
```
- Update alt_diffusion.mdx (#2865) · ef4c2fa4
  M. Tolga Cangöz authored Mar 28, 2023
```
Fix typos
```
- Update overview.mdx (#2864) · 3980858a
  M. Tolga Cangöz authored Mar 28, 2023
```
Fix typos
```
- improve stable unclip doc. (#2823) · fab4f3d6
  Sayak Paul authored Mar 28, 2023
  
24 Mar, 2023 2 commits

[Docs] update docs (Stable unCLIP) to reflect the updated ckpts. (#2815) · 5883d8d4

Sayak Paul authored Mar 24, 2023



* update docs to reflect the updated ckpts.

* update: point about prompt.

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* remove image resizing.

* Apply suggestions from code review

* Apply suggestions from code review

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


Add ModelEditing pipeline (#2721) · 37a44bb2

Bahjat Kawar authored Mar 24, 2023



* TIME first commit

* styling.

* styling 2.

* fixes; tests

* apply styling and doc fix.

* remove sups.

* fixes

* remove temp file

* move augmentations to const

* added doc entry

* code quality

* customize augmentations

* quality

* quality

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


23 Mar, 2023 6 commits

Add AudioLDM (#2232) · b94880e5

Sanchit Gandhi authored Mar 23, 2023



* Add AudioLDM

* up

* add vocoder

* start unet

* unconditional unet

* clap, vocoder and vae

* clean-up: conversion scripts

* fix: conversion script token_type_ids

* clean-up: pipeline docstring

* tests: from SD

* clean-up: cpu offload vocoder instead of safety checker

* feat: adapt tests to audioldm

* feat: add docs

* clean-up: amend pipeline docstrings

* clean-up: make style

* clean-up: make fix-copies

* fix: add doc path to toctree

* clean-up: args for conversion script

* clean-up: paths to checkpoints

* fix: use conditional unet

* clean-up: make style

* fix: type hints for UNet

* clean-up: docstring for UNet

* clean-up: make style

* clean-up: remove duplicate in docstring

* clean-up: make style

* clean-up: make fix-copies

* clean-up: move imports to start in code snippet

* fix: pass cross_attention_dim as a list/tuple to unet

* clean-up: make fix-copies

* fix: update checkpoint path

* fix: unet cross_attention_dim in tests

* film embeddings -> class embeddings

* Apply suggestions from code review
Co-authored-by: Will Berman <wlbberman@gmail.com>

* fix: unet film embed to use existing args

* fix: unet tests to use existing args

* fix: make style

* fix: transformers import and version in init

* clean-up: make style

* Revert "clean-up: make style"

This reverts commit 5d6d1f8b324f5583e7805dc01e2c86e493660d66.

* clean-up: make style

* clean-up: use pipeline tester mixin tests where poss

* clean-up: skip attn slicing test

* fix: add torch dtype to docs

* fix: remove conversion script out of src

* fix: remove .detach from 1d waveform

* fix: reduce default num inf steps

* fix: swap height/width -> audio_length_in_s

* clean-up: make style

* fix: remove nightly tests

* fix: imports in conversion script

* clean-up: slim-down to two slow tests

* clean-up: slim-down fast tests

* fix: batch consistent tests

* clean-up: make style

* clean-up: remove vae slicing fast test

* clean-up: propagate changes to doc

* fix: increase test tol to 1e-2

* clean-up: finish docs

* clean-up: make style

* feat: vocoder / VAE compatibility check

* feat: possibly expand / cut audio waveform

* fix: pipeline call signature test

* fix: slow tests output len

* clean-up: make style

* make style

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: William Berman <WLBberman@gmail.com>


Flax controlnet (#2727) · df91c447

YiYi Xu authored Mar 23, 2023



* add controlnet flax

---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>


[Docs] small fixes to the text to video doc. (#2787) · 0d7aac3e
Sayak Paul authored Mar 23, 2023
```
* small fixes to the text to video doc.

* add: Spaces link.

* add: warning on research-only model.
```

Music Spectrogram diffusion pipeline (#1044) · 2ef9bdd7

Kashif Rasul authored Mar 23, 2023



* initial TokenEncoder and ContinuousEncoder

* initial modules

* added ContinuousContextTransformer

* fix copy paste error

* use numpy for get_sequence_length

* initial terminal relative positional encodings

* fix weights keys

* fix assert

* cross attend style: concat encodings

* make style

* concat once

* fix formatting

* Initial SpectrogramPipeline

* fix input_tokens

* make style

* added mel output

* ignore weights for config

* move mel to numpy

* import pipeline

* fix class names and import

* moved models to models folder

* import ContinuousContextTransformer and SpectrogramDiffusionPipeline

* initial spec diffusion conversion script

* renamed config to t5config

* added weight loading

* use arguments instead of t5config

* broadcast noise time to batch dim

* fix call

* added scale_to_features

* fix weights

* transpose layernorm weight

* scale is a vector

* scale the query outputs

* added comment

* undo scaling

* undo depth_scaling

* initial get_extended_attention_mask

* attention_mask is none in self-attention

* cleanup

* manually invert attention

* nn.linear need bias=False

* added T5LayerFFCond

* remove to fix conflict

* make style and dummy

* remove unused variables

* remove predict_epsilon

* Move accelerate to a soft-dependency (#1134)

* finish

* finish

* Update src/diffusers/modeling_utils.py

* Update src/diffusers/pipeline_utils.py
Co-authored-by: Anton Lozhkov <anton@huggingface.co>

* more fixes

* fix
Co-authored-by: Anton Lozhkov <anton@huggingface.co>

* fix order

* added initial midi to note token data pipeline

* added int to int tokenizer

* remove duplicate

* added logic for segments

* add melgan to pipeline

* move autoregressive gen into pipeline

* added note_representation_processor_chain

* fix dtypes

* remove immutabledict req

* initial doc

* use np.where

* require note_seq

* fix typo

* update dependency

* added note-seq to test

* added is_note_seq_available

* fix import

* added toc

* added example usage

* undo for now

* moved docs

* fix merge

* fix imports

* predict first segment

* avoid un-needed copy to and from cpu

* make style

* Copyright

* fix style

* add test and fix inference steps

* remove bogus files

* reorder models

* up

* remove transformers dependency

* make work with diffusers cross attention

* clean more

* remove @

* improve further

* up

* uP

* Apply suggestions from code review

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py

* loop over all tokens

* make style

* Added a section on the model

* fix formatting

* grammar

* formatting

* make fix-copies

* Update src/diffusers/pipelines/__init__.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/pipelines/spectrogram_diffusion/pipeline_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* added callback and optional onnx

* do not squeeze batch dim

* clean up more

* upload

* convert jax to numpy

* make style

* fix warning

* make fix-copies

* fix warning

* add initial fast tests

* add initial pipeline_params

* eval mode due to dropout

* skip batch tests as pipeline runs on a single file

* make style

* fix relative path

* fix doc tests

* Update src/diffusers/models/t5_film_transformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/models/t5_film_transformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update docs/source/en/api/pipelines/spectrogram_diffusion.mdx
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* add MidiProcessor

* format

* fix org

* Apply suggestions from code review

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py

* make style

* pin protobuf to <4

* fix formatting

* white space

* tensorboard needs protobuf

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Anton Lozhkov <anton@huggingface.co>


Rename 'CLIPFeatureExtractor' class to 'CLIPImageProcessor' (#2732) · 14e3a28c

Naoki Ainoya authored Mar 23, 2023

The 'CLIPFeatureExtractor' class name has been renamed to 'CLIPImageProcessor' in order to comply with future deprecation. This commit includes the necessary changes to the affected files.


add: section on multiple controlnets. (#2762) · c681ad1a

Sayak Paul authored Mar 23, 2023



* add: section on multiple controlnets.
Co-authored-by: William Berman <WLBberman@gmail.com>

* fix: docs.

* fix: docs.

---------
Co-authored-by: William Berman <WLBberman@gmail.com>


22 Mar, 2023 1 commit

[MS Text To Video] Add first text to video (#2738) · ca1a2229

Patrick von Platen authored Mar 22, 2023



* [MS Text To Video] Add first text to video

* upload

* make first model example

* match unet3d params

* make sure weights are correctly converted

* improve

* forward pass works, but diff result

* make forward work

* fix more

* finish

* refactor video output class.

* feat: add support for a video export utility.

* fix: opencv availability check.

* run make fix-copies.

* add: docs for the model components.

* add: standalone pipeline doc.

* edit docstring of the pipeline.

* add: right path to TransformerTempModel

* add: first set of tests.

* complete fast tests for text to video.

* fix bug

* up

* three fast tests failing.

* add: note on slow tests

* make work with all schedulers

* apply styling.

* add slow tests

* change file name

* update

* more correction

* more fixes

* finish

* up

* Apply suggestions from code review

* up

* finish

* make copies

* fix pipeline tests

* fix more tests

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* apply suggestions

* up

* revert

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
