Commits · 94a0c644a8ce5b05a969859e0814ef4883ac870e · renzhc / diffusers_dcu

10 May, 2023 1 commit

add: a warning message when using xformers in a PT 2.0 env. (#3365) · 94a0c644

Sayak Paul authored May 10, 2023



* add: a warning message when using xformers in a PT 2.0 env.

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

94a0c644
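The check described by this commit title can be pictured roughly as follows. This is a minimal sketch, not the code from the commit: the function names, the way the PyTorch version is passed in as a string, and the warning text are all assumptions made for illustration.

```python
import warnings


def parse_major(version: str) -> int:
    """Return the major component of a version string such as '2.0.1+cu118'."""
    return int(version.split(".")[0])


def maybe_warn_about_xformers(torch_version: str, xformers_enabled: bool) -> bool:
    """Warn when xformers attention is enabled in a PyTorch 2.0 (or later) environment.

    Hypothetical helper: returns True if a warning was issued, False otherwise.
    """
    if xformers_enabled and parse_major(torch_version) >= 2:
        warnings.warn(
            # Illustrative wording only; the real message is not shown in the log.
            "xformers attention is enabled in a PyTorch 2.0 environment, which "
            "already provides a native efficient attention implementation.",
            UserWarning,
            stacklevel=2,
        )
        return True
    return False
```

Calling `maybe_warn_about_xformers("2.0.1+cu118", True)` emits the warning, while a 1.x version string or `xformers_enabled=False` stays silent.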

09 May, 2023 2 commits

[docs] Improve safetensors docstring (#3368) · 26832aa5
Steven Liu authored May 09, 2023
```
* clarify safetensor docstring

* fix typo

* apply feedback
```
26832aa5

if dreambooth lora (#3360) · a757b2db

Will Berman authored May 09, 2023

* update IF stage I pipelines

add fixed variance schedulers and lora loading

* added kv lora attn processor

* allow loading into alternative lora attn processor

* make vae optional

* throw away predicted variance

* allow loading into added kv lora layer

* allow load T5

* allow pre compute text embeddings

* set new variance type in schedulers

* fix copies

* refactor all prompt embedding code

class prompts are now included in pre-encoding code
max tokenizer length is now configurable
embedding attention mask is now configurable

* fix for when variance type is not defined on scheduler

* do not pre compute validation prompt if not present

* add example test for if lora dreambooth

* add check for train text encoder and pre compute text embeddings

a757b2db

05 May, 2023 1 commit

Add upsample_size to AttnUpBlock2D, AttnDownBlock2D (#3275) · 36f43ea7

Will Rice authored May 05, 2023

The argument `upsample_size` needs to be added to these modules to allow compatibility with other blocks that require this argument.

36f43ea7

02 May, 2023 1 commit

[Torch 2.0 compile] Fix more torch compile breaks (#3313) · 5c7a35a2

Patrick von Platen authored May 02, 2023



* Fix more torch compile breaks

* add tests

* Fix all

* fix controlnet

* fix more

* Add Horace He as co-author.
Co-authored-by: Horace He <horacehe2007@yahoo.com>

* Add Horace He as co-author.
Co-authored-by: Horace He <horacehe2007@yahoo.com>

---------
Co-authored-by: Horace He <horacehe2007@yahoo.com>

5c7a35a2

01 May, 2023 1 commit

Torch compile graph fix (#3286) · 0e82fb19

Patrick von Platen authored May 01, 2023

* fix more

* Fix more

* fix more

* Apply suggestions from code review

* fix

* make style

* make fix-copies

* fix

* make sure torch compile

* Clean

* fix test

0e82fb19

28 Apr, 2023 1 commit
- Allow disabling torch 2_0 attention (#3273) · 4d35d7fe
  Patrick von Platen authored Apr 28, 2023
```
* Allow disabling torch 2_0 attention

* make style

* Update src/diffusers/models/attention.py
```
  4d35d7fe
26 Apr, 2023 1 commit
- Allow fp16 attn for x4 upscaler (#3239) · abbf3c1a
  Patrick von Platen authored Apr 26, 2023
```
* Add all files

* update

* Make sure vae is memory efficient for PT 1

* make style
```
  abbf3c1a
25 Apr, 2023 1 commit

add model (#3230) · e51f19ae

Patrick von Platen authored Apr 25, 2023



* add

* clean

* up

* clean up more

* fix more tests

* Improve docs further

* improve

* more fixes docs

* Improve docs more

* Update src/diffusers/models/unet_2d_condition.py

* fix

* up

* update doc links

* make fix-copies

* add safety checker and watermarker to stage 3 doc page code snippets

* speed optimizations docs

* memory optimization docs

* make style

* add watermarking snippets to doc string examples

* make style

* use pt_to_pil helper functions in doc strings

* skip mps tests

* Improve safety

* make style

* new logic

* fix

* fix bad onnx design

* make new stable diffusion upscale pipeline model arguments optional

* define has_nsfw_concept when non-pil output type

* lowercase linked to notebook name

---------
Co-authored-by: William Berman <WLBberman@gmail.com>

e51f19ae

24 Apr, 2023 1 commit
- [Bug fix] Fix batch size attention head size mismatch (#3214) · c5933c9c
  Patrick von Platen authored Apr 25, 2023
  
  c5933c9c
22 Apr, 2023 1 commit
- Make sure VAE attention works with Torch 2_0 (#3200) · 425192fe
  Patrick von Platen authored Apr 22, 2023
```
* Make sure attention works with Torch 2_0

* make style

* Fix more
```
  425192fe
21 Apr, 2023 1 commit
- make `from_flax` work for controlnet (#3161) · bc0392a0
  YiYi Xu authored Apr 21, 2023
```
fix from_flax
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
  bc0392a0
20 Apr, 2023 1 commit

adding custom diffusion training to diffusers examples (#3031) · 3979aac9

nupurkmr9 authored Apr 20, 2023



* diffusers==0.14.0 update

* custom diffusion update

* custom diffusion update

* custom diffusion update

* custom diffusion update

* custom diffusion update

* custom diffusion update

* custom diffusion

* custom diffusion

* custom diffusion

* custom diffusion

* custom diffusion

* apply formatting and get rid of bare except.

* refactor readme and other minor changes.

* misc refactor.

* fix: repo_id issue and loaders logging bug.

* fix: save_model_card.

* fix: save_model_card.

* fix: save_model_card.

* add: doc entry.

* refactor doc.

* custom diffusion

* custom diffusion

* custom diffusion

* apply style.

* remove trailing whitespace.

* fix: toctree entry.

* remove unnecessary print.

* custom diffusion

* custom diffusion

* custom diffusion test

* custom diffusion xformer update

* custom diffusion xformer update

* custom diffusion xformer update

---------
Co-authored-by: Nupur Kumari <nupurkumari@Nupurs-MacBook-Pro.local>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Nupur Kumari <nupurkumari@nupurs-mbp.wifi.local.cmu.edu>

3979aac9

19 Apr, 2023 1 commit
- Correct `Transformer2DModel.forward` docstring (#3074) · c8fdfe45
  Chanchana Sornsoontorn authored Apr 19, 2023
```
⚙️chore(transformer_2d) update function signature for encoder_hidden_states
```
  c8fdfe45
18 Apr, 2023 2 commits

class labels timestep embeddings projection dtype cast (#3137) · fc188391
Will Berman authored Apr 18, 2023
```
This mimics the dtype cast for the standard time embeddings
```
fc188391

Add unet act fn to other model components (#3136) · f0c74e9a

Will Berman authored Apr 18, 2023

Adding act fn config to the unet timestep class embedding and conv
activation.

The custom activation defaults to silu which is the default
activation function for both the conv act and the timestep class
embeddings so default behavior is not changed.

The only unet which uses the custom activation is the stable diffusion
latent upscaler https://huggingface.co/stabilityai/sd-x2-latent-upscaler/blob/main/unet/config.json
(I ran a script against the hub to confirm).
The latent upscaler does not use the conv activation nor the timestep
class embeddings so we don't change its behavior.

f0c74e9a
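The "defaults to silu" behaviour described in f0c74e9a can be sketched as a name-to-function lookup. This is an illustrative sketch on plain floats, not the diffusers implementation: `resolve_activation` and the `ACTIVATIONS` table are hypothetical names, and the set of supported activations is an assumption.

```python
import math


def silu(x: float) -> float:
    """SiLU / swish: x * sigmoid(x)."""
    return x / (1.0 + math.exp(-x))


def mish(x: float) -> float:
    """Mish: x * tanh(softplus(x))."""
    return x * math.tanh(math.log1p(math.exp(x)))


def gelu(x: float) -> float:
    """Exact GELU using the error function."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


# Hypothetical lookup table; the names accepted by the real config may differ.
ACTIVATIONS = {"silu": silu, "swish": silu, "mish": mish, "gelu": gelu}


def resolve_activation(act_fn: str = "silu"):
    """Return the activation for a config string, defaulting to silu."""
    try:
        return ACTIVATIONS[act_fn]
    except KeyError:
        raise ValueError(f"Unsupported activation function: {act_fn}") from None
```

Because the default argument is `"silu"`, a config that never sets the activation keeps the previous behaviour, which is the point the commit message makes.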

17 Apr, 2023 1 commit

Fix config deprecation (#3129) · 703307ef

Patrick von Platen authored Apr 17, 2023



* Better deprecation message

* Better deprecation message

* Better doc string

* Fixes

* fix more

* fix more

* Improve __getattr__

* correct more

* fix more

* fix

* Improve more

* more improvements

* fix more

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* make style

* Fix all rest & add tests & remove old deprecation fns

---------
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

703307ef

16 Apr, 2023 1 commit
- Add global pooling to controlnet (#3121) · cfc99adf
  Patrick von Platen authored Apr 16, 2023
  
  cfc99adf
14 Apr, 2023 1 commit

Add to support Guess Mode for StableDiffusionControlnetPipleline (#2998) · 5c9dd0af

Takuma Mori authored Apr 14, 2023

* add guess mode (WIP)

* fix uncond/cond order

* support guidance_scale=1.0 and batch != 1

* remove magic coeff

* add docstring

* add integration test

* add document to controlnet.mdx

* made the comments a bit more explanatory

* fix table

5c9dd0af

12 Apr, 2023 2 commits

[WIP] implement rest of the test cases (LoRA tests) (#2824) · 9d7c08f9

Andy authored Apr 12, 2023



* initial commit for lora test cases

* help a bit with lora for 3d

* fixed lora tests

* replaced redundant code

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

9d7c08f9

Flax memory efficient attention (#2889) · dc277501

Pedro Cuenca authored Apr 12, 2023



* add use_memory_efficient params placeholder

* test

* add memory efficient attention jax

* add memory efficient attention jax

* newline

* forgot dot

* Rename use_memory_efficient

* Keep dtype last.

* Actually use key_chunk_size

* Rename symbol

* Apply style

* Rename use_memory_efficient

* Keep dtype last

* Pass `use_memory_efficient_attention` in `from_pretrained`

* Move JAX memory efficient attention to attention_flax.

* Simple test.

* style

---------
Co-authored-by: muhammad_hanif <muhammad_hanif@sofcograha.co.id>
Co-authored-by: MuhHanif <48muhhanif@gmail.com>

dc277501

11 Apr, 2023 8 commits

Attn added kv processor torch 2.0 block (#3023) · ea39cd7e
Will Berman authored Apr 11, 2023
```
add AttnAddedKVProcessor2_0 block
```
ea39cd7e

Attention processor cross attention norm group norm (#3021) · 98c5e5da

Will Berman authored Apr 11, 2023

add group norm type to attention processor cross attention norm

This lets the cross attention norm use both a group norm block and a
layer norm block.

The group norm operates along the channels dimension
and requires input shape (batch size, channels, *), whereas the layer norm with a single
`normalized_shape` dimension only operates over the least significant
dimension, i.e. (*, channels).

The channels we want to normalize are the hidden dimension of the encoder hidden states.

By convention, the encoder hidden states are always passed as (batch size, sequence
length, hidden states).

This means the layer norm can operate on the tensor without modification, but the group
norm requires flipping the last two dimensions to operate on (batch size, hidden states, sequence length).

All existing attention processors will have the same logic and we can
consolidate it in a helper function `prepare_encoder_hidden_states`

prepare_encoder_hidden_states -> norm_encoder_hidden_states re: @patrickvonplaten

move norm_cross defined check to outside norm_encoder_hidden_states

add missing attn.norm_cross check

98c5e5da
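The shape handling described in 98c5e5da can be illustrated without any framework. The sketch below works on nested Python lists and omits the learnable scale and shift. Only the name `norm_encoder_hidden_states` comes from the commit message; its signature and the other helpers are assumptions made for illustration.

```python
import math


def _normalize(values, eps=1e-5):
    """Zero-mean, unit-variance normalization of a flat list of floats."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return [(v - mean) / math.sqrt(var + eps) for v in values]


def transpose_last_two(x):
    """(batch, a, b) -> (batch, b, a) for a nested list."""
    return [[list(col) for col in zip(*sample)] for sample in x]


def layer_norm(x):
    """Normalize over the last dimension of (batch, sequence length, hidden)."""
    return [[_normalize(vec) for vec in sample] for sample in x]


def group_norm(x, num_groups):
    """Normalize (batch, channels, length) per group of channels."""
    out = []
    for sample in x:
        if len(sample) % num_groups != 0:
            raise ValueError("channels must be divisible by num_groups")
        size = len(sample) // num_groups
        length = len(sample[0])
        normed = []
        for g in range(num_groups):
            group = sample[g * size:(g + 1) * size]
            flat = _normalize([v for channel in group for v in channel])
            # Split the flat normalized values back into one row per channel.
            normed.extend(flat[i * length:(i + 1) * length] for i in range(size))
        out.append(normed)
    return out


def norm_encoder_hidden_states(x, kind, num_groups=1):
    """x has shape (batch size, sequence length, hidden states)."""
    if kind == "layer_norm":
        return layer_norm(x)  # operates on the last dimension directly
    if kind == "group_norm":
        # Group norm expects channels second, so flip, normalize, flip back.
        return transpose_last_two(group_norm(transpose_last_two(x), num_groups))
    raise ValueError(f"unknown norm type: {kind}")
```

In both branches the output keeps the (batch size, sequence length, hidden states) layout, so callers do not need to know which norm was configured.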

unet time embedding activation function (#3048) · 2d52e81c

Will Berman authored Apr 11, 2023

* unet time embedding activation function

* typo act_fn -> time_embedding_act_fn

* flatten conditional

2d52e81c

Fix typo and format BasicTransformerBlock attributes (#2953) · 52c4d32d

Chanchana Sornsoontorn authored Apr 12, 2023

* ⚙️chore(train_controlnet) fix typo in logger message

* ⚙️chore(models) refactor modules order; make them the same as calling order

When printing the BasicTransformerBlock to stdout, I think it's crucial that the attributes are shown in the proper order. Also, the "3. Feed Forward" comment previously did not make sense: it should have been next to self.ff, but it was next to self.norm3 instead.

* correct many tests

* remove bogus file

* make style

* correct more tests

* finish tests

* fix one more

* make style

* make unclip deterministic

* ⚙️chore(models/attention) reorganize comments in BasicTransformerBlock class

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

52c4d32d

add only cross attention to simple attention blocks (#3011) · c6180a31

Will Berman authored Apr 11, 2023

* add only cross attention to simple attention blocks

* add test for only_cross_attention re: @patrickvonplaten

* mid_block_only_cross_attention better default

allow mid_block_only_cross_attention to default to
`only_cross_attention` when `only_cross_attention` is given
as a single boolean

c6180a31
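The defaulting rule described in c6180a31 can be sketched as a small resolver. This is an assumption-laden illustration, not the code from the commit: the function name `resolve_only_cross_attention`, the `num_blocks` parameter, and the fallback to `False` when a per-block list is given are all hypothetical.

```python
def resolve_only_cross_attention(
    only_cross_attention,
    num_blocks,
    mid_block_only_cross_attention=None,
):
    """Return (per_block_flags, mid_block_flag).

    If `only_cross_attention` is a single boolean and no explicit mid-block
    value is given, the mid block inherits that boolean.
    """
    if isinstance(only_cross_attention, bool):
        if mid_block_only_cross_attention is None:
            mid_block_only_cross_attention = only_cross_attention
        only_cross_attention = [only_cross_attention] * num_blocks

    # Assumed fallback when a per-block list was given without a mid-block value.
    if mid_block_only_cross_attention is None:
        mid_block_only_cross_attention = False

    return list(only_cross_attention), mid_block_only_cross_attention
```

For example, `resolve_only_cross_attention(True, 3)` marks all three blocks and the mid block, while passing a per-block list leaves the mid block at the assumed fallback unless it is set explicitly.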

Update documentation (#2996) · cb63febf

George Ogden authored Apr 11, 2023

* Update documentation

Based on sampling, the width and height must be powers of 2 as the samples halve in size each time

* make style

cb63febf

`AttentionProcessor.group_norm` num_channels should be `query_dim` (#3046) · 8c6b47cf

Will Berman authored Apr 11, 2023

* `AttentionProcessor.group_norm` num_channels should be `query_dim`

The group_norm on the attention processor should really norm the number
of channels in the query _not_ the inner dim. This wasn't caught before
because the group_norm is only used by the added kv attention processors
and the added kv attention processors are only used by the karlo models
which are configured such that the inner dim is the same as the query
dim.

* add_{k,v}_proj should be projecting to inner_dim

8c6b47cf
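The distinction drawn in 8c6b47cf is easier to see with numbers. The sketch below assumes the usual definition `inner_dim = heads * dim_head`. The helper name and the dictionary keys are hypothetical and only label which dimension each layer should use according to the commit message.

```python
def attention_dims(query_dim: int, heads: int, dim_head: int) -> dict:
    """Report the dimensions the commit message assigns to each layer."""
    inner_dim = heads * dim_head  # assumed definition of the inner dimension
    return {
        "inner_dim": inner_dim,
        # The group norm normalizes the query channels, not the inner dim.
        "group_norm_num_channels": query_dim,
        # The added key/value projections project to the inner dim.
        "add_k_proj_out_features": inner_dim,
        "add_v_proj_out_features": inner_dim,
    }


# When query_dim == inner_dim the two choices coincide, which is why the
# mismatch was not caught before; with different values they diverge.
same = attention_dims(query_dim=512, heads=8, dim_head=64)
different = attention_dims(query_dim=320, heads=8, dim_head=64)
```

With `query_dim=320`, `heads=8`, and `dim_head=64`, the group norm would use 320 channels while the added key/value projections target 512 features.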

Fix config prints and save, load of pipelines (#2849) · 8b451eb6

Patrick von Platen authored Apr 11, 2023

* [Config] Fix config prints and save, load

* Only use potential nn.Modules for dtype and device

* Correct vae image processor

* make sure in_channels is not accessed directly

* make sure in channels is only accessed via config

* Make sure schedulers only access config attributes

* Make sure to access config in SAG

* Fix vae processor and make style

* add tests

* uP

* make style

* Fix more naming issues

* Final fix with vae config

* change more

8b451eb6

10 Apr, 2023 5 commits
- add `encoder_hid_dim` to unet · c413353e
  William Berman authored Apr 08, 2023
```
`encoder_hid_dim` provides an additional projection for the input `encoder_hidden_states` from `encoder_hidden_dim` to `cross_attention_dim`
```
  c413353e
- allow unet varying number of layers per block · 8db5e5b3
  William Berman authored Apr 08, 2023
  
  8db5e5b3
- resnet skip time activation and output scale factor · 707341ae
  William Berman authored Apr 08, 2023
  
  707341ae
- add missing AttnProcessor2_0 to AttentionProcessor union · 18ebd57b
  William Berman authored Apr 08, 2023
  
  18ebd57b
- fix simple attention processor encoder hidden states ordering · b6cc0502
  William Berman authored Apr 07, 2023
  
  b6cc0502
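The extra projection described in c413353e above maps the last dimension of `encoder_hidden_states` from `encoder_hid_dim` to `cross_attention_dim`. A framework-free sketch of that shape change follows. The factory name `make_projection`, the random initialization, and the zero bias are assumptions for illustration only.

```python
import random


def make_projection(encoder_hid_dim: int, cross_attention_dim: int, seed: int = 0):
    """Build a linear map from encoder_hid_dim to cross_attention_dim."""
    rng = random.Random(seed)
    # One weight row per output feature; values are arbitrary placeholders.
    weight = [
        [rng.uniform(-0.1, 0.1) for _ in range(encoder_hid_dim)]
        for _ in range(cross_attention_dim)
    ]
    bias = [0.0] * cross_attention_dim

    def project(encoder_hidden_states):
        """(batch, sequence, encoder_hid_dim) -> (batch, sequence, cross_attention_dim)."""
        return [
            [
                [sum(w * v for w, v in zip(row, vec)) + b for row, b in zip(weight, bias)]
                for vec in sample
            ]
            for sample in encoder_hidden_states
        ]

    return project


project = make_projection(encoder_hid_dim=6, cross_attention_dim=4)
states = [[[1.0] * 6 for _ in range(3)] for _ in range(2)]  # (2, 3, 6)
projected = project(states)  # (2, 3, 4)
```

Only the last dimension changes, so the batch and sequence dimensions pass through untouched before the states reach cross attention.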
30 Mar, 2023 1 commit

add load textual inversion embeddings to stable diffusion (#2009) · a937e1b5

Pi Esposito authored Mar 30, 2023



* add load textual inversion embeddings draft

* fix quality

* fix typo

* make fix copies

* move to textual inversion mixin

* make it accept from sd-concept library

* accept list of paths to embeddings

* fix styling of stable diffusion pipeline

* add dummy TextualInversionMixin

* add docstring to textualinversionmixin

* add load textual inversion embeddings draft

* fix quality

* fix typo

* make fix copies

* move to textual inversion mixin

* make it accept from sd-concept library

* accept list of paths to embeddings

* fix styling of stable diffusion pipeline

* add dummy TextualInversionMixin

* add docstring to textualinversionmixin

* add case for parsing embedding from auto1111 UI format
Co-authored-by: Evan Jones <evan.a.jones3@gmail.com>
Co-authored-by: Ana Tamais <aninhamoraestamais@gmail.com>

* fix style after rebase

* move textual inversion mixin to loaders

* move mixin inheritance to DiffusionPipeline from StableDiffusionPipeline

* update dummy class name

* addressed all comments

* fix old dangling import

* fix style

* proposal

* remove bogus

* Apply suggestions from code review
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Will Berman <wlbberman@gmail.com>

* finish

* make style

* up

* fix code quality

* fix code quality - again

* fix code quality - 3

* fix alt diffusion code quality

* fix model editing pipeline

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Finish

---------
Co-authored-by: Evan Jones <evan.a.jones3@gmail.com>
Co-authored-by: Ana Tamais <aninhamoraestamais@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Will Berman <wlbberman@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

a937e1b5

28 Mar, 2023 2 commits
- [2761]: Add documentation for extra_in_channels UNet1DModel (#2817) · 53377ef8
  Nipun Jindal authored Mar 28, 2023
```
Co-authored-by: njindal <njindal@adobe.com>
```
  53377ef8
- [Init] Make sure shape mismatches are caught early (#2847) · 42d95017
  Patrick von Platen authored Mar 28, 2023
```
Improve init
```
  42d95017
27 Mar, 2023 2 commits

Helper function to disable custom attention processors (#2791) · b10f5275

Pedro Cuenca authored Mar 27, 2023

* Helper function to disable custom attention processors.

* Restore code deleted by mistake.

* Format

* Fix modeling_text_unet copy.

b10f5275

Ruff: apply same rules as in transformers (#2827) · 1d7b4b60

Pedro Cuenca authored Mar 27, 2023

* Apply same ruff settings as in transformers

See https://github.com/huggingface/transformers/blob/main/pyproject.toml

Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>

* Apply new style rules

* Style
Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>

* style

* remove list, ruff wouldn't auto fix.

---------
Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>

1d7b4b60

23 Mar, 2023 1 commit

Add AudioLDM (#2232) · b94880e5

Sanchit Gandhi authored Mar 23, 2023



* Add AudioLDM

* up

* add vocoder

* start unet

* unconditional unet

* clap, vocoder and vae

* clean-up: conversion scripts

* fix: conversion script token_type_ids

* clean-up: pipeline docstring

* tests: from SD

* clean-up: cpu offload vocoder instead of safety checker

* feat: adapt tests to audioldm

* feat: add docs

* clean-up: amend pipeline docstrings

* clean-up: make style

* clean-up: make fix-copies

* fix: add doc path to toctree

* clean-up: args for conversion script

* clean-up: paths to checkpoints

* fix: use conditional unet

* clean-up: make style

* fix: type hints for UNet

* clean-up: docstring for UNet

* clean-up: make style

* clean-up: remove duplicate in docstring

* clean-up: make style

* clean-up: make fix-copies

* clean-up: move imports to start in code snippet

* fix: pass cross_attention_dim as a list/tuple to unet

* clean-up: make fix-copies

* fix: update checkpoint path

* fix: unet cross_attention_dim in tests

* film embeddings -> class embeddings

* Apply suggestions from code review
Co-authored-by: Will Berman <wlbberman@gmail.com>

* fix: unet film embed to use existing args

* fix: unet tests to use existing args

* fix: make style

* fix: transformers import and version in init

* clean-up: make style

* Revert "clean-up: make style"

This reverts commit 5d6d1f8b324f5583e7805dc01e2c86e493660d66.

* clean-up: make style

* clean-up: use pipeline tester mixin tests where poss

* clean-up: skip attn slicing test

* fix: add torch dtype to docs

* fix: remove conversion script out of src

* fix: remove .detach from 1d waveform

* fix: reduce default num inf steps

* fix: swap height/width -> audio_length_in_s

* clean-up: make style

* fix: remove nightly tests

* fix: imports in conversion script

* clean-up: slim-down to two slow tests

* clean-up: slim-down fast tests

* fix: batch consistent tests

* clean-up: make style

* clean-up: remove vae slicing fast test

* clean-up: propagate changes to doc

* fix: increase test tol to 1e-2

* clean-up: finish docs

* clean-up: make style

* feat: vocoder / VAE compatibility check

* feat: possibly expand / cut audio waveform

* fix: pipeline call signature test

* fix: slow tests output len

* clean-up: make style

* make style

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: William Berman <WLBberman@gmail.com>

b94880e5