Commits · a784be2ebe6fe87258798affe5d90227b34c04ea · renzhc / diffusers_dcu

30 Sep, 2022 2 commits

Allow resolutions that are not multiples of 64 (#505) · a784be2e

Josh Achiam authored Sep 30, 2022



* Allow resolutions that are not multiples of 64

* ran black

* fix bug

* add test

* more explanation

* more comments
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a784be2e

Optimize Stable Diffusion (#371) · 9ebaea54

Nouamane Tazi authored Sep 30, 2022

* initial commit

* make UNet stream capturable

* try to fix noise_pred value

* remove cuda graph and keep NB

* non blocking unet with PNDMScheduler

* make timesteps np arrays for pndm scheduler
because lists don't get formatted to tensors in `self.set_format`

* make max async in pndm

* use channel last format in unet

* avoid moving timesteps device in each unet call

* avoid memcpy op in `get_timestep_embedding`

* add `channels_last` kwarg to `DiffusionPipeline.from_pretrained`

* update TODO

* replace `channels_last` kwarg with `memory_format` for more generality

* revert the channels_last changes to leave it for another PR

* remove non_blocking when moving input ids to device

* remove blocking from all .to() operations at beginning of pipeline

* fix merging

* fix merging

* model can run in other precisions without autocast

* attn refactoring

* Revert "attn refactoring"

This reverts commit 0c70c0e189cd2c4d8768274c9fcf5b940ee310fb.

* remove restriction to run conv_norm in fp32

* use `baddbmm` instead of `matmul` in attention for better perf

* removing all reshapes to test perf

* Revert "removing all reshapes to test perf"

This reverts commit 006ccb8a8c6bc7eb7e512392e692a29d9b1553cd.

* add shapes comments

* hardcode what's needed for jitting

* Revert "hardcode what's needed for jitting"

This reverts commit 2fa9c698eae2890ac5f8e367ca80532ecf94df9a.

* Revert "remove restriction to run conv_norm in fp32"

This reverts commit cec592890c32da3d1b78d38b49e4307aedf459b9.

* revert using `baddbmm` in attention's forward

* cleanup comment

* remove restriction to run conv_norm in fp32. no quality loss was noticed

This reverts commit cc9bc1339c998ebe9e7d733f910c6d72d9792213.

* add more optimization techniques to docs

* Revert "add shapes comments"

This reverts commit 31c58eadb8892f95478cdf05229adf678678c5f4.

* apply suggestions

* make quality

* apply suggestions

* styling

* `scheduler.timesteps` are now arrays so we don't need .to()

* remove useless .type()

* use mean instead of max in `test_stable_diffusion_inpaint_pipeline_k_lms`

* move scheduler timesteps to correct device if tensors

* add device to `set_timesteps` in LMSD scheduler

* `self.scheduler.set_timesteps` now uses device arg for schedulers that accept it

* quick fix

* styling

* remove kwargs from schedulers `set_timesteps`

* revert to using max in K-LMS inpaint pipeline test

* Revert "`self.scheduler.set_timesteps` now uses device arg for schedulers that accept it"

This reverts commit 00d5a51e5c20d8d445c8664407ef29608106d899.

* move timesteps to correct device before loop in SD pipeline

* apply previous fix to other SD pipelines

* UNet now accepts tensor timesteps even on the wrong device, to avoid errors
- it shouldn't affect performance if timesteps are already on the correct device
- it does slow down performance if they're on the wrong device

* fix pipeline when timesteps are arrays with strides

9ebaea54

29 Sep, 2022 3 commits
- Renamed x -> hidden_states in resnet.py (#676) · a7058f42
  Partho authored Sep 30, 2022
```
renamed x to hidden_states
```
  a7058f42
- `trained_betas` ignored in some schedulers (#635) · 3dacbb94
  V Vishnu Anirudh authored Sep 29, 2022
```
* correcting the beta value assignment

* updating DDIM and LMSDiscreteFlax schedulers

* bringing back the changes that were lost as part of main branch merge
```
  3dacbb94
- Flax `from_pretrained`: clean up `mismatched_keys`. (#630) · f10576ad
  Pedro Cuenca authored Sep 29, 2022
```
Flax from_pretrained: clean up `mismatched_keys`.

Originally removed in 73e0bc692c5761e55faff39c80a26d7a3cfc748c.
```
  f10576ad
28 Sep, 2022 1 commit

Fix the LMS pytorch regression (#664) · 765506ce

Anton Lozhkov authored Sep 28, 2022

* Fix the LMS pytorch regression

* Copy over the changes from #637

* Copy over the changes from #637

* Fix betas test

765506ce

27 Sep, 2022 10 commits

Fix `main`: stable diffusion pipelines cannot be loaded (#655) · 235770dd

Pedro Cuenca authored Sep 27, 2022

* Replace deprecation warning f-string with class name.

When `__repr__` is invoked on the instance, serialization of
`config_dict` fails because it contains `kwargs` of type `<class
inspect._empty>`.

* Revert "Replace deprecation warning f-string with class name."

This reverts commit 1c4eb8cb104374bd84e43865fc3865862473799c.

* Do not attempt to register `"kwargs"` as an attribute.

Otherwise serialization could fail.
This may happen for other attributes, so we should create a better
solution.

235770dd
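
The failure mode described in #655 can be reproduced with the standard library alone. The sketch below is not the diffusers code: `collect_init_args` and `ToyScheduler` are hypothetical names, and the only assumption is that a config dict is built from an `__init__` signature and then serialized to JSON.

```python
import inspect
import json


def collect_init_args(init_fn, drop=("self", "kwargs")):
    """Collect the default values of an __init__ signature, skipping names in `drop`."""
    params = inspect.signature(init_fn).parameters
    return {name: p.default for name, p in params.items() if name not in drop}


class ToyScheduler:  # hypothetical class, for illustration only
    def __init__(self, num_train_timesteps=1000, **kwargs):
        self.num_train_timesteps = num_train_timesteps


# Keeping "kwargs" stores the sentinel class `inspect._empty` in the dict,
# and json cannot serialize a class object.
with_kwargs = collect_init_args(ToyScheduler.__init__, drop=("self",))
try:
    json.dumps(with_kwargs)
    serialization_failed = False
except TypeError:
    serialization_failed = True

# Dropping "kwargs" leaves only plain, serializable values.
without_kwargs = collect_init_args(ToyScheduler.__init__)
config_json = json.dumps(without_kwargs)
```

As the commit message notes, other attributes could hit the same problem, so filtering one name is a stopgap rather than a general fix.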

Fix onnx tensor format (#654) · d8572f20
Anton Lozhkov authored Sep 27, 2022
```
fix np onnx
```
d8572f20
[Pytorch] add dep. warning for pytorch schedulers (#651) · 85494e88
Kashif Rasul authored Sep 27, 2022
```
* add dep. warning for schedulers

* fix format
```
85494e88
[DDIM, DDPM] fix add_noise (#648) · 33045382
Suraj Patil authored Sep 27, 2022
```
fix add noise
```
33045382

[Pytorch] Pytorch only schedulers (#534) · bd8df2da

Kashif Rasul authored Sep 27, 2022



* pytorch only schedulers

* fix style

* remove match_shape

* pytorch only ddpm

* remove SchedulerMixin

* remove numpy from karras_ve

* fix types

* remove numpy from lms_discrete

* remove numpy from pndm

* fix typo

* remove mixin and numpy from sde_vp and ve

* remove remaining tensor_format

* fix style

* sigmas has to be torch tensor

* removed set_format in readme

* remove set format from docs

* remove set_format from pipelines

* update tests

* fix typo

* continue to use mixin

* fix imports

* removed unused imports

* match shape instead of assuming image shapes

* remove import typo

* update call to add_noise

* use math instead of numpy

* fix t_index

* removed commented out numpy tests

* timesteps needs to be discrete

* cast timesteps to int in flax scheduler too

* fix device mismatch issue

* small fix

* Update src/diffusers/schedulers/scheduling_pndm.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

bd8df2da

Fix `SpatialTransformer` (#578) · d886e497

Yih-Dar authored Sep 27, 2022



* Fix SpatialTransformer

* Fix SpatialTransformer
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d886e497

Flax pipeline pndm (#583) · ab3fd671

Pedro Cuenca authored Sep 27, 2022



* WIP: flax FlaxDiffusionPipeline & FlaxStableDiffusionPipeline

* todo comment

* Fix imports

* Fix imports

* add dummies

* Fix empty init

* make pipeline work

* up

* Allow dtype to be overridden on model load.

This may be a temporary solution until #567 is addressed.

* Convert params to bfloat16 or fp16 after loading.

This deals with the weights, not the model.

* Use Flax schedulers (typing, docstring)

* PNDM: replace control flow with jax functions.

Otherwise jitting/parallelization don't work properly as they don't know
how to deal with traced objects.

I temporarily removed `step_prk`.

* Pass latents shape to scheduler set_timesteps()

PNDMScheduler uses it to reserve space, other schedulers will just
ignore it.

* Wrap model imports inside availability checks.

* Optionally return state in from_config.

Useful for Flax schedulers.

* Do not convert model weights to dtype.

* Re-enable PRK steps with functional implementation.

Values returned still not verified for correctness.

* Remove left over has_state var.

* make style

* Apply suggestion list -> tuple
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply suggestion list -> tuple
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Remove unused comments.

* Use zeros instead of empty.
Co-authored-by: Mishig Davaadorj <dmishig@gmail.com>
Co-authored-by: Mishig Davaadorj <mishig.davaadorj@coloradocollege.edu>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

ab3fd671

Remove inappropriate docstrings in LMS docstrings. (#634) · c070e5f0
Pedro Cuenca authored Sep 27, 2022

c070e5f0
Remove deprecated `torch_device` kwarg (#623) · b671cb09
Pedro Cuenca authored Sep 27, 2022
```
* Remove deprecated `torch_device` kwarg.

* Remove unused imports.
```
b671cb09

Warning for too long prompts in DiffusionPipelines (Resolve #447) (#472) · f7ebe569

Yuta Hayashibe authored Sep 27, 2022

* Return encoded texts by DiffusionPipelines

* Updated README to show how to use encoded_text_input

* Reverted examples in README.md

* Reverted all

* Warning for long prompts

* Fix bugs

* Formatted

f7ebe569
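
The behaviour that #472 settled on, warning when a prompt is too long, can be sketched as below. This is an illustration under assumptions, not the pipeline's implementation: `truncate_with_warning` is a hypothetical helper, token ids are plain integers, and 77 is only an example limit.

```python
import warnings


def truncate_with_warning(token_ids, max_length):
    """Truncate `token_ids` to `max_length`, warning about what was dropped."""
    if len(token_ids) <= max_length:
        return list(token_ids)
    removed = token_ids[max_length:]
    warnings.warn(
        f"Prompt has {len(token_ids)} tokens but the text encoder handles at most "
        f"{max_length}; the last {len(removed)} tokens were dropped.",
        stacklevel=2,
    )
    return list(token_ids[:max_length])


# Example: an 80-token prompt against an assumed limit of 77 tokens.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    kept = truncate_with_warning(list(range(80)), max_length=77)
    short = truncate_with_warning([1, 2, 3], max_length=77)
```

Warning instead of raising keeps existing calls working while still telling the user that part of the prompt was ignored.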

24 Sep, 2022 2 commits

Fix formula for noise levels in Karras scheduler and tests (#627) · 35e92096
Grigory Sizov authored Sep 24, 2022
```
fix formula for noise levels in karras scheduler and tests
```
35e92096

docs: `src/diffusers` readability improvements (#629) · d0aa899f

Ryan Russell authored Sep 24, 2022



* docs: `src/diffusers` readability improvements
Signed-off-by: Ryan Russell <git@ryanrussell.org>

* docs: `make style` lint
Signed-off-by: Ryan Russell <git@ryanrussell.org>
Signed-off-by: Ryan Russell <git@ryanrussell.org>

d0aa899f

23 Sep, 2022 6 commits

Fix breaking error: "ort is not defined" (#626) · 1e152030
Pedro Cuenca authored Sep 23, 2022
```
Fix "ort is not defined" issue.
```
1e152030
Allow passing session_options for ORT backend (#620) · 8211b622
cloudhan authored Sep 23, 2022

8211b622

refactor: pipelines readability improvements (#622) · ce31f83d

Ryan Russell authored Sep 23, 2022



* refactor: pipelines readability improvements
Signed-off-by: Ryan Russell <git@ryanrussell.org>

* docs: remove todo comment from flax pipeline
Signed-off-by: Ryan Russell <git@ryanrussell.org>
Signed-off-by: Ryan Russell <git@ryanrussell.org>

ce31f83d

fix docs: change sample to images (#613) · b00382e2
Abdullah Alfaraj authored Sep 23, 2022
```
the result of running the pipeline is stored in StableDiffusionPipelineOutput.images
```
b00382e2

Flax documentation (#589) · 8b0be935

Younes Belkada authored Sep 23, 2022



* documenting `attention_flax.py` file

* documenting `embeddings_flax.py`

* documenting `unet_blocks_flax.py`

* Add new objs to doc page

* document `vae_flax.py`

* Apply suggestions from code review

* modify `unet_2d_condition_flax.py`

* make style

* Apply suggestions from code review

* make style

* Apply suggestions from code review

* fix indent

* fix typo

* fix indent unet

* Update src/diffusers/models/vae_flax.py

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Mishig Davaadorj <dmishig@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

8b0be935

docs: `.md` readability fixups (#619) · df80ccf7
Ryan Russell authored Sep 23, 2022
```
Signed-off-by: Ryan Russell <git@ryanrussell.org>
```
df80ccf7

22 Sep, 2022 4 commits

Adding pred_original_sample to SchedulerOutput for some samplers (#614) · 91db8189

Jonathan Whitaker authored Sep 22, 2022

* Adding pred_original_sample to SchedulerOutput of DDPMScheduler, DDIMScheduler, LMSDiscreteScheduler, KarrasVeScheduler step methods so we can access the predicted denoised outputs

* Gave DDPMScheduler, DDIMScheduler and LMSDiscreteScheduler their own output dataclasses so the default SchedulerOutput in scheduling_utils does not need pred_original_sample as an optional extra

* Reordered library imports to follow standard

* didn't get import order quite right apparently

* Forgot to change name of LMSDiscreteSchedulerOutput

* Aha, needed some extra libs for make style to fully work

91db8189
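
The design described in #614 (a generic output type plus scheduler-specific output dataclasses that add `pred_original_sample`) might look roughly like this. It is a sketch only: `ToySchedulerOutput` and `toy_step` are hypothetical, samples are plain lists rather than tensors, and the update rule is made-up arithmetic, not any real scheduler's formula.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SchedulerOutput:
    """Generic output: only the sample for the previous timestep."""

    prev_sample: List[float]


@dataclass
class ToySchedulerOutput(SchedulerOutput):
    """Scheduler-specific output that also carries the denoised estimate."""

    pred_original_sample: Optional[List[float]] = None


def toy_step(model_output, sample, alpha):
    """Made-up update rule, used only to populate both fields."""
    pred_original = [s - alpha * e for s, e in zip(sample, model_output)]
    prev_sample = [(s + x0) / 2 for s, x0 in zip(sample, pred_original)]
    return ToySchedulerOutput(prev_sample=prev_sample, pred_original_sample=pred_original)


out = toy_step(model_output=[0.5, 1.0], sample=[1.0, 2.0], alpha=1.0)
```

Because the extra field lives on the subclass, code that only reads `prev_sample` from the generic `SchedulerOutput` is unaffected.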

[UNet2DConditionModel] add gradient checkpointing (#461) · e7120bae

Suraj Patil authored Sep 22, 2022

* add grad ckpt to downsample blocks

* make it work

* don't pass gradient_checkpointing to upsample block

* add tests for UNet2DConditionModel

* add test_gradient_checkpointing

* add gradient_checkpointing for up and down blocks

* add functions to enable and disable grad ckpt

* remove the forward argument

* better naming

* make supports_gradient_checkpointing private

e7120bae
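
The enable/disable functions and the private support flag mentioned in #461 can be pictured with a plain-Python stand-in. This sketch shows only the toggling logic, not checkpointing itself: `ToyModel` and `Block` are hypothetical, and the method names are assumptions modelled on the commit message.

```python
class Block:
    """Stand-in for a UNet block; a real block would checkpoint its forward pass."""

    def __init__(self):
        self.gradient_checkpointing = False


class ToyModel:
    # Private flag: subclasses opt in to gradient checkpointing.
    _supports_gradient_checkpointing = True

    def __init__(self):
        self.down_blocks = [Block(), Block()]
        self.up_blocks = [Block()]

    def _set_gradient_checkpointing(self, value):
        # Flip the flag on every block that knows how to checkpoint.
        for block in self.down_blocks + self.up_blocks:
            block.gradient_checkpointing = value

    def enable_gradient_checkpointing(self):
        if not self._supports_gradient_checkpointing:
            raise ValueError(f"{type(self).__name__} does not support gradient checkpointing.")
        self._set_gradient_checkpointing(True)

    def disable_gradient_checkpointing(self):
        if self._supports_gradient_checkpointing:
            self._set_gradient_checkpointing(False)


model = ToyModel()
model.enable_gradient_checkpointing()
enabled = [b.gradient_checkpointing for b in model.down_blocks + model.up_blocks]
model.disable_gradient_checkpointing()
disabled = [b.gradient_checkpointing for b in model.down_blocks + model.up_blocks]
```

Keeping the flag on the blocks rather than passing it through `forward` matches the "remove the forward argument" step in the message.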

[flax] 'dtype' should not be part of self._internal_dict (#609) · 534512be
Mishig Davaadorj authored Sep 22, 2022

534512be
Make flax from_pretrained work with local subfolder (#608) · 4b8880a3
Mishig Davaadorj authored Sep 22, 2022

4b8880a3

21 Sep, 2022 9 commits
- Handle the PIL.Image.Resampling deprecation (#588) · dd350c8a
  Anton Lozhkov authored Sep 22, 2022
```
* Handle the PIL.Image.Resampling deprecation

* style
```
  dd350c8a
- docs: fix `Berkeley` ref (#611) · 80183ca5
  Ryan Russell authored Sep 21, 2022
```
Signed-off-by: Ryan Russell <git@ryanrussell.org>
Signed-off-by: Ryan Russell <git@ryanrussell.org>
```
  80183ca5
- [ONNX] Collate the external weights, speed up loading from the hub (#610) · 6bd005eb
  Anton Lozhkov authored Sep 21, 2022
  
  6bd005eb
- Return Flax scheduler state (#601) · a9fdb3de
  Pedro Cuenca authored Sep 21, 2022
```
* Optionally return state in from_config.

Useful for Flax schedulers.

* has_state is now a property, make check more strict.

I don't check that the class is a `SchedulerMixin`, to prevent circular
dependencies. It should be enough that the class name starts with "Flax",
the object declares that it "has_state", and "create_state" exists too.

* Use state in pipeline from_pretrained.

* Make style
```
  a9fdb3de
- Replace `dropout_prob` by `dropout` in `vae` (#595) · 3fc8ef72
  Younes Belkada authored Sep 21, 2022
```
replace `dropout_prob` by `dropout` in `vae`
```
  3fc8ef72
- Mv weights name consts to diffusers.utils (#605) · 86856993
  Mishig Davaadorj authored Sep 21, 2022
  
  86856993
- Fix flax from_pretrained pytorch weight check (#603) · f8100600
  Mishig Davaadorj authored Sep 21, 2022
  
  f8100600
- Allow dtype to be specified in Flax pipeline (#600) · fb2fbab1
  Pedro Cuenca authored Sep 21, 2022
```
* Fix typo in docstring.

* Allow dtype to be overridden on model load.

This may be a temporary solution until #567 is addressed.

* Create latents in float32

The denoising loop always computes the next step in float32, so this
would fail when using `bfloat16`.
```
  fb2fbab1
- Fix params replication when using the dummy checker (#602) · fb03aad8
  Pedro Cuenca authored Sep 21, 2022
```
Fix params replication when using the dummy checker.
```
  fb03aad8
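
The stricter `has_state` check described in "Return Flax scheduler state (#601)" above can be sketched in plain Python. This is an approximation built from the commit message alone: `should_return_state` and the toy classes are hypothetical, and the real check may differ in detail.

```python
def should_return_state(obj):
    """Return True only if all three conditions from the commit message hold."""
    return (
        type(obj).__name__.startswith("Flax")
        and getattr(obj, "has_state", False) is True
        and callable(getattr(obj, "create_state", None))
    )


class FlaxToyScheduler:  # hypothetical: meets all three conditions
    @property
    def has_state(self):
        return True

    def create_state(self):
        return {"timesteps": None}


class ToyScheduler:  # hypothetical: fails the class-name prefix test
    has_state = True

    def create_state(self):
        return {}


class FlaxNoState:  # hypothetical: "Flax" prefix but no create_state
    has_state = True


results = [
    should_return_state(FlaxToyScheduler()),
    should_return_state(ToyScheduler()),
    should_return_state(FlaxNoState()),
]
```

Checking the class name instead of using `isinstance` avoids importing the mixin, which is the circular-dependency concern the message raises.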
20 Sep, 2022 3 commits

[Flax] Fix unet and ddim scheduler (#594) · 2345481c
Patrick von Platen authored Sep 20, 2022
```
* [Flax] Fix unet and ddim scheduler

* correct

* finish
```
2345481c

FlaxDiffusionPipeline & FlaxStableDiffusionPipeline (#559) · d934d3d7

Mishig Davaadorj authored Sep 20, 2022



* WIP: flax FlaxDiffusionPipeline & FlaxStableDiffusionPipeline

* todo comment

* Fix imports

* Fix imports

* add dummies

* Fix empty init

* make pipeline work

* up

* Use Flax schedulers (typing, docstring)

* Wrap model imports inside availability checks.

* more updates

* make sure flax is not broken

* make style

* more fixes

* up
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@latenitesoft.com>

d934d3d7

[flax safety checker] Use `FlaxPreTrainedModel` for saving/loading (#591) · c6629e6f
Suraj Patil authored Sep 20, 2022
```
* use FlaxPreTrainedModel for flax safety module

* fix name

* fix one more

* Apply suggestions from code review
```
c6629e6f