Commits · 877bec8a916c0c2a7ff9d746db9068c5eb38ef61 · renzhc / diffusers_dcu

30 Sep, 2022 3 commits

refactor: update ldm-bert `config.json` url closes #675 (#680) · 877bec8a

Ryan Russell authored Sep 30, 2022



refactor: update ldm-bert `config.json` url
Signed-off-by: Ryan Russell <git@ryanrussell.org>
Signed-off-by: Ryan Russell <git@ryanrussell.org>

877bec8a

Allow resolutions that are not multiples of 64 (#505) · a784be2e

Josh Achiam authored Sep 30, 2022



* Allow resolutions that are not multiples of 64

* ran black

* fix bug

* add test

* more explanation

* more comments
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a784be2e

Optimize Stable Diffusion (#371) · 9ebaea54

Nouamane Tazi authored Sep 30, 2022

* initial commit

* make UNet stream capturable

* try to fix noise_pred value

* remove cuda graph and keep NB

* non blocking unet with PNDMScheduler

* make timesteps np arrays for pndm scheduler
because lists don't get formatted to tensors in `self.set_format`

* make max async in pndm

* use channel last format in unet

* avoid moving timesteps device in each unet call

* avoid memcpy op in `get_timestep_embedding`

* add `channels_last` kwarg to `DiffusionPipeline.from_pretrained`

* update TODO

* replace `channels_last` kwarg with `memory_format` for more generality

* revert the channels_last changes to leave it for another PR

* remove non_blocking when moving input ids to device

* remove blocking from all .to() operations at beginning of pipeline

* fix merging

* fix merging

* model can run in other precisions without autocast

* attn refactoring

* Revert "attn refactoring"

This reverts commit 0c70c0e189cd2c4d8768274c9fcf5b940ee310fb.

* remove restriction to run conv_norm in fp32

* use `baddbmm` instead of `matmul`for better in attention for better perf

* removing all reshapes to test perf

* Revert "removing all reshapes to test perf"

This reverts commit 006ccb8a8c6bc7eb7e512392e692a29d9b1553cd.

* add shapes comments

* hardcore whats needed for jitting

* Revert "hardcore whats needed for jitting"

This reverts commit 2fa9c698eae2890ac5f8e367ca80532ecf94df9a.

* Revert "remove restriction to run conv_norm in fp32"

This reverts commit cec592890c32da3d1b78d38b49e4307aedf459b9.

* revert using baddmm in attention's forward

* cleanup comment

* remove restriction to run conv_norm in fp32. no quality loss was noticed

This reverts commit cc9bc1339c998ebe9e7d733f910c6d72d9792213.

* add more optimizations techniques to docs

* Revert "add shapes comments"

This reverts commit 31c58eadb8892f95478cdf05229adf678678c5f4.

* apply suggestions

* make quality

* apply suggestions

* styling

* `scheduler.timesteps` are now arrays so we dont need .to()

* remove useless .type()

* use mean instead of max in `test_stable_diffusion_inpaint_pipeline_k_lms`

* move scheduler timestamps to correct device if tensors

* add device to `set_timesteps` in LMSD scheduler

* `self.scheduler.set_timesteps` now uses device arg for schedulers that accept it

* quick fix

* styling

* remove kwargs from schedulers `set_timesteps`

* revert to using max in K-LMS inpaint pipeline test

* Revert "`self.scheduler.set_timesteps` now uses device arg for schedulers that accept it"

This reverts commit 00d5a51e5c20d8d445c8664407ef29608106d899.

* move timesteps to correct device before loop in SD pipeline

* apply previous fix to other SD pipelines

* UNet now accepts tensor timesteps even on wrong device, to avoid errors
- it shouldnt affect performance if timesteps are alrdy on correct device
- it does slow down performance if they're on the wrong device

* fix pipeline when timesteps are arrays with strides

9ebaea54

29 Sep, 2022 6 commits
- Renamed x -> hidden_states in resnet.py (#676) · a7058f42
  Partho authored Sep 30, 2022
```
renamed x to hidden_states
```
  a7058f42
- `trained_betas` ignored in some schedulers (#635) · 3dacbb94
  V Vishnu Anirudh authored Sep 29, 2022
```
* correcting the beta value assignment

* updating DDIM and LMSDiscreteFlax schedulers

* bringing back the changes that were lost as part of main branch merge
```
  3dacbb94
- Flax `from_pretrained`: clean up `mismatched_keys`. (#630) · f10576ad
  Pedro Cuenca authored Sep 29, 2022
```
Flax from_pretrained: clean up `mismatched_keys`.

Originally removed in 73e0bc692c5761e55faff39c80a26d7a3cfc748c.
```
  f10576ad
- [gradient checkpointing] lower tolerance for test (#652) · 84b9df57
  Suraj Patil authored Sep 29, 2022
```
* lowe tolerance

* put model in eval mode
```
  84b9df57
- [examples] update transfomers version (#665) · 210be4fe
  Suraj Patil authored Sep 29, 2022
```
update transfomrers version in example
```
  210be4fe
- Update index.mdx (#670) · f5b9bc8b
  Tanishq Abraham authored Sep 29, 2022
  
  f5b9bc8b
28 Sep, 2022 3 commits
- [CLIPGuidedStableDiffusion] take the correct text embeddings (#667) · c16761e9
  Suraj Patil authored Sep 28, 2022
```
take the correct text embeddings
```
  c16761e9
- Added script to save during textual inversion training. Issue 524 (#645) · 7f31142c
  Isamu Isozaki authored Sep 29, 2022
```
* Added script to save during training

* Suggested changes
```
  7f31142c
- Fix the LMS pytorch regression (#664) · 765506ce
  Anton Lozhkov authored Sep 28, 2022
```
* Fix the LMS pytorch regression

* Copy over the changes from #637

* Copy over the changes from #637

* Fix betas test
```
  765506ce
27 Sep, 2022 16 commits

Fix `main`: stable diffusion pipelines cannot be loaded (#655) · 235770dd

Pedro Cuenca authored Sep 27, 2022

* Replace deprecation warning f-string with class name.

When `__repr__` is invoked in the instance serialization of
`config_dict` fails, because it contains `kwargs` of type `<class
inspect._empty>`.

* Revert "Replace deprecation warning f-string with class name."

This reverts commit 1c4eb8cb104374bd84e43865fc3865862473799c.

* Do not attempt to register `"kwargs"` as an attribute.

Otherwise serialization could fail.
This may happen for other attributes, so we should create a better
solution.

235770dd

Fix onnx tensor format (#654) · d8572f20
Anton Lozhkov authored Sep 27, 2022
```
fix np onnx
```
d8572f20
[CLIPGuidedStableDiffusion] remove set_format from pipeline (#653) · c0c98df9
Suraj Patil authored Sep 27, 2022
```
remove set_format from pipeline
```
c0c98df9
[Pytorch] add dep. warning for pytorch schedulers (#651) · 85494e88
Kashif Rasul authored Sep 27, 2022
```
* add dep. warning for schedulers

* fix format
```
85494e88
[DDIM, DDPM] fix add_noise (#648) · 33045382
Suraj Patil authored Sep 27, 2022
```
fix add noise
```
33045382
[dreambooth] update install section (#650) · e5eed523
Suraj Patil authored Sep 27, 2022
```
update install section
```
e5eed523
[examples/dreambooth] don't pass tensor_format to scheduler. (#649) · ac665b64
Suraj Patil authored Sep 27, 2022
```
don't pass tensor_format
```
ac665b64

[Pytorch] Pytorch only schedulers (#534) · bd8df2da

Kashif Rasul authored Sep 27, 2022



* pytorch only schedulers

* fix style

* remove match_shape

* pytorch only ddpm

* remove SchedulerMixin

* remove numpy from karras_ve

* fix types

* remove numpy from lms_discrete

* remove numpy from pndm

* fix typo

* remove mixin and numpy from sde_vp and ve

* remove remaining tensor_format

* fix style

* sigmas has to be torch tensor

* removed set_format in readme

* remove set format from docs

* remove set_format from pipelines

* update tests

* fix typo

* continue to use mixin

* fix imports

* removed unsed imports

* match shape instead of assuming image shapes

* remove import typo

* update call to add_noise

* use math instead of numpy

* fix t_index

* removed commented out numpy tests

* timesteps needs to be discrete

* cast timesteps to int in flax scheduler too

* fix device mismatch issue

* small fix

* Update src/diffusers/schedulers/scheduling_pndm.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

bd8df2da

Add training example for DreamBooth. (#554) · 3b747de8

Zhenhuan Liu authored Sep 27, 2022



* Add training example for DreamBooth.

* Fix bugs.

* Update readme and default hyperparameters.

* Reformatting code with black.

* Update for multi-gpu trianing.

* Apply suggestions from code review

* improgve sampling

* fix autocast

* improve sampling more

* fix saving

* actuallu fix saving

* fix saving

* improve dataset

* fix collate fun

* fix collate_fn

* fix collate fn

* fix key name

* fix dataset

* fix collate fn

* concat batch in collate fn

* add grad ckpt

* add option for 8bit adam

* do two forward passes for prior preservation

* Revert "do two forward passes for prior preservation"

This reverts commit 661ca4677e6dccc4ad596c2ee6ca4baad4159e95.

* add option for prior_loss_weight

* add option for clip grad norm

* add more comments

* update readme

* update readme

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* add docstr for dataset

* update the saving logic

* Update examples/dreambooth/README.md

* remove unused imports
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

3b747de8

Fix `SpatialTransformer` (#578) · d886e497

Yih-Dar authored Sep 27, 2022



* Fix SpatialTransformer

* Fix SpatialTransformer
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d886e497

Flax pipeline pndm (#583) · ab3fd671

Pedro Cuenca authored Sep 27, 2022



* WIP: flax FlaxDiffusionPipeline & FlaxStableDiffusionPipeline

* todo comment

* Fix imports

* Fix imports

* add dummies

* Fix empty init

* make pipeline work

* up

* Allow dtype to be overridden on model load.

This may be a temporary solution until #567 is addressed.

* Convert params to bfloat16 or fp16 after loading.

This deals with the weights, not the model.

* Use Flax schedulers (typing, docstring)

* PNDM: replace control flow with jax functions.

Otherwise jitting/parallelization don't work properly as they don't know
how to deal with traced objects.

I temporarily removed `step_prk`.

* Pass latents shape to scheduler set_timesteps()

PNDMScheduler uses it to reserve space, other schedulers will just
ignore it.

* Wrap model imports inside availability checks.

* Optionally return state in from_config.

Useful for Flax schedulers.

* Do not convert model weights to dtype.

* Re-enable PRK steps with functional implementation.

Values returned still not verified for correctness.

* Remove left over has_state var.

* make style

* Apply suggestion list -> tuple
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Apply suggestion list -> tuple
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Remove unused comments.

* Use zeros instead of empty.
Co-authored-by: Mishig Davaadorj <dmishig@gmail.com>
Co-authored-by: Mishig Davaadorj <mishig.davaadorj@coloradocollege.edu>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

ab3fd671

Remove inappropriate docstrings in LMS docstrings. (#634) · c070e5f0
Pedro Cuenca authored Sep 27, 2022

c070e5f0

refactor: `custom_init_isort` readability fixups (#631) · b6945310

Ryan Russell authored Sep 27, 2022


Signed-off-by: Ryan Russell <git@ryanrussell.org>
Signed-off-by: Ryan Russell <git@ryanrussell.org>

b6945310

Remove deprecated `torch_device` kwarg (#623) · b671cb09
Pedro Cuenca authored Sep 27, 2022
```
* Remove deprecated `torch_device` kwarg.

* Remove unused imports.
```
b671cb09
Fix docs link to train_unconditional.py (#642) · bb0c5d15
Abdullah Alfaraj authored Sep 27, 2022
```
the link points to an old location of the train_unconditional.py file
```
bb0c5d15

Warning for too long prompts in DiffusionPipelines (Resolve #447) (#472) · f7ebe569

Yuta Hayashibe authored Sep 27, 2022

* Return encoded texts by DiffusionPipelines

* Updated README to show hot to use enoded_text_input

* Reverted examples in README.md

* Reverted all

* Warning for long prompts

* Fix bugs

* Formatted

f7ebe569

24 Sep, 2022 3 commits
- [CI] Fix onnxruntime installation order (#633) · 57b70c59
  Anton Lozhkov authored Sep 24, 2022
  
  57b70c59
- Fix formula for noise levels in Karras scheduler and tests (#627) · 35e92096
  Grigory Sizov authored Sep 24, 2022
```
fix formula for noise levels in karras scheduler and tests
```
  35e92096
- docs: `src/diffusers` readability improvements (#629) · d0aa899f
  Ryan Russell authored Sep 24, 2022
```
* docs: `src/diffusers` readability improvements
Signed-off-by: Ryan Russell <git@ryanrussell.org>

* docs: `make style` lint
Signed-off-by: Ryan Russell <git@ryanrussell.org>
Signed-off-by: Ryan Russell <git@ryanrussell.org>
```
  d0aa899f
23 Sep, 2022 6 commits

Fix breaking error: "ort is not defined" (#626) · 1e152030
Pedro Cuenca authored Sep 23, 2022
```
Fix "ort is not defined" issue.
```
1e152030
Allow passing session_options for ORT backend (#620) · 8211b622
cloudhan authored Sep 23, 2022

8211b622

refactor: pipelines readability improvements (#622) · ce31f83d

Ryan Russell authored Sep 23, 2022



* refactor: pipelines readability improvements
Signed-off-by: Ryan Russell <git@ryanrussell.org>

* docs: remove todo comment from flax pipeline
Signed-off-by: Ryan Russell <git@ryanrussell.org>
Signed-off-by: Ryan Russell <git@ryanrussell.org>

ce31f83d

fix docs: change sample to images (#613) · b00382e2
Abdullah Alfaraj authored Sep 23, 2022
```
the result of running the pipeline is stored in StableDiffusionPipelineOutput.images
```
b00382e2

Flax documentation (#589) · 8b0be935

Younes Belkada authored Sep 23, 2022



* documenting `attention_flax.py` file

* documenting `embeddings_flax.py`

* documenting `unet_blocks_flax.py`

* Add new objs to doc page

* document `vae_flax.py`

* Apply suggestions from code review

* modify `unet_2d_condition_flax.py`

* make style

* Apply suggestions from code review

* make style

* Apply suggestions from code review

* fix indent

* fix typo

* fix indent unet

* Update src/diffusers/models/vae_flax.py

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Mishig Davaadorj <dmishig@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

8b0be935

docs: `.md` readability fixups (#619) · df80ccf7
Ryan Russell authored Sep 23, 2022
```
Signed-off-by: Ryan Russell <git@ryanrussell.org>
```
df80ccf7

22 Sep, 2022 3 commits

Adding pred_original_sample to SchedulerOutput for some samplers (#614) · 91db8189

Jonathan Whitaker authored Sep 22, 2022

* Adding pred_original_sample to SchedulerOutput of DDPMScheduler, DDIMScheduler, LMSDiscreteScheduler, KarrasVeScheduler step methods so we can access the predicted denoised outputs

* Gave DDPMScheduler, DDIMScheduler and LMSDiscreteScheduler their own output dataclasses so the default SchedulerOutput in scheduling_utils does not need pred_original_sample as an optional extra

* Reordered library imports to follow standard

* didnt get import order quite right apparently

* Forgot to change name of LMSDiscreteSchedulerOutput

* Aha, needed some extra libs for make style to fully work

91db8189

docs: fix `stochastic_karras_ve` ref (#618) · f149d037

Ryan Russell authored Sep 22, 2022


Signed-off-by: Ryan Russell <git@ryanrussell.org>
Signed-off-by: Ryan Russell <git@ryanrussell.org>

f149d037

[UNet2DConditionModel] add gradient checkpointing (#461) · e7120bae

Suraj Patil authored Sep 22, 2022

* add grad ckpt to downsample blocks

* make it work

* don't pass gradient_checkpointing to upsample block

* add tests for UNet2DConditionModel

* add test_gradient_checkpointing

* add gradient_checkpointing for up and down blocks

* add functions to enable and disable grad ckpt

* remove the forward argument

* better naming

* make supports_gradient_checkpointing private

e7120bae