Commits · 5afc2b60cd1c555c33e2df7caa93372b5c21a3e8 · renzhc / diffusers_dcu

12 Oct, 2022 1 commit
- add or fix license formatting in models directory (#808) · 5afc2b60
  Nathan Lambert authored Oct 12, 2022
```
* add or fix license formatting

* fix quality
```
  5afc2b60
30 Sep, 2022 2 commits

Nouamane Tazi authored Sep 30, 2022

* revert using baddbmm in attention
- to fix `test_stable_diffusion_memory_chunking` test

* styling

b2cfc7a0

Optimize Stable Diffusion (#371) · 9ebaea54

Nouamane Tazi authored Sep 30, 2022

* initial commit

* make UNet stream capturable

* try to fix noise_pred value

* remove cuda graph and keep NB

* non blocking unet with PNDMScheduler

* make timesteps np arrays for pndm scheduler
because lists don't get formatted to tensors in `self.set_format`

* make max async in pndm

* use channel last format in unet

* avoid moving timesteps device in each unet call

* avoid memcpy op in `get_timestep_embedding`

* add `channels_last` kwarg to `DiffusionPipeline.from_pretrained`

* update TODO

* replace `channels_last` kwarg with `memory_format` for more generality

* revert the channels_last changes to leave it for another PR

* remove non_blocking when moving input ids to device

* remove blocking from all .to() operations at beginning of pipeline

* fix merging

* fix merging

* model can run in other precisions without autocast

* attn refactoring

* Revert "attn refactoring"

This reverts commit 0c70c0e189cd2c4d8768274c9fcf5b940ee310fb.

* remove restriction to run conv_norm in fp32

* use `baddbmm` instead of `matmul`for better in attention for better perf

* removing all reshapes to test perf

* Revert "removing all reshapes to test perf"

This reverts commit 006ccb8a8c6bc7eb7e512392e692a29d9b1553cd.

* add shapes comments

* hardcore whats needed for jitting

* Revert "hardcore whats needed for jitting"

This reverts commit 2fa9c698eae2890ac5f8e367ca80532ecf94df9a.

* Revert "remove restriction to run conv_norm in fp32"

This reverts commit cec592890c32da3d1b78d38b49e4307aedf459b9.

* revert using baddmm in attention's forward

* cleanup comment

* remove restriction to run conv_norm in fp32. no quality loss was noticed

This reverts commit cc9bc1339c998ebe9e7d733f910c6d72d9792213.

* add more optimizations techniques to docs

* Revert "add shapes comments"

This reverts commit 31c58eadb8892f95478cdf05229adf678678c5f4.

* apply suggestions

* make quality

* apply suggestions

* styling

* `scheduler.timesteps` are now arrays so we dont need .to()

* remove useless .type()

* use mean instead of max in `test_stable_diffusion_inpaint_pipeline_k_lms`

* move scheduler timestamps to correct device if tensors

* add device to `set_timesteps` in LMSD scheduler

* `self.scheduler.set_timesteps` now uses device arg for schedulers that accept it

* quick fix

* styling

* remove kwargs from schedulers `set_timesteps`

* revert to using max in K-LMS inpaint pipeline test

* Revert "`self.scheduler.set_timesteps` now uses device arg for schedulers that accept it"

This reverts commit 00d5a51e5c20d8d445c8664407ef29608106d899.

* move timesteps to correct device before loop in SD pipeline

* apply previous fix to other SD pipelines

* UNet now accepts tensor timesteps even on wrong device, to avoid errors
- it shouldnt affect performance if timesteps are alrdy on correct device
- it does slow down performance if they're on the wrong device

* fix pipeline when timesteps are arrays with strides

9ebaea54

27 Sep, 2022 1 commit

Fix `SpatialTransformer` (#578) · d886e497

Yih-Dar authored Sep 27, 2022



* Fix SpatialTransformer

* Fix SpatialTransformer
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

d886e497

19 Sep, 2022 3 commits
- Fix `CrossAttention._sliced_attention` (#563) · 84616b5d
  Yih-Dar authored Sep 19, 2022
```
* Fix CrossAttention._sliced_attention
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>
```
  84616b5d
- revert the accidental commit · 0424615a
  ydshieh authored Sep 19, 2022
  
  0424615a
- Fix CrossAttention._sliced_attention · 8187865a
  ydshieh authored Sep 19, 2022
  
  8187865a
15 Sep, 2022 1 commit

[UNet2DConditionModel, UNet2DModel] pass norm_num_groups to all the blocks (#442) · d144c46a

Suraj Patil authored Sep 15, 2022

* pass norm_num_groups to unet blocs and attention

* fix UNet2DConditionModel

* add norm_num_groups arg in vae

* add tests

* remove comment

* Apply suggestions from code review

d144c46a

14 Sep, 2022 1 commit

[CrossAttention] add different method for sliced attention (#446) · 8b450969

Suraj Patil authored Sep 14, 2022



* add different method for sliced attention

* Update src/diffusers/models/attention.py

* Apply suggestions from code review

* Update src/diffusers/models/attention.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

8b450969

09 Sep, 2022 2 commits

Renamed variables from single letter to better naming (#449) · 58434879

Partho authored Sep 09, 2022

* renamed variable names

q -> query
k -> key
v -> value
b -> batch
c -> channel
h -> height
w -> weight

* rename variable names

missed some in the initial commit

* renamed more variable names

As per  code review suggestions, renamed x -> hidden_states and x_in -> residual

* fixed minor typo

58434879

use torch.matmul instead of einsum in attnetion. (#445) · 5adb0a7b
Suraj Patil authored Sep 09, 2022
```
* use torch.matmul instead of einsum

* fix softmax
```
5adb0a7b

08 Sep, 2022 2 commits

[Docs] Models (#416) · 5e6417e9

Kashif Rasul authored Sep 08, 2022



* docs for attention

* types for embeddings

* unet2d docstrings

* UNet2DConditionModel docstrings

* fix typos

* style and vq-vae docstrings

* docstrings  for VAE

* Update src/diffusers/models/unet_2d.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* make style

* added inherits from sentence

* docstring to forward

* make style

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* finish model docs

* up
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

5e6417e9

Inference support for `mps` device (#355) · 5dda1735

Pedro Cuenca authored Sep 08, 2022

* Initial support for mps in Stable Diffusion pipeline.

* Initial "warmup" implementation when using mps.

* Make some deterministic tests pass with mps.

* Disable training tests when using mps.

* SD: generate latents in CPU then move to device.

This is especially important when using the mps device, because
generators are not supported there. See for example
https://github.com/pytorch/pytorch/issues/84288.

In addition, the other pipelines seem to use the same approach: generate
the random samples then move to the appropriate device.

After this change, generating an image in MPS produces the same result
as when using the CPU, if the same seed is used.

* Remove prints.

* Pass AutoencoderKL test_output_pretrained with mps.

Sampling from `posterior` must be done in CPU.

* Style

* Do not use torch.long for log op in mps device.

* Perform incompatible padding ops in CPU.

UNet tests now pass.
See https://github.com/pytorch/pytorch/issues/84535



* Style: fix import order.

* Remove unused symbols.

* Remove MPSWarmupMixin, do not apply automatically.

We do apply warmup in the tests, but not during normal use.
This adopts some PR suggestions by @patrickvonplaten.

* Add comment for mps fallback to CPU step.

* Add README_mps.md for mps installation and use.

* Apply `black` to modified files.

* Restrict README_mps to SD, show measures in table.

* Make PNDM indexing compatible with mps.

Addresses #239.

* Do not use float64 when using LDMScheduler.

Fixes #358.

* Fix typo identified by @patil-suraj
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Adapt example to new output style.

* Restore 1:1 results reproducibility with CompVis.

However, mps latents need to be generated in CPU because generators
don't work in the mps device.

* Move PyTorch nightly to requirements.

* Adapt `test_scheduler_outputs_equivalence` ton MPS.

* mps: skip training tests instead of ignoring silently.

* Make VQModel tests pass on mps.

* mps ddim tests: warmup, increase tolerance.

* ScoreSdeVeScheduler indexing made mps compatible.

* Make ldm pipeline tests pass using warmup.

* Style

* Simplify casting as suggested in PR.

* Add Known Issues to readme.

* `isort` import order.

* Remove _mps_warmup helpers from ModelMixin.

And just make changes to the tests.

* Skip tests using unittest decorator for consistency.

* Remove temporary var.

* Remove spurious blank space.

* Remove unused symbol.

* Remove README_mps.
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

5dda1735

06 Sep, 2022 1 commit

Efficient Attention (#366) · 5c4ea00d

Patrick von Platen authored Sep 06, 2022



* up

* add tests

* correct

* up

* finish

* better naming

* Update README.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

5c4ea00d

25 Aug, 2022 1 commit
- [Clean up] Clean unused code (#245) · c1efda70
  Patrick von Platen authored Aug 25, 2022
```
* CleanResNet

* refactor more

* correct
```
  c1efda70
20 Jul, 2022 1 commit

Big Model Renaming (#109) · 9c3820d0

Patrick von Platen authored Jul 21, 2022

* up

* change model name

* renaming

* more changes

* up

* up

* up

* save checkpoint

* finish api / naming

* finish config renaming

* rename all weights

* finish really

9c3820d0

19 Jul, 2022 1 commit
- Finalize ldm (#96) · d5acb411
  Patrick von Platen authored Jul 19, 2022
```
* upload

* make checkpoint work

* finalize
```
  d5acb411
18 Jul, 2022 1 commit

[SDE] Merge to unconditional model (#89) · ba3c9a9a

Patrick von Platen authored Jul 18, 2022

* up

* more

* uP

* make dummy test pass

* save intermediate

* p

* p

* finish

* finish

* finish

ba3c9a9a

14 Jul, 2022 2 commits
- [DDPM] Make DDPM work (#88) · 6d5ef87e
  Patrick von Platen authored Jul 14, 2022
```
* up

* finish

* uP
```
  6d5ef87e
- save intermediate (#87) · e7fe901e
  Patrick von Platen authored Jul 14, 2022
```
* save intermediate

* up

* up
```
  e7fe901e
12 Jul, 2022 1 commit
- Add unconditional image generation (#79) · 06c79730
  Patrick von Platen authored Jul 12, 2022
```
* uP

* finish downsampling layers

* finish major refactor

* remove bugus file
```
  06c79730
05 Jul, 2022 1 commit
- [MidBlock] Fix mid block (#78) · ea8d58ea
  Patrick von Platen authored Jul 05, 2022
```
* upload files

* finish
```
  ea8d58ea
04 Jul, 2022 3 commits
- Fix attention for Glide (#75) · 10798663
  Anton Lozhkov authored Jul 04, 2022
  
  10798663
- Fix mutable proj_out weight in the Attention layer (#73) · d9316bf8
  Anton Lozhkov authored Jul 04, 2022
```
* Catch unused params in DDP

* Fix proj_out, add test
```
  d9316bf8
- update mid block (#70) · 94566e6d
  Patrick von Platen authored Jul 04, 2022
```
* update mid block

* finish mid block
```
  94566e6d
28 Jun, 2022 7 commits
- some clean up · c482d7bd
  Patrick von Platen authored Jun 28, 2022
  
  c482d7bd
- final fix · 31d1f3c8
  Patrick von Platen authored Jun 28, 2022
  
  31d1f3c8
- one attention module only · 635da723
  Patrick von Platen authored Jun 28, 2022
  
  635da723
- merge unet attention into glide attention · c45fd749
  Patrick von Platen authored Jun 28, 2022
  
  c45fd749
- refactor unet's attention · 9dccc7dc
  Patrick von Platen authored Jun 28, 2022
  
  9dccc7dc
- unify ldm and glide attention · 52b3ff5e
  Patrick von Platen authored Jun 28, 2022
  
  52b3ff5e
- all attentions collected · fff981df
  Patrick von Platen authored Jun 28, 2022
  
  fff981df
27 Jun, 2022 1 commit
- add layers · f6e8c8c0
  Patrick von Platen authored Jun 27, 2022
  
  f6e8c8c0
07 Jun, 2022 1 commit
- fix issues with loading, add test for pipeline · d8287fcd
  patil-suraj authored Jun 07, 2022
  
  d8287fcd
06 Jun, 2022 1 commit
- up · 6ab2dd18
  Patrick von Platen authored Jun 06, 2022
  
  6ab2dd18
01 Jun, 2022 2 commits
- more examples · c7ba6ba2
  Patrick von Platen authored Jun 02, 2022
  
  c7ba6ba2
- add examples · f15f0cd2
  Patrick von Platen authored Jun 02, 2022
  
  f15f0cd2