1. 02 Nov, 2022 1 commit
    • 
      Up to 2x speedup on GPUs using memory efficient attention (#532) · 98c42134
      MatthieuTPHR authored
      
      
      * 2x speedup using memory efficient attention
      
      * remove einops dependency
      
      * Swap K, M in op instantiation
      
      * Simplify code, remove unnecessary maybe_init call and function, remove unused self.scale parameter
      
      * make xformers a soft dependency
      
      * remove one-liner functions
      
      * change one-letter variables to appropriate names
      
      * Remove env variable dependency, remove MemoryEfficientCrossAttention class and use the enable_xformers_memory_efficient_attention method (see the usage sketch after this commit's notes)
      
      * Add memory efficient attention toggle to img2img and inpaint pipelines
      
      * Clearer management of xformers' availability
      
      * update optimizations markdown to add info about memory efficient attention
      
      * add benchmarks for TITAN RTX
      
      * More detailed explanation of how the memory-efficient attention benchmarks were run
      
      * Removing autocast from optimization markdown
      
      * import_utils: import torch only if is available
      Co-authored-by: default avatarNouamane Tazi <nouamane98@gmail.com>
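      The toggle described in the notes above is the pipeline method enable_xformers_memory_efficient_attention(). A minimal usage sketch (illustrative, not taken from the commit; the checkpoint name is just an example and the xformers package must be installed):

          import torch
          from diffusers import StableDiffusionPipeline

          # Example checkpoint; any Stable Diffusion model id works here.
          pipe = StableDiffusionPipeline.from_pretrained(
              "CompVis/stable-diffusion-v1-4",
              torch_dtype=torch.float16,
          ).to("cuda")

          # Toggle referenced in this PR: route attention through xformers'
          # memory-efficient kernels (requires xformers to be installed).
          pipe.enable_xformers_memory_efficient_attention()

          image = pipe("an astronaut riding a horse").images[0]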
  2. 31 Oct, 2022 7 commits
  3. 30 Oct, 2022 1 commit
  4. 29 Oct, 2022 2 commits
    • 
      Experimental: allow fp16 in `mps` (#961) · 95414bd6
      Pedro Cuenca authored
      * Docs: refer to pre-RC version of PyTorch 1.13.0.
      
      * Remove temporary workaround for unavailable op.
      
      * Update comment to make it less ambiguous.
      
      * Remove use of contiguous in mps.
      
      It appears to no longer be necessary.
      
      * Special case: use einsum for much better performance in mps
      
      * Update mps docs.
      
      * MPS: make the pipeline work in half precision (see the usage sketch after this commit's notes)
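      As described in the notes above, the pipeline can now run in half precision on mps. A minimal sketch (illustrative, not from the commit; assumes PyTorch 1.13+ with MPS support on an Apple Silicon machine, and the checkpoint name is just an example):

          import torch
          from diffusers import StableDiffusionPipeline

          # torch_dtype=torch.float16 selects half precision for the weights.
          pipe = StableDiffusionPipeline.from_pretrained(
              "CompVis/stable-diffusion-v1-4",
              torch_dtype=torch.float16,
          )
          pipe = pipe.to("mps")

          # The first call on MPS is slow while kernels warm up; a short
          # warm-up pass is a common workaround.
          _ = pipe("warm-up prompt", num_inference_steps=1)

          image = pipe("a watercolor painting of a lighthouse").images[0]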
    • 
      clean incomplete pages (#1008) · 12fd0736
      Nathan Lambert authored
  5. 28 Oct, 2022 5 commits
  6. 27 Oct, 2022 5 commits
  7. 26 Oct, 2022 4 commits
  8. 25 Oct, 2022 7 commits
  9. 24 Oct, 2022 4 commits
  10. 21 Oct, 2022 3 commits
  11. 20 Oct, 2022 1 commit