Commits · 1172c9634b4a32d6e82301e3d59ce17005e13e85 · renzhc / diffusers_dcu

04 Nov, 2022 1 commit

add enable sequential cpu offloading to other stable diffusion pipelines (#1085) · 1172c963

Pi Esposito authored Nov 04, 2022



* add enable sequential cpu offloading to other stable diffusion pipelines

* trigger ci

* fix styling

* interpolate before converting to device to avoid breking when cpu_offload is enabled with fp16
Co-authored-by: Pedro Gengo  <pedro.gabriel.lourenco@hotmail.com>

* style again I need to stop forgething this thing

* fix inpainting bug that could cause device misalignment
Co-authored-by: Pedro Gengo  <pedro.gabriel.lourenco@hotmail.com>

* Apply suggestions from code review
Co-authored-by: Pedro Gengo  <pedro.gabriel.lourenco@hotmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

1172c963

03 Nov, 2022 1 commit

Continuation of #1035 (#1120) · 269109db

Pedro Cuenca authored Nov 03, 2022



* remove batch size from repeat

* repeat empty string if uncond_tokens is none

* fix inpaint pipes

* return back whitespace to pass code quality

* Apply suggestions from code review

* Fix typos.
Co-authored-by: Had <had-95@yandex.ru>

269109db

02 Nov, 2022 1 commit

Up to 2x speedup on GPUs using memory efficient attention (#532) · 98c42134

MatthieuTPHR authored Nov 02, 2022



* 2x speedup using memory efficient attention

* remove einops dependency

* Swap K, M in op instantiation

* Simplify code, remove unnecessary maybe_init call and function, remove unused self.scale parameter

* make xformers a soft dependency

* remove one-liner functions

* change one letter variable to appropriate names

* Remove Env variable dependency, remove MemoryEfficientCrossAttention class and use enable_xformers_memory_efficient_attention method

* Add memory efficient attention toggle to img2img and inpaint pipelines

* Clearer management of xformers' availability

* update optimizations markdown to add info about memory efficient attention

* add benchmarks for TITAN RTX

* More detailed explanation of how the mem eff benchmark were ran

* Removing autocast from optimization markdown

* import_utils: import torch only if is available
Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>

98c42134

31 Oct, 2022 4 commits

[Tests] Fix slow tests (#1087) · 17c2c060
Patrick von Platen authored Oct 31, 2022

17c2c060

[Better scheduler docs] Improve usage examples of schedulers (#890) · c18941b0

Patrick von Platen authored Oct 31, 2022



* [Better scheduler docs] Improve usage examples of schedulers

* finish

* fix warnings and add test

* finish

* more replacements

* adapt fast tests hf token

* correct more

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Integrate compatibility with euler
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

c18941b0

k-diffusion-euler (#1019) · a1ea8c01

hlky authored Oct 31, 2022



* k-diffusion-euler

* make style make quality

* make fix-copies

* fix tests for euler a

* Update src/diffusers/schedulers/scheduling_euler_ancestral_discrete.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update src/diffusers/schedulers/scheduling_euler_ancestral_discrete.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update src/diffusers/schedulers/scheduling_euler_discrete.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update src/diffusers/schedulers/scheduling_euler_discrete.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* remove unused arg and method

* update doc

* quality

* make flake happy

* use logger instead of warn

* raise error instead of deprication

* don't require scipy

* pass generator in step

* fix tests

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/test_scheduler.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* remove unused generator

* pass generator as extra_step_kwargs

* update tests

* pass generator as kwarg

* pass generator as kwarg

* quality

* fix test for lms

* fix tests
Co-authored-by: patil-suraj <surajp815@gmail.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

a1ea8c01

Allow `safety_checker` to be `None` when using CPU offload (#1078) · bf7b0bc2
Pedro Cuenca authored Oct 31, 2022
```
Allow None safety_checker when using CPU offload.
```
bf7b0bc2

30 Oct, 2022 1 commit

Move safety detection to model call in Flax safety checker (#1023) · 8e4fd686

Jonatan Kłosko authored Oct 30, 2022

* Move safety detection to model call in Flax safety checker

* Update src/diffusers/pipelines/stable_diffusion/safety_checker_flax.py

8e4fd686

28 Oct, 2022 1 commit

Fix some failing tests (#1041) · 8d6487f3

Patrick von Platen authored Oct 28, 2022

* up

* up

* up

* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py

* Apply suggestions from code review

8d6487f3

27 Oct, 2022 2 commits

Document sequential CPU offload method on Stable Diffusion pipeline (#1024) · de00c632

Pi Esposito authored Oct 27, 2022



* document cpu offloading method

* address review comments
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

de00c632

[Accelerate model loading] Fix meta device and super low memory usage (#1016) · 3be9fa97
Patrick von Platen authored Oct 27, 2022
```
* [Accelerate model loading] Fix meta device and super low memory usage

* better naming
```
3be9fa97

26 Oct, 2022 2 commits

[inpaint pipeline] fix bug for multiple prompts inputs (#959) · bd06dd02
Hu Ye authored Oct 26, 2022

bd06dd02

minimal stable diffusion GPU memory usage with accelerate hooks (#850) · b2e2d141

Pi Esposito authored Oct 26, 2022

* add method to enable cuda with minimal gpu usage to stable diffusion

* add test to minimal cuda memory usage

* ensure all models but unet are onn torch.float32

* move to cpu_offload along with minor internal changes to make it work

* make it test against accelerate master branch

* coming back, its official: I don't know how to make it test againt the master branch from accelerate

* make it install accelerate from master on tests

* go back to accelerate>=0.11

* undo prettier formatting on yml files

* undo prettier formatting on yml files againn

b2e2d141

25 Oct, 2022 2 commits

[Onnx] support half-precision and fix bugs for onnx pipelines (#932) · 0b42b074

SkyTNT authored Oct 25, 2022

* [Onnx] support half-precision and fix bugs for onnx pipelines

* Update convert_stable_diffusion_checkpoint_to_onnx.py

* style

* fix has_nsfw_concept

* Update convert_stable_diffusion_checkpoint_to_onnx.py

* fix style

0b42b074

mps changes for PyTorch 1.13 (#926) · 3d02c921

Pedro Cuenca authored Oct 25, 2022



* Docs: refer to pre-RC version of PyTorch 1.13.0.

* Remove temporary workaround for unavailable op.

* Update comment to make it less ambiguous.

* Remove use of contiguous in mps.

It appears to not longer be necessary.

* Special case: use einsum for much better performance in mps

* Update mps docs.

* Minor doc update.

* Accept suggestion
Co-authored-by: Anton Lozhkov <anton@huggingface.co>
Co-authored-by: Anton Lozhkov <anton@huggingface.co>

3d02c921

24 Oct, 2022 1 commit

v1-5 docs updates (#921) · 8aac1f99

apolinario authored Oct 24, 2022



* Update README.md

Additionally add FLAX so the model card can be slimmer and point to this page

* Find and replace all

* v-1-5 -> v1-5

* revert test changes

* Update README.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update docs/source/quicktour.mdx
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update README.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update docs/source/quicktour.mdx
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update README.md
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Revert certain references to v1-5

* Docs changes

* Apply suggestions from code review
Co-authored-by: apolinario <joaopaulo.passos+multimodal@gmail.com>
Co-authored-by: anton-l <anton@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

8aac1f99

19 Oct, 2022 4 commits

ONNX supervised inpainting (#906) · 89d12494

Anton Lozhkov authored Oct 19, 2022

* ONNX supervised inpainting

* sync with the torch pipeline

* fix concat

* update ref values

* back to 8 steps

* type fix

* make fix-copies

89d12494

Stable diffusion inpainting. (#904) · b35d88c5

Suraj Patil authored Oct 19, 2022



* begin pipe

* add new pipeline

* add tests

* correct fast test

* up

* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_inpaint.py

* Update tests/test_pipelines.py

* up

* up

* make style

* add fp16 test

* doc, comments

* up
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Anton Lozhkov <anton@huggingface.co>

b35d88c5

[Stable Diffusion Inpainting] Deprecate inpainting pipeline in favor of official one (#903) · 6ea83608

Patrick von Platen authored Oct 19, 2022



* finish

* up

* Apply suggestions from code review
Co-authored-by: Anton Lozhkov <anton@huggingface.co>

* Update src/diffusers/pipeline_utils.py

* Finish
Co-authored-by: Anton Lozhkov <anton@huggingface.co>

6ea83608

Improve ONNX img2img numpy handling, temporarily fix the tests (#899) · 8eb9d970
Anton Lozhkov authored Oct 19, 2022
```
* [WIP] Onnx img2img determinism

* more numpy + seed

* numpy inpainting, tolerance

* revert test workflow
```
8eb9d970

18 Oct, 2022 2 commits

Stable Diffusion image-to-image and inpaint using onnx. (#552) · a9908ecf

Žilvinas Ledas authored Oct 18, 2022



* * Stabe Diffusion img2img using onnx.

* * Stabe Diffusion inpaint using onnx.

* Export vae_encoder, upgrade img2img, add test

* updated inpainting pipeline + test

* style
Co-authored-by: anton-l <anton@huggingface.co>

a9908ecf

Rename StableDiffusionOnnxPipeline -> OnnxStableDiffusionPipeline (#887) · 728a3f3e
Anton Lozhkov authored Oct 18, 2022
```
Rename and deprecate
```
728a3f3e

14 Oct, 2022 1 commit
- Fix Flax pipeline: width and height are ignored #838 (#848) · 93a81a3f
  camenduru authored Oct 14, 2022
```
* Fix Flax pipeline: width and height are ignored #838

* Fix Flax pipeline: width and height are ignored
```
  93a81a3f
13 Oct, 2022 7 commits

[FlaxStableDiffusionPipeline] fix bug when nsfw is detected (#832) · effe9d66
Suraj Patil authored Oct 13, 2022
```
fix nsfw bug
```
effe9d66
Align PT and Flax API - allow loading checkpoint from PyTorch configs (#827) · 7c226264
Patrick von Platen authored Oct 13, 2022
```
* up

* finish

* add more tests

* up

* up

* finish
```
7c226264

Flax safety checker (#825) · 78db11db

Pedro Cuenca authored Oct 13, 2022



* Remove set_format in Flax pipeline.

* Remove DummyChecker.

* Run safety_checker in pipeline.

* Don't pmap on every call.

We could have decorated `generate` with `pmap`, but I wanted to keep it
in case someone wants to invoke it in non-parallel mode.

* Remove commented line
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Replicate outside __call__, prepare for optional jitting.

* Remove unnecessary clipping.

As suggested by @kashif.

* Do not jit unless requested.

* Send all args to generate.

* make style

* Remove unused imports.

* Fix docstring.
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

78db11db

Give more customizable options for safety checker (#815) · e713346a

Patrick von Platen authored Oct 13, 2022



* Give more customizable options for safety checker

* Apply suggestions from code review

* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py

* Finish

* make style

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* up
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

e713346a

Fix type mismatch error, add tests for negative prompts (#823) · 26c7df5d
Anton Lozhkov authored Oct 13, 2022

26c7df5d

update flax scheduler API (#822) · 0a09af2f

Suraj Patil authored Oct 13, 2022

* update flax scheduler API

* remoev set format

* fix call to scale_model_input

* update flax pndm

* use int32

* update docstr

0a09af2f

[Flax] Add test (#824) · f1d4289b
Patrick von Platen authored Oct 13, 2022

f1d4289b

12 Oct, 2022 1 commit

[Img2Img] Fix batch size mismatch prompts vs. init images (#793) · 6bc11782

Patrick von Platen authored Oct 12, 2022



* [Img2Img] Fix batch size mismatch prompts vs. init images

* Remove bogus folder

* fix

* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_img2img.py
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

6bc11782

11 Oct, 2022 2 commits

`mps`: Alternative implementation for `repeat_interleave` (#766) · 24b8b5cf

Pedro Cuenca authored Oct 11, 2022



* mps: alt. implementation for repeat_interleave

* style

* Bump mps version of PyTorch in the documentation.

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>

* Simplify: do not check for device.

* style

* Fix repeat dimensions:

- The unconditional embeddings are always created from a single prompt.
- I was shadowing the batch_size var.

* Split long lines as suggested by Suraj.
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Suraj Patil <surajp815@gmail.com>

24b8b5cf

support bf16 for stable diffusion (#792) · 797b290e
Suraj Patil authored Oct 11, 2022
```
* support bf16 for stable diffusion

* fix typo

* address review comments
```
797b290e

10 Oct, 2022 1 commit

[Low CPU memory] + device map (#772) · fab17528

Patrick von Platen authored Oct 10, 2022



* add accelerate to load models with smaller memory footprint

* remove low_cpu_mem_usage as it is reduntant

* move accelerate init weights context to modelling utils

* add test to ensure results are the same when loading with accelerate

* add tests to ensure ram usage gets lower when using accelerate

* move accelerate logic to single snippet under modelling utils and remove it from configuration utils

* format code using to pass quality check

* fix imports with isor

* add accelerate to test extra deps

* only import accelerate if device_map is set to auto

* move accelerate availability check to diffusers import utils

* format code

* add device map to pipeline abstraction

* lint it to pass PR quality check

* fix class check to use accelerate when using diffusers ModelMixin subclasses

* use low_cpu_mem_usage in transformers if device_map is not available

* NoModuleLayer

* comment out tests

* up

* uP

* finish

* Update src/diffusers/pipelines/stable_diffusion/safety_checker.py

* finish

* uP

* make style
Co-authored-by: Pi Esposito <piero.skywalker@gmail.com>

fab17528

07 Oct, 2022 1 commit

[img2img, inpainting] fix fp16 inference (#769) · 92d70863

Suraj Patil authored Oct 07, 2022

* handle dtype in vae and image2image pipeline

* fix inpaint in fp16

* dtype should be handled in add_noise

* style

* address review comments

* add simple fast tests to check fp16

* fix test name

* put mask in fp16

92d70863

06 Oct, 2022 3 commits
- Revert "[v0.4.0] Temporarily remove Flax modules from the public API (#755)" · 970e3060
  anton-l authored Oct 06, 2022
```
This reverts commit 2e209c30.
```
  970e3060
- [v0.4.0] Temporarily remove Flax modules from the public API (#755) · 2e209c30
  Anton Lozhkov authored Oct 06, 2022
```
Temporarily remove Flax modules from the public API
```
  2e209c30
- allow multiple generations per prompt (#741) · c119dc4c
  Suraj Patil authored Oct 06, 2022
```
* compute text embeds per prompt

* don't repeat uncond prompts

* repeat separatly

* update image2image

* fix repeat uncond embeds

* adapt inpaint pipeline

* ifx uncond tokens in img2img

* add tests and fix ucond embeds in im2img and inpaint pipe
```
  c119dc4c
05 Oct, 2022 2 commits
- remove use_auth_token from remaining places (#737) · 19e559d5
  Suraj Patil authored Oct 05, 2022
```
remove use_auth_token
```
  19e559d5
- No more use_auth_token=True (#733) · 78744b6a
  Patrick von Platen authored Oct 05, 2022
```
* up

* uP

* uP

* make style

* Apply suggestions from code review

* up

* finish
```
  78744b6a