Commits · 7482178162b779506a54538f2cf2565c8b88c597 · renzhc / diffusers_dcu

03 Nov, 2022 5 commits

(#1115) · 74821781

Suraj Patil authored Nov 03, 2022



* make accelerate hard dep

* default fast init

* move params to cpu when device map is None

* handle device_map=None

* handle torch < 1.9

* remove device_map="auto"

* style

* add accelerate in torch extra

* remove accelerate from extras["test"]

* raise an error if torch is available but not accelerate

* update installation docs

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* improve default loading speed even further, allow disabling fast loading

* address review comments

* adapt the tests

* fix test_stable_diffusion_fast_load

* fix test_read_init

* temp fix for dummy checks

* Trigger Build

* Apply suggestions from code review
Co-authored-by: Anton Lozhkov <anton@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
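
The "raise an error if torch is available but not accelerate" item above can be pictured with a small sketch. The helper names `check_backend_requirements` and `is_installed` are hypothetical and are not taken from the commit. Only the standard-library `importlib.util.find_spec` call is real.

```python
import importlib.util


def is_installed(package: str) -> bool:
    # find_spec returns None when a top-level package cannot be located.
    return importlib.util.find_spec(package) is not None


def check_backend_requirements(torch_available: bool, accelerate_available: bool) -> None:
    """Hypothetical guard: fail early if torch is present but accelerate is not."""
    if torch_available and not accelerate_available:
        raise ImportError(
            "torch is installed but accelerate is missing; "
            "install it with `pip install accelerate`."
        )


# Neither backend installed: nothing to complain about.
check_backend_requirements(torch_available=False, accelerate_available=False)
# Both installed: also fine.
check_backend_requirements(torch_available=True, accelerate_available=True)
```

In a real setup the two booleans would presumably come from calls such as `is_installed("torch")` and `is_installed("accelerate")`.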


VQ-diffusion (#658) · ef2ea33c

Will Berman authored Nov 03, 2022



* Changes for VQ-diffusion VQVAE

Allow specifying the embedding dimension in VQModel:
`VQModel` sets the embedding dimension to the number of latent
channels by default. The VQ-diffusion VQVAE uses a smaller
embedding dimension (128) than its number of latent channels (256).

Add AttnDownEncoderBlock2D and AttnUpDecoderBlock2D to the up and down
unet block helpers. VQ-diffusion's VQVAE uses those two block types.
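
A minimal sketch of the defaulting rule described above, assuming an optional argument named `vq_embed_dim`. Both that name and the helper `resolve_vq_embed_dim` are illustrative and may not match the actual constructor.

```python
from typing import Optional


def resolve_vq_embed_dim(latent_channels: int, vq_embed_dim: Optional[int] = None) -> int:
    """Return the codebook embedding dimension, falling back to latent_channels."""
    # When no explicit dimension is given, keep the old behaviour.
    return latent_channels if vq_embed_dim is None else vq_embed_dim


# Default: embedding dimension equals the number of latent channels.
default_dim = resolve_vq_embed_dim(latent_channels=256)

# VQ-diffusion VQVAE case from the message: 256 latent channels, 128-dim embeddings.
vq_diffusion_dim = resolve_vq_embed_dim(latent_channels=256, vq_embed_dim=128)
```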

* Changes for VQ-diffusion transformer

Modify attention.py so SpatialTransformer can be used for
VQ-diffusion's transformer.

SpatialTransformer:
- Can now operate over discrete inputs (classes of vector embeddings) as well as continuous.
- `in_channels` was made optional in the constructor so two locations where it was passed as a positional arg were moved to kwargs
- modified forward pass to take optional timestep embeddings

ImagePositionalEmbeddings:
- added to provide positional embeddings to discrete inputs for latent pixels

BasicTransformerBlock:
- norm layers were made configurable so that VQ-diffusion could use AdaLayerNorm with timestep embeddings
- modified forward pass to take optional timestep embeddings

CrossAttention:
- now may optionally take a bias parameter for its query, key, and value linear layers

FeedForward:
- Internal layers are now configurable

ApproximateGELU:
- Activation function in VQ-diffusion's feedforward layer

AdaLayerNorm:
- Norm layer modified to incorporate timestep embeddings
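
The commit message does not spell out the formula behind ApproximateGELU. A common sigmoid-based approximation is `x * sigmoid(1.702 * x)`, and the scalar sketch below assumes that form and compares it with the exact GELU. The function names are illustrative, and the real layer operates on tensors, not Python floats.

```python
import math


def approximate_gelu(x: float) -> float:
    """Sigmoid-based GELU approximation: x * sigmoid(1.702 * x) (assumed form)."""
    return x / (1.0 + math.exp(-1.702 * x))


def exact_gelu(x: float) -> float:
    """Exact GELU: x * Phi(x), with Phi the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


# Compare the two on a few points in a moderate input range.
for value in (-2.0, -1.0, 0.0, 1.0, 2.0):
    gap = abs(approximate_gelu(value) - exact_gelu(value))
    print(f"x={value:+.1f}  gap={gap:.4f}")
```

The approximation trades a small error for a cheaper activation, since it needs only one sigmoid instead of an error function.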

* Add VQ-diffusion scheduler

* Add VQ-diffusion pipeline

* Add VQ-diffusion convert script to diffusers

* Add VQ-diffusion dummy objects

* Add VQ-diffusion markdown docs

* Add VQ-diffusion tests

* some renaming

* some fixes

* more renaming

* correct

* fix typo

* correct weights

* finalize

* fix tests

* Apply suggestions from code review
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* finish

* finish

* up
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


feat: add repaint (#974) · d38c8043

Revist authored Nov 03, 2022



* feat: add repaint

* fix: fix quality check with `make fix-copies`

* fix: remove old unnecessary arg

* chore: change default to DDPM (looks better in experiments)

* ".to(device)" changed to "device="
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* make generator device-specific
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* make generator device-specific and change shape
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* fix: add preprocessing for image and mask
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* fix: update test
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update src/diffusers/pipelines/repaint/pipeline_repaint.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Add docs and examples

* Fix toctree
Co-authored-by: fja <fja@zurich.ibm.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Anton Lozhkov <anton@huggingface.co>


Allow saving `None` pipeline components (#1118) · 4a38166a
Anton Lozhkov authored Nov 03, 2022
```
* Allow saving `None` pipeline components

* support flax as well

* style
```
Fix hub-dependent tests for PRs (#1119) · 0edf9ca0
Anton Lozhkov authored Nov 03, 2022
```
* Remove the hub token

* replace repos

* style
```

02 Nov, 2022 4 commits

[Loading] Ignore unneeded files (#1107) · c39a511b
Patrick von Platen authored Nov 02, 2022
```
* [Loading] Ignore unneeded files

* up
```

Fix tests for equivalence of DDIM and DDPM pipelines (#1069) · 5cd29d62

Grigory Sizov authored Nov 02, 2022

* Fix equality test for ddim and ddpm

* add docs for use_clipped_model_output in DDIM

* fix inline comment

* reorder imports in test_pipelines.py

* Ignore use_clipped_model_output if scheduler doesn't take it
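
One way to "ignore `use_clipped_model_output` if the scheduler doesn't take it" is to inspect the signature of the scheduler's `step` method and forward only the accepted keyword arguments. The sketch below uses two stand-in scheduler classes and a hypothetical `filter_step_kwargs` helper; none of these names come from the commit.

```python
import inspect


class DDIMLikeScheduler:
    """Stand-in scheduler whose step() accepts use_clipped_model_output."""

    def step(self, model_output, timestep, sample, use_clipped_model_output=False):
        return {"sample": sample, "clipped": use_clipped_model_output}


class DDPMLikeScheduler:
    """Stand-in scheduler whose step() does not accept the extra argument."""

    def step(self, model_output, timestep, sample):
        return {"sample": sample}


def filter_step_kwargs(scheduler, **candidate_kwargs):
    """Keep only the keyword arguments that scheduler.step() declares."""
    accepted = inspect.signature(scheduler.step).parameters
    return {name: value for name, value in candidate_kwargs.items() if name in accepted}


for scheduler in (DDIMLikeScheduler(), DDPMLikeScheduler()):
    extra = filter_step_kwargs(scheduler, use_clipped_model_output=True)
    # The call succeeds for both schedulers because unsupported kwargs were dropped.
    result = scheduler.step(model_output=0.0, timestep=0, sample=1.0, **extra)
    print(type(scheduler).__name__, extra, result)
```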


[CI] Framework and hardware-specific CI tests (#997) · 4e59bcc6

Anton Lozhkov authored Nov 02, 2022

* [WIP][CI] Framework and hardware-specific docker images for CI tests

* username

* fix cpu

* try out the image

* push latest

* update workspace

* no root isolation for actions

* add a flax image

* flax and onnx matrix

* fix runners

* add reports

* onnxruntime image

* retry tpu

* fix

* fix

* build onnxruntime

* naming

* onnxruntime-gpu image

* onnxruntime-gpu image, slow tests

* latest jax version

* trigger flax

* run flax tests in one thread

* fast flax tests on cpu

* fast flax tests on cpu

* trigger slow tests

* rebuild torch cuda

* force cuda provider

* fix onnxruntime tests

* trigger slow

* don't specify gpu for tpu

* optimize

* memory limit

* fix flax tests

* disable docker cache


Integration tests precision improvement for inpainting (#1052) · 8ee21915

Lewington-pitsos authored Nov 02, 2022



* improve test precision

get tests passing with greater precision using lewington images

* make old numpy load function a wrapper around a more flexible numpy loading function

* adhere to black formatting

* add more black formatting

* adhere to isort

* loosen precision and replace path
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


31 Oct, 2022 4 commits

[Tests] Fix slow tests (#1087) · 17c2c060
Patrick von Platen authored Oct 31, 2022


[Better scheduler docs] Improve usage examples of schedulers (#890) · c18941b0

Patrick von Platen authored Oct 31, 2022



* [Better scheduler docs] Improve usage examples of schedulers

* finish

* fix warnings and add test

* finish

* more replacements

* adapt fast tests hf token

* correct more

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Integrate compatibility with euler
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>


k-diffusion-euler (#1019) · a1ea8c01

hlky authored Oct 31, 2022



* k-diffusion-euler

* make style make quality

* make fix-copies

* fix tests for euler a

* Update src/diffusers/schedulers/scheduling_euler_ancestral_discrete.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update src/diffusers/schedulers/scheduling_euler_ancestral_discrete.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update src/diffusers/schedulers/scheduling_euler_discrete.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* Update src/diffusers/schedulers/scheduling_euler_discrete.py
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>

* remove unused arg and method

* update doc

* quality

* make flake happy

* use logger instead of warn

* raise error instead of deprecation

* don't require scipy

* pass generator in step

* fix tests

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* Update tests/test_scheduler.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* remove unused generator

* pass generator as extra_step_kwargs

* update tests

* pass generator as kwarg

* pass generator as kwarg

* quality

* fix test for lms

* fix tests
Co-authored-by: patil-suraj <surajp815@gmail.com>
Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>


fix slow test · 707b8684
Patrick von Platen authored Oct 31, 2022


28 Oct, 2022 8 commits
- higher precision for vae · 81b6fbf1
  Patrick von Platen authored Oct 28, 2022
  
- increase tolerance · a7ae808e
  Patrick von Platen authored Oct 28, 2022
  
- hot fix · cbbb2939
  Patrick von Platen authored Oct 28, 2022
  
- [Tests] no random latents anymore (#1045) · d37f08da
  Patrick von Platen authored Oct 28, 2022
  
- [Tests] Better prints (#1043) · c4ef1efe
  Patrick von Platen authored Oct 28, 2022
  
- Fix some failing tests (#1041) · 8d6487f3
  Patrick von Platen authored Oct 28, 2022
```
* up

* up

* up

* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py

* Apply suggestions from code review
```
- [Tests] Speed up slow tests (#1040) · d2d9764f
  Patrick von Platen authored Oct 28, 2022
```
* [Tests] Speed up slow tests

* Up

* up
```
- [Tests] Improve unet / vae tests (#1018) · a80480f0
  Patrick von Platen authored Oct 28, 2022
```
* improve tests

* up

* finish

* upload

* add init

* up

* finish vae

* finish

* reduce loading time with device_map

* remove device_map from CPU

* uP
```
27 Oct, 2022 2 commits

[Accelerate model loading] Fix meta device and super low memory usage (#1016) · 3be9fa97
Patrick von Platen authored Oct 27, 2022
```
* [Accelerate model loading] Fix meta device and super low memory usage

* better naming
```

Continuation of #942: additional float64 failure (#996) · 1d04e1b4

Pedro Cuenca authored Oct 27, 2022

* Add failing test for #940.

* Do not use torch.float64 in mps.

* style

* Temporarily skip add_noise for IPNDMScheduler.

Until #990 is addressed.

* Fix additional float64 error in mps.

* Improve add_noise test

* Slight edit – I think it's clearer this way.


26 Oct, 2022 2 commits

minimal stable diffusion GPU memory usage with accelerate hooks (#850) · b2e2d141

Pi Esposito authored Oct 26, 2022

* add method to enable cuda with minimal gpu usage to stable diffusion

* add test to minimal cuda memory usage

* ensure all models but unet are on torch.float32

* move to cpu_offload along with minor internal changes to make it work

* make it test against accelerate master branch

* coming back, it's official: I don't know how to make it test against the master branch from accelerate

* make it install accelerate from master on tests

* go back to accelerate>=0.11

* undo prettier formatting on yml files

* undo prettier formatting on yml files again


Do not use torch.float64 on the mps device (#942) · 0343d8f5

Pedro Cuenca authored Oct 26, 2022

* Add failing test for #940.

* Do not use torch.float64 in mps.

* style

* Temporarily skip add_noise for IPNDMScheduler.

Until #990 is addressed.


25 Oct, 2022 4 commits

[Dance Diffusion] Better naming (#981) · 59f0ce82
Patrick von Platen authored Oct 25, 2022
```
uP
```
[Dance Diffusion] FP16 (#980) · 365ff8f7
Patrick von Platen authored Oct 25, 2022
```
* add in fp16

* up
```

[Dance Diffusion] Add dance diffusion (#803) · 88fa6b7d

Patrick von Platen authored Oct 25, 2022



* start

* add more logic

* Update src/diffusers/models/unet_2d_condition_flax.py

* match weights

* up

* make model work

* making class more general, fixing missed file rename

* small fix

* make new conversion work

* up

* finalize conversion

* up

* first batch of variable renamings

* remove c and c_prev var names

* add mid and out block structure

* add pipeline

* up

* finish conversion

* finish

* upload

* more fixes

* Apply suggestions from code review

* add attr

* up

* uP

* up

* finish tests

* finish

* uP

* finish

* fix test

* up

* naming consistency in tests

* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Nathan Lambert <nathan@huggingface.co>
Co-authored-by: Anton Lozhkov <anton@huggingface.co>

* remove hardcoded 16

* Remove bogus

* fix some stuff

* finish

* improve logging

* docs

* upload
Co-authored-by: Nathan Lambert <nol@berkeley.edu>
Co-authored-by: Suraj Patil <surajp815@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Nathan Lambert <nathan@huggingface.co>
Co-authored-by: Anton Lozhkov <anton@huggingface.co>


[Flax] added broadcast_to_shape_from_left helper and Scheduler tests (#864) · 240abddf

Kashif Rasul authored Oct 25, 2022



* added broadcast_to_shape_from_left helper

* initial tests

* fixed pndm tests

* shape required for pndm

* added require_flax

* fix style

* fix more imports
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
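
Judging by its name, `broadcast_to_shape_from_left` aligns an array's existing axes with the leftmost axes of a target shape, so that per-sample values can be broadcast over the remaining dimensions. That reading is an assumption, not something the commit message states. The sketch below works on plain shape tuples only and uses a hypothetical `left_aligned_shape` helper to show the intermediate reshape such a helper would need before broadcasting.

```python
from typing import Tuple


def left_aligned_shape(x_shape: Tuple[int, ...], target_shape: Tuple[int, ...]) -> Tuple[int, ...]:
    """Pad x_shape with trailing 1s so its axes line up with the left of target_shape."""
    if len(target_shape) < len(x_shape):
        raise ValueError("target shape must have at least as many axes as the input")
    return tuple(x_shape) + (1,) * (len(target_shape) - len(x_shape))


# One scalar per batch element, to be broadcast over a (batch, channels, height, width) sample.
per_sample_shape = left_aligned_shape((4,), (4, 3, 32, 32))
print(per_sample_shape)
```

Standard broadcasting aligns axes from the right, which is why the trailing singleton axes are needed before a batch-sized vector can multiply an image-shaped array.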


24 Oct, 2022 1 commit
- Reorganize pipeline tests (#963) · 2c82e0c4
  Anton Lozhkov authored Oct 24, 2022
```
* Reorganize pipeline tests

* fix vq
```
22 Oct, 2022 1 commit
- [MPS] fix mps failing tests (#934) · 9bca4029
  Kashif Rasul authored Oct 22, 2022
```
fix mps failing tests
```
21 Oct, 2022 1 commit
- [Tests] Move stable diffusion into their own files (#936) · 25dfd0f8
  Patrick von Platen authored Oct 21, 2022
```
* [Tests] Move stable diffusion into their own files

* up
```
20 Oct, 2022 4 commits
- Introduce the copy mechanism (#924) · 32bf4fdc
  Anton Lozhkov authored Oct 20, 2022
```
* Introduce the copy mechanism

* init tests

* fix dummy tests

* with

* update copies tests
```
- fix test_components (#928) · 8be48507
  Suraj Patil authored Oct 20, 2022
  
- [DiffusionPipeline.from_pretrained] add warning when passing unused k… (#870) · db19a9d9
  Patrick von Platen authored Oct 20, 2022
```
[DiffusionPipeline.from_pretrained] add warning when passing unused kwargs
```
- [Stable Diffusion] Add components function (#889) · 83f8a5ff
  Patrick von Platen authored Oct 20, 2022
```
* [Stable Diffusion] Add components function

* uP
```
19 Oct, 2022 4 commits

ONNX supervised inpainting (#906) · 89d12494

Anton Lozhkov authored Oct 19, 2022

* ONNX supervised inpainting

* sync with the torch pipeline

* fix concat

* update ref values

* back to 8 steps

* type fix

* make fix-copies


finish tests (#909) · 46557121
Patrick von Platen authored Oct 19, 2022


Stable diffusion inpainting. (#904) · b35d88c5

Suraj Patil authored Oct 19, 2022



* begin pipe

* add new pipeline

* add tests

* correct fast test

* up

* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_inpaint.py

* Update tests/test_pipelines.py

* up

* up

* make style

* add fp16 test

* doc, comments

* up
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Anton Lozhkov <anton@huggingface.co>


Improve ONNX img2img numpy handling, temporarily fix the tests (#899) · 8eb9d970
Anton Lozhkov authored Oct 19, 2022
```
* [WIP] Onnx img2img determinism

* more numpy + seed

* numpy inpainting, tolerance

* revert test workflow
```