Commits · 1870fb05a903546b79236d277ae4bc12e626b328 · renzhc / diffusers_dcu

23 Mar, 2023 14 commits

[docs] Add Colab notebooks and Spaces (#2713) · 1870fb05
Steven Liu authored Mar 23, 2023
```
* add colab notebook and spaces

* fix image link
```
1870fb05

YiYi Xu authored Mar 23, 2023



* add contronet flax

---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>

df91c447

Skip `mps` in text-to-video tests (#2792) · aa0531fa
Pedro Cuenca authored Mar 23, 2023
```
* Skip mps in text-to-video tests.

* style

* Skip UNet3D mps tests.
```
aa0531fa

Update train_text_to_image_lora.py (#2767) · dc5b4e23

Haofan Wang authored Mar 23, 2023

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* Update train_text_to_image_lora.py

* format

dc5b4e23

[Docs] small fixes to the text to video doc. (#2787) · 0d7aac3e
Sayak Paul authored Mar 23, 2023
```
* small fixes to the text to video doc.

* add: Spaces link.

* add: warning on research-only model.
```
0d7aac3e

[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipeline (#2779) · 055c90f5

Nipun Jindal authored Mar 23, 2023



[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipelines
Co-authored-by: njindal <njindal@adobe.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

055c90f5

Music Spectrogram diffusion pipeline (#1044) · 2ef9bdd7

Kashif Rasul authored Mar 23, 2023



* initial TokenEncoder and ContinuousEncoder

* initial modules

* added ContinuousContextTransformer

* fix copy paste error

* use numpy for get_sequence_length

* initial terminal relative positional encodings

* fix weights keys

* fix assert

* cross attend style: concat encodings

* make style

* concat once

* fix formatting

* Initial SpectrogramPipeline

* fix input_tokens

* make style

* added mel output

* ignore weights for config

* move mel to numpy

* import pipeline

* fix class names and import

* moved models to models folder

* import ContinuousContextTransformer and SpectrogramDiffusionPipeline

* initial spec diffusion converstion script

* renamed config to t5config

* added weight loading

* use arguments instead of t5config

* broadcast noise time to batch dim

* fix call

* added scale_to_features

* fix weights

* transpose laynorm weight

* scale is a vector

* scale the query outputs

* added comment

* undo scaling

* undo depth_scaling

* inital get_extended_attention_mask

* attention_mask is none in self-attention

* cleanup

* manually invert attention

* nn.linear need bias=False

* added T5LayerFFCond

* remove to fix conflict

* make style and dummy

* remove unsed variables

* remove predict_epsilon

* Move accelerate to a soft-dependency (#1134)

* finish

* finish

* Update src/diffusers/modeling_utils.py

* Update src/diffusers/pipeline_utils.py
Co-authored-by: Anton Lozhkov <anton@huggingface.co>

* more fixes

* fix
Co-authored-by: Anton Lozhkov <anton@huggingface.co>

* fix order

* added initial midi to note token data pipeline

* added int to int tokenizer

* remove duplicate

* added logic for segments

* add melgan to pipeline

* move autoregressive gen into pipeline

* added note_representation_processor_chain

* fix dtypes

* remove immutabledict req

* initial doc

* use np.where

* require note_seq

* fix typo

* update dependency

* added note-seq to test

* added is_note_seq_available

* fix import

* added toc

* added example usage

* undo for now

* moved docs

* fix merge

* fix imports

* predict first segment

* avoid un-needed copy to and from cpu

* make style

* Copyright

* fix style

* add test and fix inference steps

* remove bogus files

* reorder models

* up

* remove transformers dependency

* make work with diffusers cross attention

* clean more

* remove @

* improve further

* up

* uP

* Apply suggestions from code review

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py

* loop over all tokens

* make style

* Added a section on the model

* fix formatting

* grammer

* formatting

* make fix-copies

* Update src/diffusers/pipelines/__init__.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/pipelines/spectrogram_diffusion/pipeline_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* added callback ad optional ionnx

* do not squeeze batch dim

* clean up more

* upload

* convert jax to nnumpy

* make style

* fix warning

* make fix-copies

* fix warning

* add initial fast tests

* add initial pipeline_params

* eval mode due to dropout

* skip batch tests as pipeline runs on a single file

* make style

* fix relative path

* fix doc tests

* Update src/diffusers/models/t5_film_transformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/models/t5_film_transformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update docs/source/en/api/pipelines/spectrogram_diffusion.mdx
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* add MidiProcessor

* format

* fix org

* Apply suggestions from code review

* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py

* make style

* pin protobuf to <4

* fix formatting

* white space

* tensorboard needs protobuf

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Anton Lozhkov <anton@huggingface.co>

2ef9bdd7

Rename 'CLIPFeatureExtractor' class to 'CLIPImageProcessor' (#2732) · 14e3a28c

Naoki Ainoya authored Mar 23, 2023

The 'CLIPFeatureExtractor' class name has been renamed to 'CLIPImageProcessor' in order to comply with future deprecation. This commit includes the necessary changes to the affected files.

14e3a28c

[doc wip] literalinclude (#2718) · 8e35ef01
Mishig authored Mar 23, 2023

8e35ef01
[UNet3DModel] Fix with attn processor (#2790) · a8315ce1
Patrick von Platen authored Mar 23, 2023
```
* [UNet3DModel] Fix attn processor

* make style
```
a8315ce1
deduplicate training section in the docs. (#2788) · 0d633a42
Sayak Paul authored Mar 23, 2023

0d633a42

[Examples] InstructPix2Pix instruct training script (#2478) · 9dc84448

Sayak Paul authored Mar 23, 2023



* add: initial implementation of the pix2pix instruct training script.

* shorten cli arg.

* fix: main process check.

* fix: dataset column names.

* simplify tokenization.

* proper placement of null conditions.

* apply styling.

* remove debugging message for conditioning do.

* complete license.

* add: requirements.tzt

* wandb column name order.

* fix: augmentation.

* change: dataset_id.

* fix: convert_to_np() call.

* fix: reshaping.

* fix: final ema copy.

* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* address PR comments.

* add: readme details.

* config fix.

* downgrade version.

* reduce image width in the readme.

* note on hyperparameters during generation.

* add: output images.

* update readme.

* minor edits to readme.

* debugging statement.

* explicitly placement of the pipeline.

* bump minimum diffusers version.

* fix: device attribute error.

* weight dtype.

* debugging.

* add dtype inform.

* add seoarate te and vae.

* add: explicit casting/

* remove casting.

* up.

* up 2.

* up 3.

* autocast.

* disable mixed-precision in the final inference.

* debugging information.

* autocasting.

* add: instructpix2pix training section to the docs.

* Empty-Commit

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

9dc84448

add: section on multiple controlnets. (#2762) · c681ad1a

Sayak Paul authored Mar 23, 2023



* add: section on multiple controlnets.
Co-authored-by: William Berman <WLBberman@gmail.com>

* fix: docs.

* fix: docs.

---------
Co-authored-by: William Berman <WLBberman@gmail.com>

c681ad1a

Support for Offset Noise in examples (#2753) · e0d8c9ef
Haofan Wang authored Mar 23, 2023
```
* add noise offset

* make style
```
e0d8c9ef

22 Mar, 2023 3 commits

`mps`: remove warmup passes (#2771) · 92e1164e

Pedro Cuenca authored Mar 22, 2023

* Remove warmup passes in mps tests.

* Update mps docs: no warmup pass in PyTorch 2

* Update imports.

92e1164e

[MS Text To Video] Add first text to video (#2738) · ca1a2229

Patrick von Platen authored Mar 22, 2023



* [MS Text To Video} Add first text to video

* upload

* make first model example

* match unet3d params

* make sure weights are correcctly converted

* improve

* forward pass works, but diff result

* make forward work

* fix more

* finish

* refactor video output class.

* feat: add support for a video export utility.

* fix: opencv availability check.

* run make fix-copies.

* add: docs for the model components.

* add: standalone pipeline doc.

* edit docstring of the pipeline.

* add: right path to TransformerTempModel

* add: first set of tests.

* complete fast tests for text to video.

* fix bug

* up

* three fast tests failing.

* add: note on slow tests

* make work with all schedulers

* apply styling.

* add slow tests

* change file name

* update

* more correction

* more fixes

* finish

* up

* Apply suggestions from code review

* up

* finish

* make copies

* fix pipeline tests

* fix more tests

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* apply suggestions

* up

* revert

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

ca1a2229

[docs] Clarify purpose of reproducibility docs (#2756) · 7fe88613
Steven Liu authored Mar 21, 2023
```
* clarify purpose of repro docs

* apply feedback
```
7fe88613

21 Mar, 2023 10 commits

[docs] update torch 2 benchmark (#2764) · a39d42b9

Pedro Cuenca authored Mar 21, 2023

* Update benchmark for A100, 3090, 3090 Ti, 4090.

* Link to PyTorch blog.

* Update install instructions.

a39d42b9

stable diffusion depth batching fix (#2757) · ca1e4072
Will Berman authored Mar 21, 2023

ca1e4072
Add option to set dtype in pipeline.to() method (#2317) · b33bd91f
1lint authored Mar 21, 2023
```
add test_to_dtype to check pipe.to(fp16)
```
b33bd91f
Fix mps tests on torch 2.0 (#2766) · 1fcf279d
Pedro Cuenca authored Mar 21, 2023

1fcf279d
Add guidance start/end parameters to StableDiffusionControlNetImg2ImgPipeline (#2731) · 58bcf46a
Hyowon Ha authored Mar 21, 2023
```
* Add guidance start/end parameters to community controlnet img2img pipeline

* Fix formats
```
58bcf46a

[1929]: Add CLIP guidance for Img2Img stable diffusion pipeline (#2723) · 0042efd0

Nipun Jindal authored Mar 21, 2023



* [Img2Img]: Copyover img2img pipeline

* [Img2Img]: img2img pipeline

* [Img2Img]: img2img pipeline

* [Img2Img]: img2img pipeline

---------
Co-authored-by: njindal <njindal@adobe.com>

0042efd0

Fix typos (#2715) · f024e003
Alexander Pivovarov authored Mar 21, 2023
```
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
```
f024e003

Improve Contribution Doc (#2043) · 2120b4ee

Patrick von Platen authored Mar 21, 2023



* first refactor

* more text

* improve

* finish

* up

* up

* up

* up

* finish

* Apply suggestions from code review
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* up

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* finished

* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* finished

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

2120b4ee

Update numbers for Habana Gaudi in documentation (#2734) · c10d6854
regisss authored Mar 21, 2023
```
Update numbers for Habana Gaudi in doc
```
c10d6854

add: controlnet entry to training section in the docs. (#2677) · 73bdad08

Sayak Paul authored Mar 21, 2023



* add: controlnet entry to training section in the docs.

* formatting.

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* wrap in a tip block.

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

73bdad08

20 Mar, 2023 6 commits
- Update text_inversion.mdx (#2751) · ba87c160
  M. Tolga Cangöz authored Mar 20, 2023
```
Fix typos
```
  ba87c160
- Update philosophy.mdx (#2752) · afe59a92
  M. Tolga Cangöz authored Mar 20, 2023
```
Fix typos
```
  afe59a92
- Update dreambooth.mdx (#2742) · 25ed7cb0
  M. Tolga Cangöz authored Mar 20, 2023
```
Fix typos
```
  25ed7cb0
- Update fp16.mdx (#2746) · af86b0cc
  M. Tolga Cangöz authored Mar 20, 2023
```
Fix typos
```
  af86b0cc
- Update torch2.0.mdx (#2748) · a9f28b68
  M. Tolga Cangöz authored Mar 20, 2023
```
Fix typos
```
  a9f28b68
- Update mps.mdx (#2749) · d91dc57d
  M. Tolga Cangöz authored Mar 20, 2023
```
Fix typos
```
  d91dc57d
18 Mar, 2023 3 commits
- Fix more slow tests · fdcff560
  Patrick von Platen authored Mar 18, 2023
  
  fdcff560
- Update README.md · ec2c1bc9
  Patrick von Platen authored Mar 18, 2023
  
  ec2c1bc9
- [Tests] Correct PT2 (#2724) · 9ecd9248
  Patrick von Platen authored Mar 18, 2023
```
* [Tests] Correct PT2

* correct more

* move versatile to nightly

* up

* up

* again

* Apply suggestions from code review
```
  9ecd9248
17 Mar, 2023 3 commits

Enabling gradient checkpointing for VAE (#2536) · 116f70cb

Andy authored Mar 17, 2023



* updated black format

* update black format

* make style format

* updated line endings

* update code formatting

* Update examples/research_projects/onnxruntime/text_to_image/train_text_to_image.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/models/vae.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update src/diffusers/models/vae.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* added vae gradient checkpointing test

* make style

---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Will Berman <wlbberman@gmail.com>

116f70cb

[docs] Update ONNX doc to use `optimum` (#2702) · a1695715

Sayak Paul authored Mar 17, 2023



* minor edits to onnx and openvino docs.

* Apply suggestions from code review
Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

---------
Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

a1695715

fix image link in inpaint doc (#2693) · f4bbcb29
YiYi Xu authored Mar 16, 2023
```
fix link
Co-authored-by: yiyixuxu <yixu310@gmail,com>
```
f4bbcb29

16 Mar, 2023 1 commit
- Improve deprecation error message when using cross_attention import (#2710) · a41850a2
  Patrick von Platen authored Mar 17, 2023
```
Improve error message
```
  a41850a2