Commits · 04f4bd54ea3126185ced2ffdf26f608dcd1db30e · renzhc / diffusers_dcu

10 May, 2024 1 commit

[Core] introduce videoprocessor. (#7776) · 04f4bd54

Sayak Paul authored May 10, 2024



* introduce videoprocessor.

* fix quality

* address yiyi's feedback

* fix preprocess_video call.

* video_processor -> image_processor

* fix

* fix more.

* quality

* image_processor -> video_processor

* support List[List[PIL.Image.Image]]

* change to video_processor.

* documentation

* Apply suggestions from code review

* changes

* remove print.

* refactor video processor (part # 7776) (#7861)

* update

* update remove deprecate

* Update src/diffusers/video_processor.py

* update

* Apply suggestions from code review

* deprecate list of 5d for video and list of 4d for image + apply other feedback

* up

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* add doc.

* tensor2vid -> postprocess_video.

* refactor preprocess with preprocess_video

* set default values.

* empty commit

* more refactoring of prepare_latents in animatediff vid2vid

* checking documentation

* remove documentation for now.

* fix animatediff sdxl

* fix test failure [part of video processor PR] (#7905)

up

* remove preceed_with_frames.

* doc

* fix

* fix

* remove video input as a single-frame video.

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

04f4bd54

09 May, 2024 4 commits

[scheduler] support custom `timesteps` and `sigmas` (#7817) · b934215d

YiYi Xu authored May 09, 2024



* support custom sigmas and timesteps, dpm euler

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

b934215d
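
A minimal sketch of what supporting custom `timesteps` and `sigmas` could look like from the caller's side. It is an illustration only, not the code from this commit: `ToyScheduler` and `resolve_schedule` are hypothetical names, and treating the two arguments as mutually exclusive is an assumption of the sketch.

```python
import inspect


class ToyScheduler:
    """Hypothetical stand-in for a scheduler; not a diffusers class."""

    def set_timesteps(self, num_inference_steps=None, timesteps=None, sigmas=None):
        if timesteps is not None:
            # Caller supplied the exact timesteps to visit.
            self.timesteps = list(timesteps)
        elif sigmas is not None:
            # Caller supplied noise levels; the toy timesteps just index them.
            self.sigmas = list(sigmas)
            self.timesteps = list(range(len(self.sigmas) - 1, -1, -1))
        else:
            # Default: evenly spaced integer steps, counted down to zero.
            self.timesteps = list(range(num_inference_steps - 1, -1, -1))


def resolve_schedule(scheduler, num_inference_steps=None, timesteps=None, sigmas=None):
    """Forward a custom schedule to the scheduler, or fall back to a step count."""
    if timesteps is not None and sigmas is not None:
        raise ValueError("Pass either `timesteps` or `sigmas`, not both.")

    # Only forward an argument that the scheduler's set_timesteps accepts.
    accepted = inspect.signature(scheduler.set_timesteps).parameters
    if timesteps is not None:
        if "timesteps" not in accepted:
            raise ValueError("This scheduler does not accept custom timesteps.")
        scheduler.set_timesteps(timesteps=timesteps)
    elif sigmas is not None:
        if "sigmas" not in accepted:
            raise ValueError("This scheduler does not accept custom sigmas.")
        scheduler.set_timesteps(sigmas=sigmas)
    else:
        scheduler.set_timesteps(num_inference_steps)

    return scheduler.timesteps, len(scheduler.timesteps)


custom_steps, custom_count = resolve_schedule(ToyScheduler(), timesteps=[999, 500, 1])
default_steps, default_count = resolve_schedule(ToyScheduler(), num_inference_steps=4)
```

Checking the `set_timesteps` signature before forwarding lets schedulers without custom-schedule support fail with a clear error instead of an unexpected-keyword exception.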

fix `_optional_components` in `StableCascadeCombinedPipeline` (#7894) · 5ed3abd3
YiYi Xu authored May 09, 2024
```
* fix

* up
```
5ed3abd3

[Tests] fix things after #7013 (#7899) · 305f2b44

Sayak Paul authored May 09, 2024

* debugging

* save the resulting image

* check if order reversing works.

* checking values.

* up

* okay

* checking

* fix

* remove print

305f2b44

[Refactor] Better align `from_single_file` logic with `from_pretrained` (#7496) · cb0f3b49

Dhruv Nair authored May 09, 2024



* refactor unet single file loading a bit.

* retrieve the unet from create_diffusers_unet_model_from_ldm

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* tests

* update

* update

* update

* Update docs/source/en/api/single_file.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update docs/source/en/api/single_file.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* Update docs/source/en/api/loaders/single_file.md
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/loaders/single_file.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update docs/source/en/api/loaders/single_file.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update docs/source/en/api/loaders/single_file.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update docs/source/en/api/loaders/single_file.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update docs/source/en/api/loaders/single_file.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

---------
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

cb0f3b49

08 May, 2024 2 commits

fix offload test (#7868) · 35358a2d
YiYi Xu authored May 08, 2024
```
fix
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
```
35358a2d

[Pipeline] AnimateDiff SDXL (#6721) · 818f7607

Aryan authored May 08, 2024



* update conversion script to handle motion adapter sdxl checkpoint

* add animatediff xl

* handle addition_embed_type

* fix output

* update

* add imports

* make fix-copies

* add decode latents

* update docstrings

* add animatediff sdxl to docs

* remove unnecessary lines

* update example

* add test

* revert conv_in conv_out kernel param

* remove unused param addition_embed_type_num_heads

* latest IPAdapter impl

* make fix-copies

* fix return

* add IPAdapterTesterMixin to tests

* fix return

* revert based on suggestion

* add freeinit

* fix test_to_dtype test

* use StableDiffusionMixin instead of different helper methods

* fix progress bar iterations

* apply suggestions from review

* hardcode flip_sin_to_cos and freq_shift

* make fix-copies

* fix ip adapter implementation

* fix last failing test

* make style

* Update docs/source/en/api/pipelines/animatediff.md
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* remove todo

* fix doc-builder errors

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

818f7607

07 May, 2024 1 commit
- Fix for "no lora weight found module" with some loras (#7875) · 23e09156
  Álvaro Somoza authored May 07, 2024
```
* return layer weight if not found

* better system and test

* key example and typo
```
  23e09156
03 May, 2024 2 commits

Add Ascend NPU support for SDXL fine-tuning and fix the model saving bug when... · 58237364

HelloWorldBeginner authored May 04, 2024


Add Ascend NPU support for SDXL fine-tuning and fix the model saving bug when using DeepSpeed. (#7816)

* Add Ascend NPU support for SDXL fine-tuning and fix the model saving bug when using DeepSpeed.

* fix check code quality

* Decouple the NPU flash attention and make it an independent module.

* add doc and unit tests for npu flash attention.

---------
Co-authored-by: mhh001 <mahonghao1@huawei.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

58237364

[Tests] reduce the model size in the blipdiffusion fast test (#7849) · fa489eae
Aritra Roy Gosthipaty authored May 03, 2024
```
reducing model size
```
fa489eae

02 May, 2024 2 commits
- Update download diff format tests (#7831) · 03ca1131
  Dhruv Nair authored May 02, 2024
```
update
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  03ca1131
- [Tests] reduce the model size in the audioldm fast test (#7833) · 435d37ce
  Aritra Roy Gosthipaty authored May 02, 2024
```
chore: initial size reduction of models
```
  435d37ce
01 May, 2024 2 commits
- update the logic of `is_sequential_cpu_offload` (#7788) · 21a7ff12
  YiYi Xu authored May 01, 2024
```
* up

* add comment to the tests + fix dit

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  21a7ff12
- [Tests] fix: device map tests for models (#7825) · 8909ab4b
  Sayak Paul authored May 01, 2024
```
* fix: device module tests

* remove patch file

* Empty-Commit
```
  8909ab4b
30 Apr, 2024 4 commits

[Core] introduce _no_split_modules to `ModelMixin` (#6396) · 3fd31eef

Sayak Paul authored Apr 30, 2024

* introduce _no_split_modules.

* unnecessary spaces.

* remove unnecessary kwargs and style

* fix: accelerate imports.

* change to _determine_device_map

* add the blocks that have residual connections.

* add: CrossAttnUpBlock2D

* add: testing

* style

* line-spaces

* quality

* add disk offload test without safetensors.

* checking disk offloading percentages.

* change model split

* add: utility for checking multi-gpu requirement.

* model parallelism test

* splits.

* splits.

* splits

* splits.

* splits.

* splits.

* offload folder to test_disk_offload_with_safetensors

* add _no_split_modules

* fix-copies

3fd31eef

[Tests] reduce the model size in the amused fast test (#7804) · b02e2113

Aritra Roy Gosthipaty authored Apr 30, 2024



* chore: reducing model sizes

* chore: shrinks further

* chore: shrinks further

* chore: shrinking model for img2img pipeline

* chore: reducing size of model for inpaint pipeline

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

b02e2113

[Tests] reduce the model size in the ddpm fast test (#7797) · 21f023ec

Aritra Roy Gosthipaty authored Apr 30, 2024



* chore: reducing unet size for faster tests

* review suggestions

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

21f023ec

[Tests] reduce the model size in the ddim fast test (#7803) · 31d9f9ea
Aritra Roy Gosthipaty authored Apr 30, 2024
```
chore: reducing model size for ddim fast pipeline
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
31d9f9ea

25 Apr, 2024 1 commit
- [Tests] mark UNetControlNetXSModelTests::test_forward_no_control to be flaky (#7771) · b833d0fc
  Sayak Paul authored Apr 25, 2024
```
decorate UNetControlNetXSModelTests::test_forward_no_control with is_flaky
```
  b833d0fc
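
A rough sketch of the idea behind marking a test as flaky: re-run it a few times and fail only if every attempt fails. The decorator below, `retry_flaky`, is a hypothetical stand-in written for illustration. It is not the `is_flaky` helper named in the commit, and its parameter name is an assumption.

```python
import functools


def retry_flaky(max_attempts=3):
    """Hypothetical decorator: re-run a test until it passes or attempts run out."""

    def decorator(test_fn):
        @functools.wraps(test_fn)
        def wrapper(*args, **kwargs):
            for attempt in range(1, max_attempts + 1):
                try:
                    return test_fn(*args, **kwargs)
                except AssertionError:
                    # Re-raise only when the last allowed attempt has failed.
                    if attempt == max_attempts:
                        raise

        return wrapper

    return decorator


calls = {"count": 0}


@retry_flaky(max_attempts=3)
def sometimes_fails():
    # Simulated nondeterminism: the first call fails, later calls pass.
    calls["count"] += 1
    assert calls["count"] >= 2, "fails on the first attempt only"
    return "passed"


result = sometimes_fails()
```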
24 Apr, 2024 3 commits

PixArt-Sigma Implementation (#7654) · 39215aa3

Junsong Chen authored Apr 24, 2024



* support PixArt-DMD

---------
Co-authored-by: jschen <chenjunsong4@h-partners.com>
Co-authored-by: badayvedat <badayvedat@gmail.com>
Co-authored-by: Vedat Baday <54285744+badayvedat@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail,com>

39215aa3

Fix test for consistency decoder. (#7746) · 9ef43f38
Dhruv Nair authored Apr 24, 2024
```
update
```
9ef43f38
Fix failing VAE tiling test (#7747) · 88018fcf
Dhruv Nair authored Apr 24, 2024
```
update
```
88018fcf

22 Apr, 2024 4 commits

Restore AttnProcessor2_0 in unload_ip_adapter (#7727) · 065f2517

Fabio Rigano authored Apr 23, 2024



* Restore AttnProcessor2_0 in unload_ip_adapter

* Fix style

* Update test

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

065f2517

Support InstantStyle (#7668) · 21c747fa

Jenyuan-Huang authored Apr 23, 2024



* enable control ip-adapter per-transformer block on-the-fly

---------
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>
Co-authored-by: ResearcherXman <xhs.research@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

21c747fa

Fix Kandinsky V22 tests (#7699) · a9dd8602
Dhruv Nair authored Apr 22, 2024
```
update
```
a9dd8602
Update Wuerstchen Test (#7700) · 91006524
Dhruv Nair authored Apr 22, 2024
```
update
```
91006524

19 Apr, 2024 3 commits

Cleanup ControlnetXS (#7701) · 3cfe187d
Dhruv Nair authored Apr 19, 2024
```
* update

* update
```
3cfe187d

adding back test_conversion_when_using_device_map (#7704) · e5674015

YiYi Xu authored Apr 18, 2024



* style


* Fix device map nits (#7705)


---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

e5674015

Move IP Adapter Face ID to core (#7186) · b5c8b555

Fabio Rigano authored Apr 19, 2024



* Switch to peft and multi proj layers

* Move Face ID loading and inference to core

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

b5c8b555

16 Apr, 2024 1 commit

Fixing implementation of ControlNet-XS (#6772) · fda1531d

UmerHA authored Apr 16, 2024



* CheckIn - created DownSubBlocks

* Added extra channels, implemented subblock fwd

* Fixed connection sizes

* checkin

* Removed iter, next in forward

* Models for SD21 & SDXL run through

* Added back pipelines, cleared up connections

* Cleaned up connection creation

* added debug logs

* updated logs

* logs: added input loading

* Update umer_debug_logger.py

* log: Loading hint

* Update umer_debug_logger.py

* added logs

* Changed debug logging

* debug: added more logs

* Fixed num_norm_groups

* Debug: Logging all of SDXL input

* Update umer_debug_logger.py

* debug: updated logs

* checkin

* Readded tests

* Removed debug logs

* Fixed Slow Tests

* Added value checks | Updated model_cpu_offload_seq

* accelerate-offloading works ; fast tests work

* Made unet & addon explicit in controlnet

* Updated slow tests

* Added dtype/device to ControlNetXS

* Filled in test model paths

* Added image_encoder/feature_extractor to XL pipe

* Fixed fast tests

* Added comments and docstrings

* Fixed copies

* Added docs ; Updates slow tests

* Moved changes to UNetMidBlock2DCrossAttn

* tiny cleanups

* Removed stray prints

* Removed ip adapters + freeU

- Removed ip adapters + freeU as they don't make sense for ControlNet-XS
- Fixed imports of UNet components

* Fixed test_save_load_float16

* Make style, quality, fix-copies

* Changed loading/saving API for ControlNetXS

- Changed loading/saving API for ControlNetXS
- other small fixes

* Removed ControlNet-XS from research examples

* Make style, quality, fix-copies

* Small fixes

- deleted ControlNetXSModel.init_original
- added time_embedding_mix to StableDiffusionControlNetXSPipeline .from_pretrained / StableDiffusionXLControlNetXSPipeline.from_pretrained
- fixed copy hints

* checkin May 11 '23

* CheckIn Mar 12 '24

* Fixed tests for SD

* Added tests for UNetControlNetXSModel

* Fixed SDXL tests

* cleanup

* Delete Pipfile

* CheckIn Mar 20

Started replacing sub blocks by `ControlNetXSCrossAttnDownBlock2D` and `ControlNetXSCrossAttnUpBlock2D`

* check-in Mar 23

* checkin 24 Mar

* Created init for UNetCnxs and CnxsAddon

* CheckIn

* Made from_modules, from_unet and no_control work

* make style,quality,fix-copies & small changes

* Fixed freezing

* Added gradient ckpt'ing; fixed tests

* Fix slow tests(+compile) ; clear naming confusion

* Don't create UNet in init ; removed class_emb

* Incorporated review feedback

- Deleted get_base_pipeline /  get_controlnet_addon for pipes
- Pipes inherit from StableDiffusionXLPipeline
- Made module dicts for cnxs-addon's down/mid/up classes
- Added support for qkv fusion and freeU

* Make style, quality, fix-copies

* Implemented review feedback

* Removed compatibility check for vae/ctrl embedding

* make style, quality, fix-copies

* Delete Pipfile

* Integrated review feedback

- Importing ControlNetConditioningEmbedding now
- get_down/mid/up_block_addon now outside class
- renamed `do_control` to `apply_control`

* Reduced size of test tensors

For this, added `norm_num_groups` as parameter everywhere

* Renamed cnxs-`Addon` to cnxs-`Adapter`

- `ControlNetXSAddon` -> `ControlNetXSAdapter`
- `ControlNetXSAddonDownBlockComponents` -> `DownBlockControlNetXSAdapter`, and similarly for mid/up
- `get_mid_block_addon` -> `get_mid_block_adapter`, and similarly for down/up

* Fixed save_pretrained/from_pretrained bug

* Removed redundant code

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

fda1531d

12 Apr, 2024 1 commit
- FIX Setting device for DoRA parameters (#7655) · 2523390c
  Benjamin Bossan authored Apr 12, 2024
```
Fix a bug that causes the call to set_lora_device to ignore the DoRA
parameters.
```
  2523390c
10 Apr, 2024 2 commits

[Tests] reduce the model sizes in the SD fast tests (#7580) · b2323aa2

Sayak Paul authored Apr 11, 2024

* give it a shot.

* print.

* correct assertion.

* gather results from the rest of the tests.

* change the assertion values where needed.

* remove print statements.

b2323aa2

[Core] add "balanced" `device_map` support to pipelines (#6857) · 3e4a6bd2

Sayak Paul authored Apr 10, 2024



* get device <-> component mapping when using multiple gpus.

* condition the device_map bits.

* relax condition

* device_map progress.

* device_map enhancement

* some cleaning up and debugging

* Apply suggestions from code review
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* incorporate suggestions from PR.

* remove multi-gpu condition for now.

* guard check the component -> device mapping

* fix: device_memory variable

* dispatching transformers model to have force_hooks=True

* better guarding for transformers device_map

* introduce support for balanced_low_memory and balanced_ultra_low_memory.

* remove device_map patch.

* fix: intermediate variable scoping.

* fix: condition in cpu offload.

* fix: flax class restrictions.

* remove modifications from cpu_offload and model_offload

* incorporate changes.

* add a simple forward pass test

* add: torch_device in get_inputs()

* add: tests

* remove print

* safe-guard to(), model offloading and cpu offloading when balanced is used as a device_map.

* style

* remove .

* safeguard device_map with more checks and remove invalid device_mapping strategies.

* make  a class attribute and adjust tests accordingly.

* fix device_map check

* fix test

* adjust comment

* fix: device_map attribute

* fix: dispatching.

* max_memory test for pipeline

* version guard the tests

* fix guard.

* address review feedback.

* reset_device_map method.

* add: test for reset_hf_device_map

* fix a couple things.

* add reset_device_map() in the error message.

* add tests for checking reset_device_map doesn't have unintended consequences.

* fix reset_device_map and offloading tests.

* create _get_final_device_map utility.

* hf_device_map -> _hf_device_map

* add documentation

* add notes suggested by Marc.

* styling.

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* move updates within gpu condition.

* other docs related things

* note on ignore a device not specified in .

* provide a suggestion if device mapping errors out.

* fix: typo.

* _hf_device_map -> hf_device_map

* Empty-Commit

* add: example hf_device_map.

---------
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

3e4a6bd2
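
To make the idea of a "balanced" placement concrete, here is a toy sketch that spreads pipeline components across devices under a per-device memory budget. It is not the logic added by this commit: the function `balanced_device_map`, the greedy largest-first strategy, and the component sizes are all assumptions made for illustration. Only the terms `device_map` and `max_memory` come from the commit messages above.

```python
def balanced_device_map(component_sizes, max_memory):
    """Greedily place each component on the device with the most free memory.

    component_sizes: dict of component name -> size (any consistent unit).
    max_memory: dict of device id -> memory budget in the same unit.
    """
    free = dict(max_memory)
    device_map = {}

    # Place the largest components first so they get the emptiest devices.
    ordered = sorted(component_sizes.items(), key=lambda item: item[1], reverse=True)
    for name, size in ordered:
        device = max(free, key=free.get)
        if size > free[device]:
            raise ValueError(f"{name} (size {size}) does not fit on any device")
        device_map[name] = device
        free[device] -= size

    return device_map


# Hypothetical component sizes in GiB and two devices with 6 GiB each.
sizes = {"unet": 5.0, "text_encoder": 1.5, "vae": 0.5}
device_map = balanced_device_map(sizes, {0: 6.0, 1: 6.0})
```

With these toy numbers the largest component gets device 0 to itself and the two smaller ones share device 1.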

09 Apr, 2024 2 commits

Multi-image masking for single IP Adapter (#7499) · a0cf6076

Fabio Rigano authored Apr 09, 2024



* Support multiimage masking

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

a0cf6076

disable test_conversion_when_using_device_map (#7620) · a341b536
YiYi Xu authored Apr 09, 2024
```
* disable test

* update

---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
```
a341b536

08 Apr, 2024 1 commit

Add AudioLDM2 TTS (#5381) · 56a76082

Nguyễn Công Tú Anh authored Apr 08, 2024



* add audioldm2 tts

* change gpt2 max new tokens

* remove unnecessary pipeline and class

* add TTS to AudioLDM2Pipeline

* add TTS docs

* delete unnecessary file

* remove unnecessary import

* add audioldm2 slow testcase

* fix code quality

* remove AudioLDMLearnablePositionalEmbedding

* add variable check vits encoder

* add use_learned_position_embedding

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

56a76082

05 Apr, 2024 1 commit

[Tests] reduce block sizes of UNet and VAE tests (#7560) · 1c60e094

Sayak Paul authored Apr 05, 2024

* reduce block sizes for unet1d.

* reduce blocks for unet_2d.

* reduce block size for unet_motion

* increase channels.

* correctly increase channels.

* reduce number of layers in unet2dconditionmodel tests.

* reduce block sizes for unet2dconditionmodel tests

* reduce block sizes for unet3dconditionmodel.

* fix: test_feed_forward_chunking

* fix: test_forward_with_norm_groups

* skip spatiotemporal tests on MPS.

* reduce block size in AutoencoderKL.

* reduce block sizes for vqmodel.

* further reduce block size.

* make style.

* Empty-Commit

* reduce sizes for ConsistencyDecoderVAETests

* further reduction.

* further block reductions in AutoencoderKL and AsymmetricAutoencoderKL.

* massively reduce the block size in unet2dconditionmodel.

* reduce sizes for unet3d

* fix tests in unet3d.

* reduce blocks further in motion unet.

* fix: output shape

* add attention_head_dim to the test configuration.

* remove unexpected keyword arg

* up a bit.

* groups.

* up again

* fix

1c60e094

04 Apr, 2024 1 commit

Skip `test_freeu_enabled` on MPS (#7570) · 71f49a5d

UmerHA authored Apr 04, 2024

* Skip `test_freeu_enabled` on MPS

* Small fixes

- import skip_mps correctly
- disable all instances of test_freeu_enabled

* Empty commit to trigger tests

* Empty commit to trigger CI

71f49a5d

03 Apr, 2024 2 commits

Update pipeline_animatediff_video2video.py (#7457) · 35db2fde

Abhinav Gopal authored Apr 03, 2024

* Update pipeline_animatediff_video2video.py

* commit with test for whether latent input can be passed into animatediffvid2vid

35db2fde

UniPC Multistep add `rescale_betas_zero_snr` (#7531) · aa190259

Beinsezii authored Apr 02, 2024

* UniPC Multistep add `rescale_betas_zero_snr`

Same patch as DPM and Euler with the patched final alpha cumprod

BF16 doesn't seem to break down, I think because UniPC already upcasts during some
phases? We could still force an upcast since it only
loses ≈ 0.005 it/s for me, but the difference in output is very small. A
better endeavor might be upcasting in step() and removing all the other
upcasts elsewhere?

* UniPC ZSNR UT

* Re-add `rescale_betas_zsnr` doc oops

aa190259
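
For readers unfamiliar with the `rescale_betas_zero_snr` option, the following is a minimal plain-Python sketch of zero-terminal-SNR beta rescaling. It is an illustration under assumptions, not the code from this commit: the function name `rescale_zero_terminal_snr` and the toy linear schedule are hypothetical, and the "patched final alpha cumprod" mentioned in the commit message is not reproduced here.

```python
import math


def rescale_zero_terminal_snr(betas):
    """Rescale betas so the final cumulative alpha product is exactly zero."""
    # Cumulative product of alpha_t = 1 - beta_t.
    alphas_cumprod = []
    running = 1.0
    for beta in betas:
        running *= 1.0 - beta
        alphas_cumprod.append(running)

    sqrt_ac = [math.sqrt(a) for a in alphas_cumprod]
    first, last = sqrt_ac[0], sqrt_ac[-1]

    # Shift so the final value becomes zero, then scale so the first is unchanged.
    sqrt_ac = [(s - last) * first / (first - last) for s in sqrt_ac]

    # Square, undo the cumulative product, and convert alphas back to betas.
    ac = [s * s for s in sqrt_ac]
    alphas = [ac[0]] + [ac[i] / ac[i - 1] for i in range(1, len(ac))]
    return [1.0 - a for a in alphas]


# Toy linear schedule with 10 steps (hypothetical values).
toy_betas = [1e-4 + i * (0.02 - 1e-4) / 9 for i in range(10)]
rescaled = rescale_zero_terminal_snr(toy_betas)
```

After rescaling, the last beta is 1, so the final cumulative alpha product is 0 and the signal-to-noise ratio at the last timestep is zero. A quantity such as `sqrt((1 - ac) / ac)` is then undefined at that step, which is presumably why a patched final value is needed in practice.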