Commits · a0acbdc989dc957338f63f45123fe54f78855368 · renzhc / diffusers_dcu

08 Jan, 2025 2 commits

Add AuraFlow GGUF support (#10463) · cb342b74

AstraliteHeart authored Jan 07, 2025

* Add support for loading AuraFlow models from GGUF

https://huggingface.co/city96/AuraFlow-v0.3-gguf



* Update AuraFlow documentation for GGUF, add GGUF tests and model detection.

* Address code review comments.

* Remove unused config.

---------
Co-authored-by: hlky <hlky@hlky.ac>

cb342b74

Add `_no_split_modules` to some models (#10308) · 71ad16b4

Aryan authored Jan 08, 2025



* set supports gradient checkpointing to true where necessary; add missing no split modules

* fix cogvideox tests

* update

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

71ad16b4

07 Jan, 2025 1 commit

[LoRA] Support original format loras for HunyuanVideo (#10376) · 811560b1

Aryan authored Jan 07, 2025



* update

* fix make copies

* update

* add relevant markers to the integration test suite.

* add copied.

* fox-copies

* temporarily add print.

* directly place on CUDA as CPU isn't that big on the CIO.

* fixes to fuse_lora, aryan was right.

* fixes

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

811560b1

06 Jan, 2025 3 commits

Add torch_xla and from_single_file to instruct-pix2pix (#10444) · 8f2253c5

hlky authored Jan 06, 2025



* Add torch_xla and from_single_file to instruct-pix2pix

* StableDiffusionInstructPix2PixPipelineSingleFileSlowTests

* StableDiffusionInstructPix2PixPipelineSingleFileSlowTests

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

8f2253c5

[LoRA] fix: lora unloading when using expanded Flux LoRAs. (#10397) · d9d94e12

Sayak Paul authored Jan 07, 2025



* fix: lora unloading when using expanded Flux LoRAs.

* fix argument name.
Co-authored-by: a-r-r-o-w <contact.aryanvs@gmail.com>

* docs.

---------
Co-authored-by: a-r-r-o-w <contact.aryanvs@gmail.com>

d9d94e12

[Tests] add slow and nightly markers to sd3 lora integation. (#10458) · b5726358
Sayak Paul authored Jan 06, 2025
```
add slow and nightly markers to sd3 lora integation.
```
b5726358

02 Jan, 2025 2 commits

IP-Adapter support for `StableDiffusion3ControlNetPipeline` (#10363) · 68bd6934

Daniel Regado authored Jan 02, 2025



* IP-Adapter support for `StableDiffusion3ControlNetPipeline`

* Update src/diffusers/pipelines/controlnet_sd3/pipeline_stable_diffusion_3_controlnet.py
Co-authored-by: hlky <hlky@hlky.ac>

---------
Co-authored-by: hlky <hlky@hlky.ac>

68bd6934

Fix Flux multiple Lora loading bug (#10388) · 44640c83

maxs-kan authored Jan 02, 2025



* check for base_layer key in transformer state dict

* test_lora_expansion_works_for_absent_keys

* check

* Update tests/lora/test_lora_layers_flux.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* check

* test_lora_expansion_works_for_absent_keys/test_lora_expansion_works_for_extra_keys

* absent->extra

---------
Co-authored-by: hlky <hlky@hlky.ac>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

44640c83

25 Dec, 2024 2 commits

[LoRA] feat: support `unload_lora_weights()` for Flux Control. (#10206) · 1b202c57
Sayak Paul authored Dec 25, 2024
```
* feat: support unload_lora_weights() for Flux Control.

* tighten test

* minor

* updates

* meta device fixes.
```
1b202c57

Aryan authored Dec 25, 2024

* Revert "Add support for sharded models when TorchAO quantization is enabled (#10256)"

This reverts commit 41ba8c0b

.

* update tests

* udpate

* update

* update

* update device map tests

* apply review suggestions

* update

* make style

* fix

* update docs

* update tests

* update workflow

* update

* improve tests

* allclose tolerance

* Update src/diffusers/models/modeling_utils.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update tests/quantization/torchao/test_torchao.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* improve tests

* fix

* update correct slices

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

cd991d1e

24 Dec, 2024 1 commit
- [tests] fix `AssertionError: Torch not compiled with CUDA enabled` (#10356) · 023b0e0d
  Fanli Lin authored Dec 24, 2024
```
fix bug on xpu
```
  023b0e0d
23 Dec, 2024 8 commits

[core] LTX Video 0.9.1 (#10330) · 4b557132

Aryan authored Dec 23, 2024

* update

* make style

* update

* update

* update

* make style

* single file related changes

* update

* fix

* update single file urls and docs

* update

* fix

4b557132

[Tests] Fix more tests sayak (#10359) · 851dfa30
Sayak Paul authored Dec 23, 2024
```
* fixes to tests

* fixture

* fixes
```
851dfa30
[LoRA] test fix (#10351) · ea1ba0ba
Sayak Paul authored Dec 23, 2024
```
updates
```
ea1ba0ba
[Tests] QoL improvements to the LoRA test suite (#10304) · c34fc345
Sayak Paul authored Dec 23, 2024
```
* misc lora test improvements.

* updates

* fixes to tests
```
c34fc345

[SANA LoRA] sana lora training tests and misc. (#10296) · 76e2727b

Sayak Paul authored Dec 23, 2024



* sana lora training tests and misc.

* remove push to hub

* Update examples/dreambooth/train_dreambooth_lora_sana.py
Co-authored-by: Aryan <aryan@huggingface.co>

---------
Co-authored-by: Aryan <aryan@huggingface.co>

76e2727b

[tests] Refactor TorchAO serialization fast tests (#10271) · 02c777c0
Aryan authored Dec 23, 2024
```
refactor
```
02c777c0
Bump minimum TorchAO version to 0.7.0 (#10293) · ffc0eaab
Aryan authored Dec 23, 2024
```
* bump min torchao version to 0.7.0

* update
```
ffc0eaab

[Sana bug] bug fix for 2K model config (#10340) · b58868e6

Junsong Chen authored Dec 23, 2024



* fix the Positinoal Embedding bug in 2K model;

* Change the default model to the BF16 one for more stable training and output

* make style

* substract buffer size

* add compute_module_persistent_sizes

---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>

b58868e6

21 Dec, 2024 2 commits

Support Flux IP Adapter (#10261) · be207099

hlky authored Dec 21, 2024



* Flux IP-Adapter

* test cfg

* make style

* temp remove copied from

* fix test

* fix test

* v2

* fix

* make style

* temp remove copied from

* Apply suggestions from code review
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Move encoder_hid_proj to inside FluxTransformer2DModel

* merge

* separate encode_prompt, add copied from, image_encoder offload

* make

* fix test

* fix

* Update src/diffusers/pipelines/flux/pipeline_flux.py

* test_flux_prompt_embeds change not needed

* true_cfg -> true_cfg_scale

* fix merge conflict

* test_flux_ip_adapter_inference

* add fast test

* FluxIPAdapterMixin not test mixin

* Update pipeline_flux.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

be207099

Fix EMAModel test_from_pretrained (#10325) · bf9a641f
hlky authored Dec 21, 2024

bf9a641f

20 Dec, 2024 5 commits

[Tests] add integration tests for lora expansion stuff in Flux. (#10318) · bf6eaa8a
Sayak Paul authored Dec 20, 2024
```
add integration tests for lora expansion stuff in Flux.
```
bf6eaa8a

[LoRA] feat: support loading regular Flux LoRAs into Flux Control, and Fill (#10259) · 17128c42

Sayak Paul authored Dec 20, 2024



* lora expansion with dummy zeros.

* updates

* fix working 🥳

* working.

* use torch.device meta for state dict expansion.

* tests
Co-authored-by: a-r-r-o-w <contact.aryanvs@gmail.com>

* fixes

* fixes

* switch to debug

* fix

* Apply suggestions from code review
Co-authored-by: Aryan <aryan@huggingface.co>

* fix stuff

* docs

---------
Co-authored-by: a-r-r-o-w <contact.aryanvs@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>

17128c42

Add support for sharded models when TorchAO quantization is enabled (#10256) · 41ba8c0b
Aryan authored Dec 20, 2024
```
* add sharded + device_map check
```
41ba8c0b

[WIP] SD3.5 IP-Adapter Pipeline Integration (#9987) · 31912484

Daniel Regado authored Dec 20, 2024




* Added support for single IPAdapter on SD3.5 pipeline



---------
Co-authored-by: hlky <hlky@hlky.ac>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

31912484

Enable Gradient Checkpointing for UNet2DModel (New) (#7201) · 648d968c

dg845 authored Dec 19, 2024



* Port UNet2DModel gradient checkpointing code from #6718.


---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Vincent Neemie <92559302+VincentNeemie@users.noreply.github.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: hlky <hlky@hlky.ac>

648d968c

19 Dec, 2024 5 commits

unet's `sample_size` attribute is to accept tuple(h, w) in `StableDiffusionPipeline` (#10181) · b756ec6e
djm authored Dec 20, 2024

b756ec6e
Fix failing lora tests after HunyuanVideo lora (#10307) · d8825e76
Aryan authored Dec 20, 2024
```
fix
```
d8825e76

[LoRA] Support HunyuanVideo (#10254) · 1826a1e7

Shenghai Yuan authored Dec 19, 2024



* 1217

* 1217

* 1217

* update

* reverse

* add test

* update test

* make style

* update

* make style

---------
Co-authored-by: Aryan <aryan@huggingface.co>

1826a1e7

Check correct model type is passed to `from_pretrained` (#10189) · 0ed09a17

hlky authored Dec 19, 2024



* Check correct model type is passed to `from_pretrained`

* Flax, skip scheduler

* test_wrong_model

* Fix for scheduler

* Update tests/pipelines/test_pipelines.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* EnumMeta

* Flax

* scheduler in expected types

* make

* type object 'CLIPTokenizer' has no attribute '_PipelineFastTests__name'

* support union

* fix typing in kandinsky

* make

* add LCMScheduler

* 'LCMScheduler' object has no attribute 'sigmas'

* tests for wrong scheduler

* make

* update

* warning

* tests

* Update src/diffusers/pipelines/pipeline_utils.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* import FlaxSchedulerMixin

* skip scheduler

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

0ed09a17

Hunyuan VAE tiling fixes and transformer docs (#10295) · f781b8c3
Aryan authored Dec 19, 2024
```
* update

* udpate

* fix test
```
f781b8c3

18 Dec, 2024 4 commits

[tests] remove nullop import checks from lora tests (#10273) · f35a3872
Aryan authored Dec 19, 2024
```
remove nullop imports
```
f35a3872
Rename Mochi integration test correctly (#10220) · f66bd326
Aryan authored Dec 18, 2024
```
rename integration test
```
f66bd326

Flux Control(Depth/Canny) + Inpaint (#10192) · 83709d5a

Andrés Romero authored Dec 18, 2024



* flux_control_inpaint - failing test_flux_different_prompts

* removing test_flux_different_prompts?

* fix style

* fix from PR comments

* fix style

* reducing guidance_scale in demo

* Update src/diffusers/pipelines/flux/pipeline_flux_control_inpaint.py
Co-authored-by: hlky <hlky@hlky.ac>

* make

* prepare_latents is not copied from

* update docs

* typos

---------
Co-authored-by: affromero <ubuntu@ip-172-31-17-146.ec2.internal>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: hlky <hlky@hlky.ac>

83709d5a

[LoRA] feat: lora support for SANA. (#10234) · 9408aa2d

Sayak Paul authored Dec 18, 2024



* feat: lora support for SANA.

* make fix-copies

* rename test class.

* attention_kwargs -> cross_attention_kwargs.

* Revert "attention_kwargs -> cross_attention_kwargs."

This reverts commit 23433bf9bccc12e0f2f55df26bae58a894e8b43b.

* exhaust 119 max line limit

* sana lora fine-tuning script.

* readme

* add a note about the supported models.

* Apply suggestions from code review
Co-authored-by: Aryan <aryan@huggingface.co>

* style

* docs for attention_kwargs.

* remove lora_scale from pag pipeline.

* copy fix

---------
Co-authored-by: Aryan <aryan@huggingface.co>

9408aa2d

17 Dec, 2024 3 commits

[tests] Remove/rename unsupported quantization torchao type (#10263) · 1524781b
Aryan authored Dec 17, 2024
```
update
```
1524781b

[Single File] Add GGUF support (#9964) · e24941b2

Dhruv Nair authored Dec 17, 2024



* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* Update src/diffusers/quantizers/gguf/utils.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* Update docs/source/en/quantization/gguf.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* update

* update

* update

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

e24941b2

[LoRA] Support LTX Video (#10228) · ac863934

Aryan authored Dec 17, 2024



* add lora support for ltx

* add tests

* fix copied from comments

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

ac863934

16 Dec, 2024 2 commits

[core] TorchAO Quantizer (#10009) · 9f00c617

Aryan authored Dec 17, 2024



* torchao quantizer


---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

9f00c617

[core] Hunyuan Video (#10136) · aace1f41

Aryan authored Dec 16, 2024



* copy transformer

* copy vae

* copy pipeline

* make fix-copies

* refactor; make original code work with diffusers; test latents for comparison generated with this commit

* move rope into pipeline; remove flash attention; refactor

* begin conversion script

* make style

* refactor attention

* refactor

* refactor final layer

* their mlp -> our feedforward

* make style

* add docs

* refactor layer names

* refactor modulation

* cleanup

* refactor norms

* refactor activations

* refactor single blocks attention

* refactor attention processor

* make style

* cleanup a bit

* refactor double transformer block attention

* update mochi attn proc

* use diffusers attention implementation in all modules; checkpoint for all values matching original

* remove helper functions in vae

* refactor upsample

* refactor causal conv

* refactor resnet

* refactor

* refactor

* refactor

* grad checkpointing

* autoencoder test

* fix scaling factor

* refactor clip

* refactor llama text encoding

* add coauthor
Co-Authored-By: "Gregory D. Hunkins" <greg@ollano.com>

* refactor rope; diff: 0.14990234375; reason and fix: create rope grid on cpu and move to device

Note: The following line diverges from original behaviour. We create the grid on the device, whereas
original implementation creates it on CPU and then moves it to device. This results in numerical
differences in layerwise debugging outputs, but visually it is the same.

* use diffusers timesteps embedding; diff: 0.10205078125

* rename

* convert

* update

* add tests for transformer

* add pipeline tests; text encoder 2 is not optional

* fix attention implementation for torch

* add example

* update docs

* update docs

* apply suggestions from review

* refactor vae

* update

* Apply suggestions from code review
Co-authored-by: hlky <hlky@hlky.ac>

* Update src/diffusers/pipelines/hunyuan_video/pipeline_hunyuan_video.py
Co-authored-by: hlky <hlky@hlky.ac>

* Update src/diffusers/pipelines/hunyuan_video/pipeline_hunyuan_video.py
Co-authored-by: hlky <hlky@hlky.ac>

* make fix-copies

* update

---------
Co-authored-by: "Gregory D. Hunkins" <greg@ollano.com>
Co-authored-by: hlky <hlky@hlky.ac>

aace1f41