01 Jul, 2025 2 commits
  30 Jun, 2025 3 commits
  28 Jun, 2025 1 commit
  27 Jun, 2025 2 commits
  26 Jun, 2025 6 commits
  25 Jun, 2025 1 commit
  24 Jun, 2025 5 commits
  23 Jun, 2025 2 commits
  21 Jun, 2025 1 commit
  20 Jun, 2025 2 commits
  19 Jun, 2025 6 commits
  18 Jun, 2025 3 commits
  17 Jun, 2025 1 commit
  16 Jun, 2025 2 commits
  14 Jun, 2025 1 commit
    • Chroma Pipeline (#11698) · 8adc6003
      Edna authored
      
      
      * working state from hameerabbasi and iddl
      
      * working state from hameerabbasi and iddl (transformer)
      
      * working state (normalization)
      
      * working state (embeddings)
      
      * add chroma loader
      
      * add chroma to mappings
      
      * add chroma to transformer init
      
      * take out variant stuff
      
      * get decently far in changing variant stuff
      
      * add chroma init
      
      * make chroma output class
      
      * add chroma transformer to dummy tp
      
      * add chroma to init
      
      * add chroma to init
      
      * fix single file
      
      * update
      
      * update
      
      * add chroma to auto pipeline
      
      * add chroma to pipeline init
      
      * change to chroma transformer
      
      * take out variant from blocks
      
      * swap embedder location
      
      * remove prompt_2
      
      * work on swapping text encoders
      
      * remove mask function
      
      * don't modify mask (for now)
      
      * wrap attn mask
      
      * no attn mask (can't get it to work)
      
      * remove pooled prompt embeds
      
      * change to my own unpooled embedder
      
      * fix load
      
      * take pooled projections out of transformer
      
      * ensure correct dtype for chroma embeddings
      
      * update
      
      * use dn6 attn mask + fix true_cfg_scale
      
      * use chroma pipeline output
      
      * use DN6 embeddings
      
      * remove guidance
      
      * remove guidance embed (pipeline)
      
      * remove guidance from embeddings
      
      * don't return length
      
      * don't change dtype
      
      * remove unused stuff, fix up docs
      
      * add chroma autodoc
      
      * add .md (oops)
      
      * initial chroma docs
      
      * undo don't change dtype
      
      * undo arxiv change
      
      unsure why that happened
      
      * fix hf papers regression in more places
      
      * Update docs/source/en/api/pipelines/chroma.md
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      
      * do_cfg -> self.do_classifier_free_guidance
      
      * Update docs/source/en/api/models/chroma_transformer.md
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      
      * Update chroma.md
      
      * Move chroma layers into transformer
      
      * Remove pruned AdaLayerNorms
      
      * Add chroma fast tests
      
      * (untested) batch cond and uncond
      
      * Add # Copied from for shift
      
      * Update # Copied from statements
      
      * update norm imports
      
      * Revert cond + uncond batching
      
      * Add transformer tests
      
      * move chroma test (oops)
      
      * chroma init
      
      * fix chroma pipeline fast tests
      
      * Update src/diffusers/models/transformers/transformer_chroma.py
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      
      * Move Approximator and Embeddings
      
      * Fix auto pipeline + make style, quality
      
      * make style
      
      * Apply style fixes
      
      * switch to new input ids
      
      * fix # Copied from error
      
      * remove # Copied from on protected members
      
      * try to fix import
      
      * fix import
      
      * make fix-copies
      
      * revert style fix
      
      * update chroma transformer params
      
      * update chroma transformer approximator init params
      
      * update to pad tokens
      
      * fix batch inference
      
      * Make more pipeline tests work
      
      * Make most transformer tests work
      
      * fix docs
      
      * make style, make quality
      
      * skip batch tests
      
      * fix test skipping
      
      * fix test skipping again
      
      * fix for tests
      
      * Fix all pipeline tests
      
      * update
      
      * push local changes, fix docs
      
      * add encoder test, remove pooled dim
      
      * default proj dim
      
      * fix tests
      
      * fix equal size list input
      
      * Revert "fix equal size list input"
      
      This reverts commit 3fe4ad67d58d83715bc238f8654f5e90bfc5653c.
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      ---------
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
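
      A minimal usage sketch of the pipeline this PR adds (the checkpoint id,
      prompt, and generation kwargs below are assumptions, not taken from the
      PR; true_cfg_scale is named in the commit list above):

          # Hypothetical ChromaPipeline usage; checkpoint id and kwargs are assumed.
          import torch
          from diffusers import ChromaPipeline

          pipe = ChromaPipeline.from_pretrained(
              "lodestones/Chroma",  # assumed checkpoint id
              torch_dtype=torch.bfloat16,
          )
          pipe.to("cuda")

          # Per the commits above, Chroma drops pooled prompt embeds and the
          # guidance embed, so the call is a prompt plus the usual sampling knobs.
          image = pipe(
              "a photo of a corgi wearing a spacesuit",
              num_inference_steps=28,
              true_cfg_scale=4.0,
          ).images[0]
          image.save("chroma.png")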
  13 Jun, 2025 2 commits
    • Cosmos Predict2 (#11695) · 9f91305f
      Aryan authored
      * support text-to-image
      
      * update example
      
      * make fix-copies
      
      * support use_flow_sigmas in the EDM scheduler instead of maintaining a Cosmos-specific scheduler
      
      * support video-to-world
      
      * update
      
      * rename text2image pipeline
      
      * make fix-copies
      
      * add t2i test
      
      * add test for v2w pipeline
      
      * support edm dpmsolver multistep
      
      * update
      
      * update
      
      * update
      
      * update tests
      
      * fix tests
      
      * safety checker
      
      * make conversion script work without guardrail
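
      One way to read the scheduler change above: rather than maintaining a
      Cosmos-specific scheduler, the flow-matching sigmas are switched on for
      the stock multistep scheduler. A hedged sketch, assuming `pipe` is an
      already-loaded Cosmos Predict2 pipeline:

          # Swap in the multistep scheduler with flow sigmas enabled, as the
          # commit list describes; `pipe` is an assumed, already-loaded pipeline.
          from diffusers import DPMSolverMultistepScheduler

          pipe.scheduler = DPMSolverMultistepScheduler.from_config(
              pipe.scheduler.config,
              use_flow_sigmas=True,  # flag named in the commit list
          )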
    • [LoRA] parse metadata from LoRA and save metadata (#11324) · 368958df
      Sayak Paul authored
      
      
      * feat: parse metadata from lora state dicts.
      
      * tests
      
      * fix tests
      
      * key renaming
      
      * fix
      
      * smol update
      
      * smol updates
      
      * load metadata.
      
      * automatically save metadata in save_lora_adapter.
      
      * propagate changes.
      
      * changes
      
      * add test to models too.
      
      * tighter tests.
      
      * updates
      
      * fixes
      
      * rename tests.
      
      * sorted.
      
      * Update src/diffusers/loaders/lora_base.py
      Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
      
      * review suggestions.
      
      * removeprefix.
      
      * propagate changes.
      
      * fix-copies
      
      * sd
      
      * docs.
      
      * fixes
      
      * get review ready.
      
      * one more test to catch error.
      
      * change to a different approach.
      
      * fix-copies.
      
      * todo
      
      * sd3
      
      * update
      
      * revert changes in get_peft_kwargs.
      
      * update
      
      * fixes
      
      * fixes
      
      * simplify _load_sft_state_dict_metadata
      
      * update
      
      * style fix
      
      * update
      
      * update
      
      * update
      
      * empty commit
      
      * _pack_dict_with_prefix
      
      * update
      
      * TODO 1.
      
      * todo: 2.
      
      * todo: 3.
      
      * update
      
      * update
      
      * Apply suggestions from code review
      Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
      
      * reraise.
      
      * move argument.
      
      ---------
      Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>
      Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>
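
      The metadata this PR saves travels in the safetensors header, so it can
      be inspected without loading any weights. A hedged sketch (the file name
      and the metadata key are assumptions, not taken from the PR):

          # Read LoRA adapter metadata straight out of a safetensors header.
          import json
          from safetensors import safe_open

          with safe_open("pytorch_lora_weights.safetensors", framework="pt") as f:
              header_metadata = f.metadata() or {}

          # The PR stores the adapter config as a JSON string in the header;
          # "lora_adapter_metadata" is an assumed key name.
          adapter_config = json.loads(header_metadata.get("lora_adapter_metadata", "{}"))
          print(adapter_config)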