  1. 04 Dec, 2025 4 commits
  2. 03 Dec, 2025 8 commits
  3. 02 Dec, 2025 3 commits
    • Fix TPU (torch_xla) compatibility error with the tensor repeat function on empty dims (#12770) · 9379b239
      Jerry Wu authored
      
      
      * Refactor image padding logic to prevent zero-size tensors in transformer_z_image.py
      
      * Apply style fixes
      
      * Add more support to fix repeat bug on tpu devices.
      
      * Fix for dynamo compile error for multi if-branches.
      
      ---------
      Co-authored-by: Mingjia Li <mingjiali@tju.edu.cn>
      Co-authored-by: Mingjia Li <mail@mingjia.li>
      Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
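The repeat fix above guards against a backend quirk: torch_xla can fail when `repeat` is invoked on a zero-size dimension, so the empty case is short-circuited instead of delegated. A minimal pure-Python sketch of the guard (the helper name `safe_repeat` is illustrative, not the diffusers implementation):

```python
def safe_repeat(values: list, times: int) -> list:
    """Repeat a sequence `times` times along its first axis.

    Backends such as torch_xla can fail (or produce wrong shapes) when
    asked to repeat a zero-size dimension, so the empty input is handled
    explicitly instead of being passed to the repeat operation.
    """
    if times < 0:
        raise ValueError("times must be non-negative")
    if not values:  # empty dim: return an empty result without repeating
        return []
    return values * times

# Non-empty sequences repeat as usual; the empty case is short-circuited.
assert safe_repeat([1, 2], 3) == [1, 2, 1, 2, 1, 2]
assert safe_repeat([], 5) == []
```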
    • Add support for Ovis-Image (#12740) · 4f136f84
      Guo-Hua Wang authored
      
      
      * add ovis_image
      
      * fix code quality
      
      * optimize pipeline_ovis_image.py according to the feedback
      
      * optimize imports
      
      * add docs
      
      * make style
      
      * make style
      
      * add ovis to toctree
      
      * oops
      
      ---------
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
    • Add ZImage LoRA support and integrate into ZImagePipeline (#12750) · edf36f51
      CalamitousFelicitousness authored
      
      
      * Add ZImage LoRA support and integrate into ZImagePipeline
      
      * Add LoRA test for Z-Image
      
      * Move the LoRA test
      
      * Fix ZImage LoRA scale support and test configuration
      
      * Add ZImage LoRA test overrides for architecture differences
      
      - Override test_lora_fuse_nan to use ZImage's 'layers' attribute
        instead of 'transformer_blocks'
      - Skip block-level LoRA scaling test (not supported in ZImage)
      - Add required imports: numpy, torch_device, check_if_lora_correctly_set
      
      * Add ZImageLoraLoaderMixin to LoRA documentation
      
      * Use conditional import for peft.LoraConfig in ZImage tests
      
      * Override test_correct_lora_configs_with_different_ranks for ZImage
      
      ZImage uses 'attention.to_k' naming convention instead of 'attn.to_k',
      so the base test's module name search loop never finds a match. This
      override uses the correct naming pattern for ZImage architecture.
      
      * Add is_flaky decorator to ZImage LoRA tests that initialise padding tokens
      
      * Skip ZImage LoRA test class entirely
      
      Skip the entire ZImageLoRATests class due to non-deterministic behavior
      from complex64 RoPE operations and torch.empty padding tokens.
      LoRA functionality works correctly with real models.
      
      Clean up removed:
      - Individual @unittest.skip decorators
      - @is_flaky decorator overrides for inherited methods
      - Custom test method overrides
      - Global torch deterministic settings
      - Unused imports (numpy, is_flaky, check_if_lora_correctly_set)
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>
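The "conditional import for peft.LoraConfig" commit above follows a common guarded-import pattern: only reference the optional dependency when it is installed, so the test module can still be collected without it. A minimal sketch (the helper `make_lora_config` and the `PEFT_AVAILABLE` flag are illustrative names, not diffusers API):

```python
import importlib.util

# Guarded import: only touch peft when it is actually installed, so this
# module imports cleanly in environments without peft.
if importlib.util.find_spec("peft") is not None:
    from peft import LoraConfig
    PEFT_AVAILABLE = True
else:
    LoraConfig = None
    PEFT_AVAILABLE = False


def make_lora_config(rank: int = 4):
    """Build a LoraConfig when peft is present; otherwise return None."""
    if not PEFT_AVAILABLE:
        return None
    return LoraConfig(r=rank)
```

Tests built on this pattern typically pair it with a skip decorator so LoRA cases are skipped, rather than erroring, when peft is missing.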
  4. 01 Dec, 2025 8 commits
  5. 29 Nov, 2025 1 commit
  6. 28 Nov, 2025 2 commits
  7. 27 Nov, 2025 2 commits
  8. 26 Nov, 2025 6 commits
    • Support unit tests for Z-Image (#12715) · e6d46123
      Jerry Wu authored
      
      
      * Add Support for Z-Image.
      
      * Reformatting with make style, black & isort.
      
      * Remove init, Modify import utils, Merge forward in transformers block, Remove once func in pipeline.
      
      * modified main model forward, freqs_cis left
      
      * refactored to add B dim
      
      * fixed stack issue
      
      * fixed modulation bug
      
      * fixed modulation bug
      
      * fix bug
      
      * remove value_from_time_aware_config
      
      * styling
      
      * Fix neg embed and divide (/) bug; Reuse zero padding tensor; Turn cat -> repeat; Add hint for attn processor.
      
      * Replace padding with pad_sequence; Add gradient checkpointing.
      
      * Fix flash_attn3 in the attention backend dispatch via _flash_attn_forward, replacing its original implementation; add docstring in pipeline for that.
      
      * Fix Docstring and Make Style.
      
      * Revert "Fix flash_attn3 in the attention backend dispatch via _flash_attn_forward, replacing its original implementation; add docstring in pipeline for that."
      
      This reverts commit fbf26b7ed11d55146103c97740bad4a5f91744e0.
      
      * update z-image docstring
      
      * Revert attention dispatcher
      
      * update z-image docstring
      
      * styling
      
      * Restore attention_dispatch.py to its original implementation; a later commit will address fa3 compatibility.
      
      * Fix previous bug, and support passing pre-encoded prompt_embeds as a list of torch Tensors.
      
      * Remove einops dependency.
      
      * remove redundant imports & make fix-copies
      
      * fix import
      
      * Support num_images_per_prompt > 1; remove redundant unused variables.
      
      * Fix bugs for num_images_per_prompt with actual batch.
      
      * Add unit tests for Z-Image.
      
      * Refine unit tests and skip cases that need a separate test env; fix unit-test compatibility in the model, mostly precision formatting.
      
      * Run test_save_load_float16 as a separate test in a clean env; add note; styling.
      
      * Update dtype mentioned by yiyi.
      
      ---------
      Co-authored-by: liudongyang <liudongyang0114@gmail.com>
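The "Replace padding with pad_sequence" commit above swaps hand-rolled zero tensors for batch-wide right-padding: every sequence is padded to the longest one in the batch. A pure-Python sketch of the idea (a list-based stand-in for `torch.nn.utils.rnn.pad_sequence`, not the diffusers code):

```python
def pad_sequences(seqs, pad_value=0):
    """Right-pad variable-length sequences to a common length.

    Mirrors the role of torch.nn.utils.rnn.pad_sequence: instead of
    manually building zero tensors per prompt, every sequence in the
    batch is padded to the length of the longest one.
    """
    if not seqs:
        return []
    max_len = max(len(s) for s in seqs)
    return [list(s) + [pad_value] * (max_len - len(s)) for s in seqs]


batch = [[1, 2, 3], [4], [5, 6]]
padded = pad_sequences(batch)
assert padded == [[1, 2, 3], [4, 0, 0], [5, 6, 0]]
```

In the real pipeline the padded batch is a tensor of prompt embeddings; the benefit is the same in both settings: one uniform shape per batch with no special-casing per prompt.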
    • Improve docstrings and type hints in scheduling_dpmsolver_multistep.py (#12710) · a88a7b4f
      David El Malih authored
      * Improve docstrings and type hints in multiple diffusion schedulers
      
      * docs: update Imagen Video paper link to Hugging Face Papers.
    • [docs] put autopipeline after overview and hunyuanimage in images (#12548) · c8656ed7
      Sayak Paul authored
      put autopipeline after overview and hunyuanimage in images
    • [docs] Correct flux2 links (#12716) · 94c9613f
      Sayak Paul authored
      * fix links
      
      * up
    • [lora]: Fix Flux2 LoRA NaN test (#12714) · b91e8c0d
      Sayak Paul authored
      
      
      * up
      
      * Update tests/lora/test_lora_layers_flux2.py
      Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
      
      ---------
      Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
  9. 25 Nov, 2025 3 commits
    • let's go Flux2 🚀 (#12711) · 5ffb73d4
      Sayak Paul authored
      
      
      * add vae
      
      * Initial commit for Flux 2 Transformer implementation
      
      * add pipeline part
      
      * small edits to the pipeline and conversion
      
      * update conversion script
      
      * fix
      
      * up up
      
      * finish pipeline
      
      * Remove Flux IP Adapter logic for now
      
      * Remove deprecated 3D id logic
      
      * Remove ControlNet logic for now
      
      * Add link to ViT-22B paper as reference for parallel transformer blocks such as the Flux 2 single stream block
      
      * update pipeline
      
      * Don't use biases for input projs and output AdaNorm
      
      * up
      
      * Remove bias for double stream block text QKV projections
      
      * Add script to convert Flux 2 transformer to diffusers
      
      * make style and make quality
      
      * fix a few things.
      
      * allow sft files to go.
      
      * fix image processor
      
      * fix batch
      
      * style a bit
      
      * Fix some bugs in Flux 2 transformer implementation
      
      * Fix dummy input preparation and fix some test bugs
      
      * fix dtype casting in timestep guidance module.
      
      * resolve conflicts.
      
      * remove ip adapter stuff.
      
      * Fix Flux 2 transformer consistency test
      
      * Fix bug in Flux2TransformerBlock (double stream block)
      
      * Get remaining Flux 2 transformer tests passing
      
      * make style; make quality; make fix-copies
      
      * remove stuff.
      
      * fix type annotation.
      
      * remove unneeded stuff from tests
      
      * tests
      
      * up
      
      * up
      
      * add sf support
      
      * Remove unused IP Adapter and ControlNet logic from transformer (#9)
      
      * copied from
      
      * Apply suggestions from code review
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      Co-authored-by: apolinário <joaopaulo.passos@gmail.com>
      
      * up
      
      * up
      
      * up
      
      * up
      
      * up
      
      * Refactor Flux2Attention into separate classes for double stream and single stream attention
      
      * Add _supports_qkv_fusion to AttentionModuleMixin to allow subclasses to disable QKV fusion
      
      * Have Flux2ParallelSelfAttention inherit from AttentionModuleMixin with _supports_qkv_fusion=False
      
      * Log debug message when calling fuse_projections on a AttentionModuleMixin subclass that does not support QKV fusion
      
      * Address review comments
      
      * Update src/diffusers/pipelines/flux2/pipeline_flux2.py
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      
      * up
      
      * Remove maybe_allow_in_graph decorators for Flux 2 transformer blocks (#12)
      
      * up
      
      * support ostris loras. (#13)
      
      * up
      
      * update schedule
      
      * up
      
      * up (#17)
      
      * add training scripts (#16)
      
      * add training scripts
      Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com>
      
      * model cpu offload in validation.
      
      * add flux.2 readme
      
      * add img2img and tests
      
      * cpu offload in log validation
      
      * Apply suggestions from code review
      
      * fix
      
      * up
      
      * fixes
      
      * remove i2i training tests for now.
      
      ---------
      Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com>
      Co-authored-by: linoytsaban <linoy@huggingface.co>
      
      * up
      
      ---------
      Co-authored-by: yiyixuxu <yixu310@gmail.com>
      Co-authored-by: Daniel Gu <dgu8957@gmail.com>
      Co-authored-by: yiyi@huggingface.co <yiyi@ip-10-53-87-203.ec2.internal>
      Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      Co-authored-by: apolinário <joaopaulo.passos@gmail.com>
      Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal>
      Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com>
      Co-authored-by: linoytsaban <linoy@huggingface.co>
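Several of the Flux 2 commits above describe an opt-out mechanism: the mixin exposes a `_supports_qkv_fusion` class attribute, subclasses that cannot fuse QKV (such as the parallel self-attention block) set it to `False`, and `fuse_projections` logs a debug message instead of fusing. A minimal sketch of that pattern (class names here are illustrative, not the actual diffusers classes):

```python
import logging

logger = logging.getLogger(__name__)


class AttentionModuleMixinSketch:
    """Mixin whose subclasses can opt out of QKV fusion via a class attribute."""

    _supports_qkv_fusion = True

    def fuse_projections(self) -> bool:
        if not self._supports_qkv_fusion:
            # Log and skip rather than raise, so callers can invoke
            # fuse_projections uniformly across all attention modules.
            logger.debug(
                "%s does not support QKV fusion; skipping.", type(self).__name__
            )
            return False
        # ... a real implementation would concatenate Q/K/V weights here ...
        return True


class ParallelSelfAttentionSketch(AttentionModuleMixinSketch):
    # Parallel transformer blocks already share the input projection with
    # the MLP, so separate QKV fusion is disabled for this subclass.
    _supports_qkv_fusion = False


assert AttentionModuleMixinSketch().fuse_projections() is True
assert ParallelSelfAttentionSketch().fuse_projections() is False
```

The design choice mirrored here is that an unsupported optimization degrades to a no-op rather than an error, keeping a uniform call site for all attention classes.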
    • Add Support for Z-Image Series (#12703) · 4088e8a8
      Jerry Wu authored
      
      
      * Add Support for Z-Image.
      
      * Reformatting with make style, black & isort.
      
      * Remove init, Modify import utils, Merge forward in transformers block, Remove once func in pipeline.
      
      * modified main model forward, freqs_cis left
      
      * refactored to add B dim
      
      * fixed stack issue
      
      * fixed modulation bug
      
      * fixed modulation bug
      
      * fix bug
      
      * remove value_from_time_aware_config
      
      * styling
      
      * Fix neg embed and divide (/) bug; Reuse zero padding tensor; Turn cat -> repeat; Add hint for attn processor.
      
      * Replace padding with pad_sequence; Add gradient checkpointing.
      
      * Fix flash_attn3 in the attention backend dispatch via _flash_attn_forward, replacing its original implementation; add docstring in pipeline for that.
      
      * Fix Docstring and Make Style.
      
      * Revert "Fix flash_attn3 in the attention backend dispatch via _flash_attn_forward, replacing its original implementation; add docstring in pipeline for that."
      
      This reverts commit fbf26b7ed11d55146103c97740bad4a5f91744e0.
      
      * update z-image docstring
      
      * Revert attention dispatcher
      
      * update z-image docstring
      
      * styling
      
      * Restore attention_dispatch.py to its original implementation; a later commit will address fa3 compatibility.
      
      * Fix previous bug, and support passing pre-encoded prompt_embeds as a list of torch Tensors.
      
      * Remove einops dependency.
      
      * remove redundant imports & make fix-copies
      
      * fix import
      
      ---------
      Co-authored-by: liudongyang <liudongyang0114@gmail.com>
    • fix typo in docs (#12675) · d33d9f67
      Junsong Chen authored
      
      
      * fix typo in docs
      
      * Update docs/source/en/api/pipelines/sana_video.md
      Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
      
      ---------
      Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
  10. 24 Nov, 2025 3 commits
    • Fix variable naming typos in community FluxControlNetFillInpaintPipeline (#12701) · dde8754b
      sq authored
      - Fixed variable naming typos (maskkk -> mask_fill, mask_imagee -> mask_image_fill, masked_imagee -> masked_image_fill, masked_image_latentsss -> masked_latents_fill)
      
      These changes improve code readability without affecting functionality.
    • [i18n-pt] Fix grammar and expand Portuguese documentation (#12598) · fbcd3ba6
      cdutr authored
      * Updates Portuguese documentation for Diffusers library
      
      Enhances the Portuguese documentation with:
      - Restructured table of contents for improved navigation
      - Added placeholder page for in-translation content
      - Refined language and improved readability in existing pages
      - Introduced a new page on basic Stable Diffusion performance guidance
      
      Improves overall documentation structure and user experience for Portuguese-speaking users
      
      * Removes untranslated sections from Portuguese documentation
      
      Cleans up the Portuguese documentation table of contents by removing placeholder sections marked as "Em tradução" (In translation)
      
      Removes the in_translation.md file and associated table of contents entries for sections that are not yet translated, improving documentation clarity
    • [core] support sage attention + FA2 through `kernels` (#12439) · d176f61f
      Sayak Paul authored
      * up
      
      * support automatic dispatch.
      
      * disable compile support for now.
      
      * up
      
      * flash too.
      
      * document.
      
      * up
      
      * up
      
      * up
      
      * up