Commits · dd285099ebe556da550dc0b7c2130cc829ce6395 · renzhc / diffusers_dcu

25 Jun, 2025 2 commits

adjust to get CI test cases passed on XPU (#11759) · dd285099

kaixuanliu authored Jun 25, 2025



* adjust to get CI test cases passed on XPU
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>

* fix format issue
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>

* Apply style fixes

---------
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Aryan <aryan@huggingface.co>

dd285099

[tests] skip instead of returning. (#11793) · 80f27d7e
Sayak Paul authored Jun 25, 2025
```
skip instead of returning.
```
80f27d7e

24 Jun, 2025 6 commits
- guard omnigen processor. (#11799) · d3e27e05
  Sayak Paul authored Jun 24, 2025
  
  d3e27e05
- [tests] Fix group offloading and layerwise casting test interaction (#11796) · 5df02fc1
  Aryan authored Jun 24, 2025
```
* update

* update

* update
```
  5df02fc1
- [chore] raise as early as possible in group offloading (#11792) · 7392c8ff
  Sayak Paul authored Jun 24, 2025
```
* raise as early as possible in group offloading

* remove check from ModuleGroup
```
  7392c8ff
- [tests] Fix HunyuanVideo Framepack device tests (#11789) · 474a248f
  Aryan authored Jun 24, 2025
```
update
```
  474a248f
- [lora] only remove hooks that we add back (#11768) · 7bc0a07b
  YiYi Xu authored Jun 23, 2025
```
up
```
  7bc0a07b
- [docs] minor cleanups in the lora docs. (#11770) · 92542719
  Sayak Paul authored Jun 24, 2025
```
* minor cleanups in the lora docs.

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* format docs

* fix copies

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
```
  92542719
23 Jun, 2025 6 commits

Add --lora_alpha and metadata handling to train_dreambooth_lora_sana.py (#11744) · 67603002
imbr92 authored Jun 23, 2025
```
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>
```
67603002
[Wan] Fix mask padding in Wan VACE pipeline. (#11778) · 798265f2
Yuanchen Guo authored Jun 23, 2025

798265f2
[CI] Skip ONNX Upscale tests (#11774) · cd813499
Dhruv Nair authored Jun 23, 2025
```
update
```
cd813499
[tests] properly skip tests instead of `return` (#11771) · fbddf028
Sayak Paul authored Jun 23, 2025
```
model test updates
```
fbddf028

enable cpu offloading of new pipelines on XPU & use device agnostic empty to... · f20b83a0

Yao Matrix authored Jun 23, 2025


enable cpu offloading of new pipelines on XPU & use device agnostic empty to make pipelines work on XPU (#11671)

* commit 1
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

* patch 2
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

* Update pipeline_pag_sana.py

* Update pipeline_sana.py

* Update pipeline_sana_controlnet.py

* Update pipeline_sana_sprint_img2img.py

* Update pipeline_sana_sprint.py

* fix style
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

* fix fat-thumb while merge conflict
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

* fix ci issues
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>

f20b83a0

enable deterministic in bnb 4 bit tests (#11738) · ee40088f

jiqing-feng authored Jun 23, 2025



* enable deterministic in bnb 4 bit tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

* fix 8bit test
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

ee40088f

21 Jun, 2025 1 commit
- Fix dimensionalities in `apply_rotary_emb` functions' comments (#11717) · 7fc53b5d
  Tolga Cangöz authored Jun 22, 2025
```
Fix dimensionality in `apply_rotary_emb` functions' comments.
```
  7fc53b5d
20 Jun, 2025 5 commits
- [docs] LoRA scale scheduling (#11727) · 0874dd04
  Steven Liu authored Jun 20, 2025
```
draft
```
  0874dd04
- [docs] device_map (#11711) · 6184d8a4
  Steven Liu authored Jun 20, 2025
```
draft
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  6184d8a4
- [docs] Quantization + torch.compile + offloading (#11703) · 5a6e3864
  Steven Liu authored Jun 20, 2025
```
* draft

* feedback

* update

* feedback

* fix

* feedback

* feedback

* fix

* feedback
```
  5a6e3864
- Fix failing cpu offload test for LTX Latent Upscale (#11755) · 42077e6c
  Dhruv Nair authored Jun 20, 2025
```
update
```
  42077e6c
- fix invalid component handling behaviour in `PipelineQuantizationConfig` (#11750) · 3d8d8485
  Sayak Paul authored Jun 20, 2025
```
* start

* updates
```
  3d8d8485
19 Jun, 2025 10 commits

Update Chroma Docs (#11753) · 195926bb

Dhruv Nair authored Jun 19, 2025



* update

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

195926bb

make group offloading work with disk/nvme transfers (#11682) · 85a916bb

Sayak Paul authored Jun 19, 2025

* start implementing disk offloading in group.

* delete diff file.

* updates.patch

* offload_to_disk_path

* check if safetensors already exist.

* add test and clarify.

* updates

* update todos.

* update more docs.

* update docs

85a916bb

Fix HiDream pipeline test module (#11754) · 3287ce28
Dhruv Nair authored Jun 19, 2025
```
update
```
3287ce28
[CI] Fix SANA tests (#11756) · 0c11c8c1
Dhruv Nair authored Jun 19, 2025
```
update
```
0c11c8c1
[CI] Fix WAN VACE tests (#11757) · fc51583c
Dhruv Nair authored Jun 19, 2025
```
update
```
fc51583c

[LoRA] refactor lora loading at the model-level (#11719) · fb57c76a

Sayak Paul authored Jun 19, 2025

* factor out stuff from load_lora_adapter().

* simplifying text encoder lora loading.

* fix peft.py

* fix logging locations.

* formatting

* fix

* update

* update

* update

fb57c76a

Bump urllib3 from 2.2.3 to 2.5.0 in /examples/server (#11748) · 7251bb4f

dependabot[bot] authored Jun 19, 2025

Bumps [urllib3](https://github.com/urllib3/urllib3) from 2.2.3 to 2.5.0.
- [Release notes](https://github.com/urllib3/urllib3/releases)
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst)
- [Commits](https://github.com/urllib3/urllib3/compare/2.2.3...2.5.0

)

---
updated-dependencies:
- dependency-name: urllib3
  dependency-version: 2.5.0
  dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

7251bb4f

Add missing HiDream license (#11747) · 3fba74e1
Aryan authored Jun 19, 2025
```
update
```
3fba74e1
Update more licenses to 2025 (#11746) · a4df8dbc
Aryan authored Jun 19, 2025
```
update
```
a4df8dbc
[Quantizers] add `is_compileable` property to quantizers. (#11736) · 48eae6f4
Sayak Paul authored Jun 19, 2025
```
add is_compileable property to quantizers.
```
48eae6f4

18 Jun, 2025 5 commits

Chroma Follow Up (#11725) · 66394bf6

Dhruv Nair authored Jun 18, 2025

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* updte

* update

* update

* update

66394bf6

[chore] change to 2025 licensing for remaining (#11741) · 62cce304
Sayak Paul authored Jun 18, 2025
```
change to 2025 licensing for remaining
```
62cce304

[tests] device_map tests for all models. (#11708) · 05e86778

Sayak Paul authored Jun 18, 2025



* device_map tests for all models.

* updates

* Update tests/models/test_modeling_common.py
Co-authored-by: Aryan <aryan@huggingface.co>

* fix device_map in test

---------
Co-authored-by: Aryan <aryan@huggingface.co>

05e86778

[training] add ds support to lora hidream (#11737) · d72184eb

Leo Jiang authored Jun 17, 2025



* [training] add ds support to lora hidream

* Apply style fixes

---------
Co-authored-by: J石页 <jiangshuo9@h-partners.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

d72184eb

⚡

️ Speed up method `AutoencoderKLWan.clear_cache` by 886% (#11665) · 5ce4814a

Saurabh Misra authored Jun 17, 2025

* ⚡

️ Speed up method `AutoencoderKLWan.clear_cache` by 886%

**Key optimizations:**
- Compute the number of `WanCausalConv3d` modules in each model (`encoder`/`decoder`) **only once during initialization**, store in `self._cached_conv_counts`. This removes unnecessary repeated tree traversals at every `clear_cache` call, which was the main bottleneck (from profiling).
- The internal helper `_count_conv3d_fast` is optimized via a generator expression with `sum` for efficiency.

All comments from the original code are preserved, except for updated or removed local docstrings/comments relevant to changed lines.  
**Function signatures and outputs remain unchanged.**

* Apply style fixes

* Apply suggestions from code review
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Apply style fixes

---------
Co-authored-by: codeflash-ai[bot] <148906541+codeflash-ai[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: Aseem Saxena <aseem.bits@gmail.com>

5ce4814a

17 Jun, 2025 2 commits

[LoRA training] update metadata use for lora alpha + README (#11723) · 1bc6f3dc

Linoy Tsaban authored Jun 17, 2025



* lora alpha

* Apply style fixes

* Update examples/advanced_diffusion_training/README_flux.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* fix readme format

---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

1bc6f3dc

Support more Wan loras (VACE) (#11726) · 79bd7ecc
Aryan authored Jun 17, 2025
```
update
```
79bd7ecc

16 Jun, 2025 3 commits

Add Pruna optimization framework documentation (#11688) · 9b834f87

David Berenstein authored Jun 16, 2025

* Add Pruna optimization framework documentation

- Introduced a new section for Pruna in the table of contents.
- Added comprehensive documentation for Pruna, detailing its optimization techniques, installation instructions, and examples for optimizing and evaluating models

* Enhance Pruna documentation with image alt text and code block formatting

- Added alt text to images for better accessibility and context.
- Changed code block syntax from diff to python for improved clarity.

* Add installation section to Pruna documentation

- Introduced a new installation section in the Pruna documentation to guide users on how to install the framework.
- Enhanced the overall clarity and usability of the documentation for new users.

* Update pruna.md

* Update Pruna documentation for model optimization and evaluation

- Changed section titles for consistency and clarity, from "Optimizing models" to "Optimize models" and "Evaluating and benchmarking optimized models" to "Evaluate and benchmark models".
- Enhanced descriptions to clarify the use of `diffusers` models and the evaluation process.
- Added a new example for evaluating standalone `diffusers` models.
- Updated references and links for better navigation within the documentation.

* Refactor Pruna documentation for clarity and consistency

- Removed outdated references to FLUX-juiced and streamlined the explanation of benchmarking.
- Enhanced the description of evaluating standalone `diffusers` models.
- Cleaned up code examples by removing unnecessary imports and comments for better readability.

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Enhance Pruna documentation with new examples and clarifications

- Added an image to illustrate the optimization process.
- Updated the explanation for sharing and loading optimized models on the Hugging Face Hub.
- Clarified the evaluation process for optimized models using the EvaluationAgent.
- Improved descriptions for defining metrics and evaluating standalone diffusers models.

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

9b834f87

Fix misleading comment (#11722) · 81426b0f
Carl Thomé authored Jun 16, 2025

81426b0f

[training] show how metadata stuff should be incorporated in training scripts. (#11707) · f0dba33d

Sayak Paul authored Jun 16, 2025



* show how metadata stuff should be incorporated in training scripts.

* typing

* fix

---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>

f0dba33d