- 02 Oct, 2025 1 commit
-
Sayak Paul authored
xfail failing tests in CI.
-
- 01 Oct, 2025 1 commit
-
Sayak Paul authored
* cache non-LoRA pipeline outputs. * up * up * up * up * Revert "up" This reverts commit 772c32e43397f25919c29bbbe8ef9dc7d581cfb8. * up * Revert "up" This reverts commit cca03df7fce55550ed28b59cadec12d1db188283. * up * up * add . * up * up * up * up * up * up
-
- 30 Sep, 2025 5 commits
-
Steven Liu authored
* change syntax * make style
-
Steven Liu authored
* init * feedback * feedback * feedback * feedback * feedback * feedback
-
Lucain authored
* Allow prerelease when installing transformers from main * maybe better * maybe better * and now? * just bored * should be better * works now
-
Yao Matrix authored
fix XPU UT failures w/ latest PyTorch Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
-
Dhruv Nair authored
* update * update * update * update * update
-
- 29 Sep, 2025 6 commits
-
YiYi Xu authored
* fix * add mellon node registry * style * update docstring to include more info! * support custom node mellon * HTTPError -> HfHubHTTPError * up * Update src/diffusers/modular_pipelines/qwenimage/node_utils.py
-
Steven Liu authored
* init * config * lora metadata * feedback * fix * cache allocator warmup for from_single_file * feedback * feedback
-
Steven Liu authored
* init * feedback --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
-
Sayak Paul authored
* feat: support AOBaseConfig classes. * [docs] AOBaseConfig (#12302) init Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * up * replace with is_torchao_version * up * up --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
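A minimal sketch of what this AOBaseConfig support enables; the Int8WeightOnlyConfig class and the FLUX.1-dev checkpoint below are illustrative assumptions, not named in the commit:
```python
# Hedged sketch: pass a torchao AOBaseConfig instance to TorchAoConfig instead of a
# string quant_type. Model id and quantization config class are assumptions.
import torch
from diffusers import FluxTransformer2DModel, TorchAoConfig
from torchao.quantization import Int8WeightOnlyConfig  # an AOBaseConfig subclass

quant_config = TorchAoConfig(Int8WeightOnlyConfig())
transformer = FluxTransformer2DModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="transformer",
    quantization_config=quant_config,
    torch_dtype=torch.bfloat16,
)
```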
-
Akshay Babbar authored
* fix: preserve boolean dtype for attention masks in ChromaPipeline - Convert attention masks to bool and prevent dtype corruption - Fix both positive and negative mask handling in _get_t5_prompt_embeds - Remove float conversion in _prepare_attention_mask method Fixes #12116 * test: add ChromaPipeline attention mask dtype tests * test: add slow ChromaPipeline attention mask tests * chore: removed comments * refactor: removing redundant type conversion * Remove dedicated dtype tests as per feedback --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
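A small illustration of the dtype issue this commit addresses, using a toy tensor rather than the actual ChromaPipeline code:
```python
# Toy example only: keeping the T5 attention mask boolean avoids the lossy
# bool -> float -> bool round trip that the fix removes from the pipeline helpers.
import torch

text_input_ids = torch.tensor([[101, 2009, 2003, 0, 0]])
attention_mask = text_input_ids != 0                # stays torch.bool, as after the fix
old_style_mask = attention_mask.to(torch.float16)   # the float conversion that was removed

assert attention_mask.dtype == torch.bool
assert old_style_mask.dtype == torch.float16        # downstream masking can misread this
```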
-
Sayak Paul authored
up
-
- 26 Sep, 2025 3 commits
-
Sayak Paul authored
* start unbloating docstrings (save_lora_weights). * load_lora_weights() * lora_state_dict * fuse_lora * unfuse_lora * load_lora_into_transformer
-
Sayak Paul authored
* slight edits to the attention backends docs. * Update docs/source/en/optimization/attention_backends.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Sayak Paul authored
* disable installing transformers from main in CI for now. * up * up
-
- 25 Sep, 2025 1 commit
-
Lucain authored
* Support huggingface_hub 0.x and 1.x * httpx
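A generic sketch of the kind of version gating this dual support usually needs; the specific import paths below are assumptions used to show the pattern, not code taken from the commit:
```python
# Illustrative compatibility shim for supporting huggingface_hub 0.x and 1.x side by side.
from packaging import version
import huggingface_hub

HUB_IS_V1 = version.parse(huggingface_hub.__version__) >= version.parse("1.0.0")

try:
    from huggingface_hub.errors import HfHubHTTPError  # newer layout
except ImportError:  # older 0.x releases
    from huggingface_hub.utils import HfHubHTTPError
```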
-
- 24 Sep, 2025 8 commits
-
DefTruth authored
* docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * misc: update examples link * misc: update examples link * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * Refine documentation for CacheDiT features Updated the wording for clarity and consistency in the documentation. Adjusted sections on cache acceleration, automatic block adapter, patch functor, and hybrid cache configuration.
-
Aryan authored
* update * update * add coauthor Co-Authored-By: Dhruv Nair <dhruv.nair@gmail.com> * improve test * handle ip adapter params correctly * fix chroma qkv fusion test * fix fastercache implementation * fix more tests * fight more tests * add back set_attention_backend * update * update * make style * make fix-copies * make ip adapter processor compatible with attention dispatcher * refactor chroma as well * remove rmsnorm assert * minify and deprecate npu/xla processors * update * refactor * refactor; support flash attention 2 with cp * fix * support sage attention with cp * make torch compile compatible * update * refactor * update * refactor * refactor * add ulysses backward * try to make dreambooth script work; accelerator backward not playing well * Revert "try to make dreambooth script work; accelerator backward not playing well" This reverts commit 768d0ea6fa6a305d12df1feda2afae3ec80aa449. * workaround compilation problems with triton when doing all-to-all * support wan * handle backward correctly * support qwen * support ltx * make fix-copies * Update src/diffusers/models/modeling_utils.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * apply review suggestions * update docs * add explanation * make fix-copies * add docstrings * support passing parallel_config to from_pretrained * apply review suggestions * make style * update * Update docs/source/en/api/parallel.md Co-authored-by: Aryan <aryan@huggingface.co> * up --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: sayakpaul <spsayakpaul@gmail.com>
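A hedged sketch of the parallel_config hook and attention backend selection described above; ContextParallelConfig, ring_degree, the "flash" backend name, and the model id are assumptions inferred from the commit message, not verified API:
```python
# Assumed usage, launched under torchrun with one process per GPU.
import torch
from diffusers import ContextParallelConfig, WanTransformer3DModel

transformer = WanTransformer3DModel.from_pretrained(
    "Wan-AI/Wan2.1-T2V-1.3B-Diffusers",
    subfolder="transformer",
    torch_dtype=torch.bfloat16,
    parallel_config=ContextParallelConfig(ring_degree=2),  # ring attention across 2 ranks
)
transformer.set_attention_backend("flash")  # dispatcher backend mentioned in the commit
```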
-
Alberto Chimenti authored
Fixed WanVACEPipeline to allow prompt to be None and skip the encoding step
-
Yao Matrix authored
Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
-
Yao Matrix authored
Signed-off-by: Yao, Matrix <matrix.yao@intel.com>
-
Sayak Paul authored
disable xformers tests for pipelines where it isn't popular.
-
Dhruv Nair authored
* update * update * update
-
Sayak Paul authored
* single scheduler please. * up * up * up
-
- 23 Sep, 2025 3 commits
-
Steven Liu authored
* init * feedback * update * feedback * fixes
-
Dhruv Nair authored
* update * update
-
Steven Liu authored
* init * toctree * scheduler suggestions * toctree
-
- 22 Sep, 2025 7 commits
-
SahilCarterr authored
* Fixes Chroma docs * fix docs; the docs are now consistent
-
Sayak Paul authored
* up * xfail some tests * up * up
-
Sayak Paul authored
* factor out the overlaps in save_lora_weights(). * remove comment. * remove comment. * up * fix-copies
-
SahilCarterr authored
* Fixes enable_xformers_memory_efficient_attention() * Update attention.py
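For context, the method being fixed is enabled on a pipeline roughly like this; the checkpoint id is a placeholder and xformers must be installed:
```python
# Typical call site for the method this commit fixes; requires the xformers package.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "stable-diffusion-v1-5/stable-diffusion-v1-5",  # placeholder checkpoint
    torch_dtype=torch.float16,
).to("cuda")
pipe.enable_xformers_memory_efficient_attention()
```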
-
Chen Mingyi authored
-
Sayak Paul authored
xfail some kandinsky tests.
-
Jason Cox authored
* Upgrade huggingface-hub to version 0.35.0 Updated huggingface-hub version from 0.26.1 to 0.35.0. * Add uvicorn and accelerate to requirements * Fix install instructions for server
-
- 21 Sep, 2025 1 commit
-
naykun authored
* feat: add support for qwenimageeditplus * add copies statement * fix copies statement * remove vl_processor reference
-
- 20 Sep, 2025 1 commit
-
Dhruv Nair authored
update
-
- 18 Sep, 2025 2 commits
-
Dave Lage authored
* Convert alphas for embedders for sd-scripts to ai toolkit conversion * Add kohya embedders conversion test * Apply style fixes --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
-
Fredy authored
Add RequestScopedPipeline for safe concurrent inference, tokenizer lock and non-mutating retrieve_timesteps (#12328) * Basic implementation of request scheduling * Basic editing in SD and Flux Pipelines * Small Fix * Fix * Update for more pipelines * Add examples/server-async * Add examples/server-async * Updated RequestScopedPipeline to handle a single tokenizer lock to avoid race conditions * Fix * Fix _TokenizerLockWrapper * Fix _TokenizerLockWrapper * Delete _TokenizerLockWrapper * Fix tokenizer * Update examples/server-async * Fix server-async * Optimizations in examples/server-async * We keep the implementation simple in examples/server-async * Update examples/server-async/README.md * Update examples/server-async/README.md for changes to tokenizer locks and backward-compatible retrieve_timesteps * The changes to the diffusers core have been undone and all logic is being moved to examples/server-async * Update examples/server-async/utils/* * Fix BaseAsyncScheduler * Rollback in the core of the diffusers * Update examples/server-async/README.md * Complete rollback of diffusers core files * Simple implementation of an asynchronous server compatible with SD3-3.5 and Flux Pipelines * Update examples/server-async/README.md * Fixed import errors in 'examples/server-async/serverasync.py' * Flux Pipeline Discard * Update examples/server-async/README.md * Apply style fixes --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
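The tokenizer-lock idea behind this example server can be sketched as a thin wrapper; this is illustrative only and is not the code shipped in examples/server-async:
```python
# Illustrative only: serialize tokenizer calls so concurrent requests don't race
# on shared tokenizer state inside a request-scoped pipeline.
import threading

class LockedTokenizer:
    def __init__(self, tokenizer):
        self._tokenizer = tokenizer
        self._lock = threading.Lock()

    def __call__(self, *args, **kwargs):
        with self._lock:  # one request tokenizes at a time
            return self._tokenizer(*args, **kwargs)

    def __getattr__(self, name):
        # Delegate attributes like pad_token or model_max_length to the real tokenizer.
        return getattr(self._tokenizer, name)
```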
-
- 17 Sep, 2025 1 commit
-
DefTruth authored
* fix hidream type hint * fix hunyuan-video type hint * fix many type hints * fix many type hint errors * fix many type hint errors * fix many type hint errors * make style & make quality
-