Commits · e48f6aeeb47aff48497a6d2b491d09689e25ebeb · renzhc / diffusers_dcu

09 May, 2025 2 commits

enable dit integration cases on xpu (#11523) · d6bf268a

Yao Matrix authored May 09, 2025



* enable dit integration test on XPU
Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* fix style
Signed-off-by: Yao Matrix <matrix.yao@intel.com>

---------
Signed-off-by: Yao Matrix <matrix.yao@intel.com>


enable 7 cases on XPU (#11503) · 2d380895

Yao Matrix authored May 09, 2025



* enable 7 cases on XPU
Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* calibrate A100 expectations
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

---------
Signed-off-by: Yao Matrix <matrix.yao@intel.com>
Signed-off-by: YAO Matrix <matrix.yao@intel.com>


07 May, 2025 1 commit

Cosmos (#10660) · 7b904941

Aryan authored May 07, 2025



* begin transformer conversion

* refactor

* refactor

* refactor

* refactor

* refactor

* refactor

* update

* add conversion script

* add pipeline

* make fix-copies

* remove einops

* update docs

* gradient checkpointing

* add transformer test

* update

* debug

* remove prints

* match sigmas

* add vae pt. 1

* finish CV* vae

* update

* update

* update

* update

* update

* update

* make fix-copies

* update

* make fix-copies

* fix

* update

* update

* make fix-copies

* update

* update tests

* handle device and dtype for safety checker; required in latest diffusers

* remove enable_gqa and use repeat_interleave instead

* enforce safety checker; use dummy checker in fast tests

* add review suggestion for ONNX export
Co-Authored-By: Asfiya Baig <asfiyab@nvidia.com>

* fix safety_checker issues when not passed explicitly

We could either do what's done in this commit, or update the Cosmos examples to explicitly pass the safety checker

* use cosmos guardrail package

* auto format docs

* update conversion script to support 14B models

* update name CosmosPipeline -> CosmosTextToWorldPipeline

* update docs

* fix docs

* fix group offload test failing for vae

---------
Co-authored-by: Asfiya Baig <asfiyab@nvidia.com>


06 May, 2025 1 commit

Hunyuan Video Framepack (#11428) · d7ffe601

Aryan authored May 06, 2025

* add transformer

* add pipeline

* fixes

* make fix-copies

* update

* add flux mu shift

* update example snippet

* debug

* cleanup

* batch_size=1 optimization

* add pipeline test

* fix for model cpu offloading

* add last_image support; credits: https://github.com/lllyasviel/FramePack/pull/167

* update example with flf2v

* update penguin url

* fix test

* address review comment: https://github.com/huggingface/diffusers/pull/11428#discussion_r2071032371

* address review comment: https://github.com/huggingface/diffusers/pull/11428#discussion_r2071087689



* Update src/diffusers/pipelines/hunyuan_video/pipeline_hunyuan_video_framepack.py

---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>


05 May, 2025 1 commit
- enable semantic diffusion and stable diffusion panorama cases on XPU (#11459) · a674914f
  Yao Matrix authored May 05, 2025
```
Signed-off-by: Yao Matrix <matrix.yao@intel.com>
```
01 May, 2025 2 commits

[tests] xfail recent pipeline tests for specific methods. (#11469) · 5dcdf4ac
Sayak Paul authored May 01, 2025
```
xfail recent pipeline tests for specific methods.
```

Fix typos in docs and comments (#11416) · 86294d3c

co63oc authored May 01, 2025



* Fix typos in docs and comments

* Apply style fixes

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>


30 Apr, 2025 5 commits

make autoencoders, controlnet_flux and wan_transformer3d_single_file pass on xpu (#11461) · 06beecaf

Yao Matrix authored May 01, 2025



* make autoencoders, controlnet_flux and wan_transformer3d_single_file pass on XPU
Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* Apply style fixes

---------
Signed-off-by: Yao Matrix <matrix.yao@intel.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Aryan <aryan@huggingface.co>


make safe diffusion test cases pass on XPU and A100 (#11458) · 23c98025

Yao Matrix authored Apr 30, 2025



* make safe diffusion test cases pass on XPU and A100
Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* calibrate A100 expected values
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

---------
Signed-off-by: Yao Matrix <matrix.yao@intel.com>
Signed-off-by: YAO Matrix <matrix.yao@intel.com>


enable unidiffuser test cases on xpu (#11444) · 35fada41

Yao Matrix authored Apr 30, 2025



* enable unidiffuser cases on XPU
Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* fix a typo
Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* fix style
Signed-off-by: Yao Matrix <matrix.yao@intel.com>

---------
Signed-off-by: Yao Matrix <matrix.yao@intel.com>


enable consistency test cases on XPU, all passed (#11446) · fbe2fe55
Yao Matrix authored Apr 30, 2025
```
Signed-off-by: Yao Matrix <matrix.yao@intel.com>
```
enable marigold_intrinsics cases on XPU (#11445) · 60892c55
Yao Matrix authored Apr 30, 2025
```
Signed-off-by: Yao Matrix <matrix.yao@intel.com>
```

28 Apr, 2025 2 commits

enable group_offload cases and quanto cases on XPU (#11405) · 9ce89e2e

Yao Matrix authored Apr 28, 2025



* enable group_offload cases and quanto cases on XPU
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

* use backend APIs
Signed-off-by: Yao Matrix <matrix.yao@intel.com>

* fix style
Signed-off-by: Yao Matrix <matrix.yao@intel.com>

---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
Signed-off-by: Yao Matrix <matrix.yao@intel.com>


[tests] add tests to check for graph breaks, recompilation, cuda syncs in... · aa5f5d41

Sayak Paul authored Apr 28, 2025

[tests] add tests to check for graph breaks, recompilation, cuda syncs in pipelines during torch.compile() (#11085)

* test for better torch.compile stuff.

* fixes

* recompilation and graph break.

* clear compilation cache.

* change to modeling level test.

* allow running compilation tests during nightlies.


23 Apr, 2025 1 commit

Kolors additional pipelines, community contrib (#11372) · b4be4228

Teriks authored Apr 23, 2025



* Kolors additional pipelines, community contrib

---------
Co-authored-by: Teriks <Teriks@users.noreply.github.com>
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>


18 Apr, 2025 1 commit

support Wan-FLF2V (#11353) · 0021bfa1

YiYi Xu authored Apr 18, 2025



* update transformer

---------
Co-authored-by: Aryan <aryan@huggingface.co>


17 Apr, 2025 1 commit
- [Hi Dream] follow-up (#11296) · 05679329
  YiYi Xu authored Apr 17, 2025
```
* add
```
16 Apr, 2025 1 commit
- Hunyuan I2V fast tests fix (#11341) · 59f1b7b1
  Dhruv Nair authored Apr 16, 2025
```
* update

* update
```
15 Apr, 2025 2 commits

post release 0.33.0 (#11255) · 4b868f14

Sayak Paul authored Apr 15, 2025



* post release

* update

* fix deprecations

* remaining

* update

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>


fix CPU offloading related fail cases on XPU (#11288) · 7edace9a

Yao Matrix authored Apr 15, 2025



* fix CPU offloading related fail cases on XPU
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

* fix style
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

* Apply style fixes

* trigger tests

* test_pipe_same_device_id_offload

---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: hlky <hlky@hlky.ac>


14 Apr, 2025 3 commits
- make `KolorsPipelineFastTests::test_inference_batch_single_identical` pass on XPU (#11313) · c7f2d239
  Fanli Lin authored Apr 14, 2025
```
adjust diff
```
- make test_stable_diffusion_karras_sigmas pass on XPU (#11310) · fa1ac50a
  Yao Matrix authored Apr 14, 2025
```
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
```
- make KandinskyV22PipelineInpaintCombinedFastTests::test_float16_inference pass on XPU (#11308) · aa541b9f
  Yao Matrix authored Apr 14, 2025
```
loosen expected_max_diff from 5e-1 to 8e-1 to make
KandinskyV22PipelineInpaintCombinedFastTests::test_float16_inference
pass on XPU
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
```
13 Apr, 2025 1 commit

[ControlNet] Adds controlnet for SanaTransformer (#11040) · f1f38ffb

Ishan Modi authored Apr 13, 2025



* added controlnet for sana transformer

* improve code quality

* addressed PR comments

* bug fixes

* added test cases

* update

* added dummy objects

* addressed PR comments

* update

* Forcing update

* add to docs

* code quality

* addressed PR comments

* addressed PR comments

* update

* addressed PR comments

* added proper styling

* update

* Revert "added proper styling"

This reverts commit 344ee8a7014ada095b295034ef84341f03b0e359.

* manually ordered

* Apply suggestions from code review

---------
Co-authored-by: Aryan <contact.aryanvs@gmail.com>


11 Apr, 2025 2 commits

HiDream Image (#11231) · 0ef29355

hlky authored Apr 11, 2025



* HiDream Image


---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>


[CI] relax tolerance for unclip further (#11268) · 511d7381
Sayak Paul authored Apr 11, 2025
```
relax tolerance for unclip further.
```

10 Apr, 2025 4 commits
- make test_instant_style_multiple_masks pass on XPU (#11266) · 31c4f24f
  Yao Matrix authored Apr 10, 2025
```
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
```
- make test_dict_tuple_outputs_equivalent pass on XPU (#11265) · 450dc48a
  Yao Matrix authored Apr 10, 2025
```
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
```
- make test_stable_diffusion_inpaint_fp16 pass on XPU (#11264) · 77b4f66b
  Yao Matrix authored Apr 10, 2025
```
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
```
- fix test_vanilla_funetuning failure on XPU and A100 (#11263) · 68663f8a
  Yao Matrix authored Apr 10, 2025
```
* fix test_vanilla_funetuning failure on XPU and A100
Signed-off-by: Matrix Yao <matrix.yao@intel.com>

* change back to 5e-2
Signed-off-by: Matrix Yao <matrix.yao@intel.com>

---------
Signed-off-by: Matrix Yao <matrix.yao@intel.com>
```
09 Apr, 2025 3 commits

Update Ruff to latest Version (#10919) · edc154da
Dhruv Nair authored Apr 09, 2025
```
* update

* update

* update

* update
```

fix FluxReduxSlowTests::test_flux_redux_inference case failure on XPU (#11245) · c36c745c

Yao Matrix authored Apr 09, 2025



* loosen test_float16_inference's tolerance from 5e-2 to 6e-2, so XPU can pass UT
Signed-off-by: Matrix Yao <matrix.yao@intel.com>

* fix test_pipeline_flux_redux fail on XPU
Signed-off-by: Matrix Yao <matrix.yao@intel.com>

---------
Signed-off-by: Matrix Yao <matrix.yao@intel.com>


AudioLDM2 Fixes (#11244) · 9ee3dd38
hlky authored Apr 09, 2025


08 Apr, 2025 2 commits

introduce compute arch specific expectations and fix test_sd3_img2img_inference failure (#11227) · c51b6bd8

Yao Matrix authored Apr 08, 2025



* add arch specific expectations support, to support different arch's numerical characteristics
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

* fix typo
Signed-off-by: YAO Matrix <matrix.yao@intel.com>

* Apply suggestions from code review

* Apply style fixes

* Update src/diffusers/utils/testing_utils.py

---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
Co-authored-by: hlky <hlky@hlky.ac>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
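
The idea of per-architecture expectations can be sketched roughly as follows. This is a minimal illustration only, not the helper added to `src/diffusers/utils/testing_utils.py`: the function name `pick_expectation`, the key layout `(device, major_version)`, and all numeric values are assumptions made up for the example.

```python
from typing import Any, Dict, Optional, Tuple

# Key: (device type, major architecture version); None means "any".
# This key layout is an assumption for the sketch, not the library's format.
DeviceKey = Tuple[Optional[str], Optional[int]]


def pick_expectation(
    expectations: Dict[DeviceKey, Any], device: str, major: Optional[int]
) -> Any:
    """Return the most specific expectation registered for the running device."""
    # Try an exact match first, then a device-wide entry, then a global default.
    for key in ((device, major), (device, None), (None, None)):
        if key in expectations:
            return expectations[key]
    raise KeyError(f"no expectation registered for {device!r} (major={major})")


# Hypothetical reference values; a real test would store calibrated output slices.
expected_slices = {
    ("xpu", 3): [0.5435, 0.4673],
    ("cuda", 8): [0.5439, 0.4670],
    (None, None): [0.5440, 0.4671],
}

print(pick_expectation(expected_slices, "xpu", 3))   # exact match
print(pick_expectation(expected_slices, "cuda", 7))  # falls back to the default
```

Keeping the fallback order explicit lets a test add one calibrated entry for a new accelerator without touching the values used elsewhere.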


[LoRA] Implement hot-swapping of LoRA (#9453) · fb544996

Benjamin Bossan authored Apr 08, 2025

* [WIP][LoRA] Implement hot-swapping of LoRA

This PR adds the possibility to hot-swap LoRA adapters. It is WIP.

Description

As of now, users can already load multiple LoRA adapters. They can
offload existing adapters or they can unload them (i.e. delete them).
However, they cannot "hotswap" adapters yet, i.e. substitute the weights
from one LoRA adapter with the weights of another, without the need to
create a separate LoRA adapter.

Generally, hot-swapping may not appear super useful, but when the
model is compiled, it is necessary to prevent recompilation. See #9279
for more context.

Caveats

To hot-swap a LoRA adapter for another, these two adapters should target
exactly the same layers and the "hyper-parameters" of the two adapters
should be identical. For instance, the LoRA alpha has to be the same:
Given that we keep the alpha from the first adapter, the LoRA scaling
would be incorrect for the second adapter otherwise.

Theoretically, we could override the scaling dict with the alpha values
derived from the second adapter's config, but changing the dict will
trigger a guard for recompilation, defeating the main purpose of the
feature.
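
The alpha caveat above can be illustrated with plain arithmetic. The sketch below is not the PEFT or diffusers implementation; it assumes the conventional LoRA scaling of `alpha / rank`, and the adapter configs are hypothetical values chosen for the example.

```python
# Illustrative only: plain-Python arithmetic, not library code.


def lora_scaling(alpha: float, rank: int) -> float:
    """Conventional LoRA scaling factor applied to the low-rank update."""
    return alpha / rank


# Hypothetical adapter configs; names and values are made up for this sketch.
adapter_a = {"rank": 8, "alpha": 16.0}
adapter_b = {"rank": 8, "alpha": 8.0}

# Scaling recorded when the first adapter was loaded.
stored_scaling = lora_scaling(adapter_a["alpha"], adapter_a["rank"])

# Scaling the second adapter was trained with.
expected_scaling_b = lora_scaling(adapter_b["alpha"], adapter_b["rank"])

# Hot-swapping replaces only the weights, so the stored scaling is reused.
# With mismatched alphas, the second adapter's update is off by this factor.
mismatch_factor = stored_scaling / expected_scaling_b

print(stored_scaling, expected_scaling_b, mismatch_factor)
```

With identical alpha and rank the factor is 1, which is why the two adapters are required to share these hyper-parameters.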

I also found that compilation flags can have an impact on whether this
works or not. E.g. when passing "reduce-overhead", there will be errors
of the type:

> input name: arg861_1. data pointer changed from 139647332027392 to 139647331054592

I don't know enough about compilation to determine whether this is
problematic or not.

Current state

This is obviously WIP right now to collect feedback and discuss which
direction to take this. If this PR turns out to be useful, the
hot-swapping functions will be added to PEFT itself and can be imported
here (or there is a separate copy in diffusers to avoid the need for a
min PEFT version to use this feature).

Moreover, more tests need to be added to better cover this feature,
although we don't necessarily need tests for the hot-swapping
functionality itself, since those tests will be added to PEFT.

Furthermore, as of now, this is only implemented for the unet. Other
pipeline components have yet to implement this feature.

Finally, it should be properly documented.

I would like to collect feedback on the current state of the PR before
putting more time into finalizing it.

* Reviewer feedback

* Reviewer feedback, adjust test

* Fix, doc

* Make fix

* Fix for possible g++ error

* Add test for recompilation w/o hotswapping

* Make hotswap work

Requires https://github.com/huggingface/peft/pull/2366

More changes to make hotswapping work. Together with the mentioned PEFT
PR, the tests pass for me locally.

List of changes:

- docstring for hotswap
- remove code copied from PEFT, import from PEFT now
- adjustments to PeftAdapterMixin.load_lora_adapter (unfortunately, some
  state dict renaming was necessary, LMK if there is a better solution)
- adjustments to UNet2DConditionLoadersMixin._process_lora: LMK if this
  is even necessary or not, I'm unsure what the overall relationship is
  between this and PeftAdapterMixin.load_lora_adapter
- also in UNet2DConditionLoadersMixin._process_lora, I saw that there is
  no LoRA unloading when loading the adapter fails, so I added it
  there (in line with what happens in PeftAdapterMixin.load_lora_adapter)
- rewritten tests to avoid shelling out, make the test more precise by
  making sure that the outputs align, parametrize it
- also checked the pipeline code mentioned in this comment:
  https://github.com/huggingface/diffusers/pull/9453#issuecomment-2418508871;
  when running this inside the with
  torch._dynamo.config.patch(error_on_recompile=True) context, there is
  no error, so I think hotswapping is now working with pipelines.

* Address reviewer feedback:

- Revert deprecated method
- Fix PEFT doc link to main
- Don't use private function
- Clarify magic numbers
- Add pipeline test

Moreover:
- Extend docstrings
- Extend existing test for outputs != 0
- Extend existing test for wrong adapter name

* Change order of test decorators

parameterized.expand seems to ignore skip decorators if added in last
place (i.e. innermost decorator).

* Split model and pipeline tests

Also increase test coverage by also targeting conv2d layers (support of
which was added recently on the PEFT PR).

* Reviewer feedback: Move decorator to test classes

... instead of having them on each test method.

* Apply suggestions from code review
Co-authored-by: hlky <hlky@hlky.ac>

* Reviewer feedback: version check, TODO comment

* Add enable_lora_hotswap method

* Reviewer feedback: check _lora_loadable_modules

* Revert changes in unet.py

* Add possibility to ignore enabled at wrong time

* Fix docstrings

* Log possible PEFT error, test

* Raise helpful error if hotswap not supported

I.e. for the text encoder

* Formatting

* More linter

* More ruff

* Doc-builder complaint

* Update docstring:

- mention no text encoder support yet
- make it clear that LoRA is meant
- mention that same adapter name should be passed

* Fix error in docstring

* Update more methods with hotswap argument

- SDXL
- SD3
- Flux

No changes were made to load_lora_into_transformer.

* Add hotswap argument to load_lora_into_transformer

For SD3 and Flux. Use shorter docstring for brevity.

* Extend docstrings

* Add version guards to tests

* Formatting

* Fix LoRA loading call to add prefix=None

See:
https://github.com/huggingface/diffusers/pull/10187#issuecomment-2717571064



* Run make fix-copies

* Add hot swap documentation to the docs

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: hlky <hlky@hlky.ac>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>


02 Apr, 2025 3 commits
- Revert `save_model` in ModelMixin save_pretrained and use safe_serialization=False in test (#11196) · da857beb
  hlky authored Apr 02, 2025
  
- [tests] HunyuanDiTControlNetPipeline inference precision issue on XPU (#11197) · 52b460fe
  Fanli Lin authored Apr 02, 2025
```
* add xpu part

* fix more cases

* remove some cases

* no canny

* format fix
```
- allow models to run with a user-provided dtype map instead of a single dtype (#10301) · d8c617cc
  hlky authored Apr 02, 2025
```
* allow models to run with a user-provided dtype map instead of a single dtype

* make style

* Add warning, change `_` to `default`

* make style

* add test

* handle shared tensors

* remove warning

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
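
A dtype map with a `default` entry, as described in the commit above, might be resolved along these lines. This is a hedged sketch, not the diffusers implementation: `resolve_dtypes`, the component names, and the use of strings in place of `torch.dtype` objects are all assumptions for illustration.

```python
from typing import Dict, Iterable


def resolve_dtypes(
    component_names: Iterable[str],
    dtype_map: Dict[str, str],
    fallback: str = "float32",
) -> Dict[str, str]:
    """Pick a dtype per component, using the map's "default" entry when unlisted."""
    # Strings stand in for torch dtypes so the sketch has no dependencies.
    default = dtype_map.get("default", fallback)
    return {name: dtype_map.get(name, default) for name in component_names}


# Hypothetical pipeline components and user-provided map.
components = ["transformer", "vae", "text_encoder"]
resolved = resolve_dtypes(
    components, {"transformer": "bfloat16", "default": "float16"}
)
print(resolved)
```

Components named in the map get their own dtype, and everything else falls back to the `default` entry, or to the hypothetical `fallback` value when no default is given.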
01 Apr, 2025 1 commit

[WIP] Add Wan Video2Video (#11053) · df1d7b01

Dhruv Nair authored Apr 01, 2025

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update


24 Mar, 2025 1 commit

New HunyuanVideo-I2V (#11066) · 8907a70a

Aryan authored Mar 24, 2025

* update

* update

* update

* add tests

* update docs

* raise value error

* warning for true cfg and guidance scale

* fix test
