1. 16 Apr, 2025 2 commits
  2. 15 Apr, 2025 1 commit
    • [LoRA] Add LoRA support to AuraFlow (#10216) · 9352a5ca
      Hameer Abbasi authored
      
      
      * Add AuraFlowLoraLoaderMixin
      
      * Add comments, remove qkv fusion
      
      * Add Tests
      
      * Add AuraFlowLoraLoaderMixin to documentation
      
      * Add Suggested changes
      
      * Change attention_kwargs->joint_attention_kwargs
      
      * Rebasing derp.
      
      * fix
      
      * fix
      
      * Quality fixes.
      
      * make style
      
      * `make fix-copies`
      
      * `ruff check --fix`
      
      * Attempt 1 to fix tests.
      
      * Attempt 2 to fix tests.
      
      * Attempt 3 to fix tests.
      
      * Address review comments.
      
      * Rebasing derp.
      
      * Get more tests passing by copying from Flux. Address review comments.
      
      * `joint_attention_kwargs`->`attention_kwargs`
      
      * Add `lora_scale` property for TE LoRAs.
      
      * Make test better.
      
      * Remove useless property.
      
      * Skip TE-only tests for AuraFlow.
      
      * Support LoRA for non-CLIP TEs.
      
      * Restore LoRA tests.
      
      * Undo adding LoRA support for non-CLIP TEs.
      
      * Undo support for TE in AuraFlow LoRA.
      
      * `make fix-copies`
      
      * Sync with upstream changes.
      
      * Remove unneeded stuff.
      
      * Mirror `Lumina2`.
      
      * Skip for MPS.
      
      * Address review comments.
      
      * Remove duplicated code.
      
      * Remove unnecessary code.
      
      * Remove repeated docs.
      
      * Propagate attention.
      
      * Fix TE target modules.
      
      * MPS fix for LoRA tests.
      
      * Unrelated TE LoRA tests fix.
      
      * Fix AuraFlow LoRA tests by applying to the right denoiser layers.
      Co-authored-by: AstraliteHeart <81396681+AstraliteHeart@users.noreply.github.com>
      
      * Apply style fixes
      
      * empty commit
      
      * Fix the repo consistency issues.
      
      * Remove unrelated changes.
      
      * Style.
      
      * Fix `test_lora_fuse_nan`.
      
      * fix quality issues.
      
      * `pytest.xfail` -> `ValueError`.
      
      * Add back `skip_mps`.
      
      * Apply style fixes
      
      * `make fix-copies`
      
      ---------
      Co-authored-by: Warlord-K <warlordk28@gmail.com>
      Co-authored-by: hlky <hlky@hlky.ac>
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: AstraliteHeart <81396681+AstraliteHeart@users.noreply.github.com>
      Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
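      As a usage illustration (not part of the commit itself): a minimal sketch of loading
      a LoRA via the new AuraFlowLoraLoaderMixin. The LoRA repository id and adapter name
      are hypothetical placeholders, and the LoRA scale is passed through attention_kwargs
      per the rename described above.

      ```python
      import torch
      from diffusers import AuraFlowPipeline

      pipe = AuraFlowPipeline.from_pretrained(
          "fal/AuraFlow", torch_dtype=torch.float16
      ).to("cuda")

      # load_lora_weights is provided by AuraFlowLoraLoaderMixin.
      # "your-org/your-auraflow-lora" is a hypothetical placeholder repo id.
      pipe.load_lora_weights("your-org/your-auraflow-lora", adapter_name="style")

      # The commit renames joint_attention_kwargs -> attention_kwargs, so the
      # LoRA scale is forwarded via attention_kwargs at call time.
      image = pipe(
          "a watercolor fox in a misty forest",
          num_inference_steps=28,
          attention_kwargs={"scale": 0.8},
      ).images[0]
      image.save("auraflow_lora.png")
      ```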
  3. 14 Apr, 2025 1 commit
  4. 10 Apr, 2025 2 commits
  5. 09 Apr, 2025 2 commits
  6. 08 Apr, 2025 2 commits
    • Flux quantized with lora (#10990) · 5d49b3e8
      hlky authored
      
      
      * Flux quantized with lora
      
      * fix
      
      * changes
      
      * Apply suggestions from code review
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * Apply style fixes
      
      * enable model CPU offload (`enable_model_cpu_offload()`)
      
      * Update src/diffusers/loaders/lora_pipeline.py
      Co-authored-by: hlky <hlky@hlky.ac>
      
      * update
      
      * Apply suggestions from code review
      
      * update
      
      * add peft as an additional dependency for gguf
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
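      As a usage illustration (not part of the commit itself): a minimal sketch of loading
      a LoRA on top of a GGUF-quantized Flux transformer. The GGUF file path and the LoRA
      repository id are placeholders, and peft must be installed (the commit adds it as an
      extra dependency for gguf).

      ```python
      import torch
      from diffusers import FluxPipeline, FluxTransformer2DModel, GGUFQuantizationConfig

      # Placeholder path to a GGUF checkpoint of the Flux transformer.
      gguf_path = "https://huggingface.co/city96/FLUX.1-dev-gguf/blob/main/flux1-dev-Q4_K_S.gguf"

      transformer = FluxTransformer2DModel.from_single_file(
          gguf_path,
          quantization_config=GGUFQuantizationConfig(compute_dtype=torch.bfloat16),
          torch_dtype=torch.bfloat16,
      )
      pipe = FluxPipeline.from_pretrained(
          "black-forest-labs/FLUX.1-dev",
          transformer=transformer,
          torch_dtype=torch.bfloat16,
      )
      pipe.enable_model_cpu_offload()

      # Hypothetical placeholder LoRA repository id.
      pipe.load_lora_weights("your-org/your-flux-lora")

      image = pipe("a cyberpunk alley at dusk", num_inference_steps=20).images[0]
      image.save("flux_gguf_lora.png")
      ```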
    • [LoRA] Implement hot-swapping of LoRA (#9453) · fb544996
      Benjamin Bossan authored
      * [WIP][LoRA] Implement hot-swapping of LoRA
      
      This PR adds the possibility to hot-swap LoRA adapters. It is WIP.
      
      Description
      
      As of now, users can already load multiple LoRA adapters. They can
      offload existing adapters or they can unload them (i.e. delete them).
      However, they cannot "hotswap" adapters yet, i.e. substitute the weights
      from one LoRA adapter with the weights of another, without the need to
      create a separate LoRA adapter.
      
      Generally, hot-swapping may not appear super useful, but when the
      model is compiled, it is necessary to prevent recompilation. See #9279
      for more context.
      
      Caveats
      
      To hot-swap a LoRA adapter for another, these two adapters should target
      exactly the same layers and the "hyper-parameters" of the two adapters
      should be identical. For instance, the LoRA alpha has to be the same:
      Given that we keep the alpha from the first adapter, the LoRA scaling
      would be incorrect for the second adapter otherwise.
      
      Theoretically, we could override the scaling dict with the alpha values
      derived from the second adapter's config, but changing the dict will
      trigger a guard for recompilation, defeating the main purpose of the
      feature.
      
      I also found that compilation flags can have an impact on whether this
      works or not. E.g. when passing "reduce-overhead", there will be errors
      of the type:
      
      > input name: arg861_1. data pointer changed from 139647332027392 to
      139647331054592
      
      I don't know enough about compilation to determine whether this is
      problematic or not.
      
      Current state
      
      This is obviously WIP right now to collect feedback and discuss which
      direction to take this. If this PR turns out to be useful, the
      hot-swapping functions will be added to PEFT itself and can be imported
      here (or there is a separate copy in diffusers to avoid the need for a
      min PEFT version to use this feature).
      
      Moreover, more tests need to be added to better cover this feature,
      although we don't necessarily need tests for the hot-swapping
      functionality itself, since those tests will be added to PEFT.
      
      Furthermore, as of now, this is only implemented for the unet. Other
      pipeline components have yet to implement this feature.
      
      Finally, it should be properly documented.
      
      I would like to collect feedback on the current state of the PR before
      putting more time into finalizing it.
      
      * Reviewer feedback
      
      * Reviewer feedback, adjust test
      
      * Fix, doc
      
      * Make fix
      
      * Fix for possible g++ error
      
      * Add test for recompilation w/o hotswapping
      
      * Make hotswap work
      
      Requires https://github.com/huggingface/peft/pull/2366
      
      More changes to make hotswapping work. Together with the mentioned PEFT
      PR, the tests pass for me locally.
      
      List of changes:
      
      - docstring for hotswap
      - remove code copied from PEFT, import from PEFT now
      - adjustments to PeftAdapterMixin.load_lora_adapter (unfortunately, some
        state dict renaming was necessary, LMK if there is a better solution)
      - adjustments to UNet2DConditionLoadersMixin._process_lora: LMK if this
        is even necessary or not, I'm unsure what the overall relationship is
        between this and PeftAdapterMixin.load_lora_adapter
      - also in UNet2DConditionLoadersMixin._process_lora, I saw that there is
        no LoRA unloading when loading the adapter fails, so I added it
        there (in line with what happens in PeftAdapterMixin.load_lora_adapter)
      - rewritten tests to avoid shelling out, make the test more precise by
        making sure that the outputs align, parametrize it
      - also checked the pipeline code mentioned in this comment:
        https://github.com/huggingface/diffusers/pull/9453#issuecomment-2418508871;
        when running it inside the `with torch._dynamo.config.patch(error_on_recompile=True)`
        context, there is no error, so I think hotswapping is now working with pipelines.
      
      * Address reviewer feedback:
      
      - Revert deprecated method
      - Fix PEFT doc link to main
      - Don't use private function
      - Clarify magic numbers
      - Add pipeline test
      
      Moreover:
      - Extend docstrings
      - Extend existing test for outputs != 0
      - Extend existing test for wrong adapter name
      
      * Change order of test decorators
      
      parameterized.expand seems to ignore skip decorators if added in last
      place (i.e. innermost decorator).
      
      * Split model and pipeline tests
      
      Also increase test coverage by targeting conv2d layers (support for
      which was added recently in the PEFT PR).
      
      * Reviewer feedback: Move decorator to test classes
      
      ... instead of having them on each test method.
      
      * Apply suggestions from code review
      Co-authored-by: hlky <hlky@hlky.ac>
      
      * Reviewer feedback: version check, TODO comment
      
      * Add enable_lora_hotswap method
      
      * Reviewer feedback: check _lora_loadable_modules
      
      * Revert changes in unet.py
      
      * Add possibility to ignore enabled at wrong time
      
      * Fix docstrings
      
      * Log possible PEFT error, test
      
      * Raise helpful error if hotswap not supported
      
      I.e. for the text encoder
      
      * Formatting
      
      * More linter
      
      * More ruff
      
      * Doc-builder complaint
      
      * Update docstring:
      
      - mention no text encoder support yet
      - make it clear that LoRA is meant
      - mention that same adapter name should be passed
      
      * Fix error in docstring
      
      * Update more methods with hotswap argument
      
      - SDXL
      - SD3
      - Flux
      
      No changes were made to load_lora_into_transformer.
      
      * Add hotswap argument to load_lora_into_transformer
      
      For SD3 and Flux. Use shorter docstring for brevity.
      
      * Extend docstrings
      
      * Add version guards to tests
      
      * Formatting
      
      * Fix LoRA loading call to add prefix=None
      
      See:
      https://github.com/huggingface/diffusers/pull/10187#issuecomment-2717571064
      
      
      
      * Run make fix-copies
      
      * Add hot swap documentation to the docs
      
      * Apply suggestions from code review
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: hlky <hlky@hlky.ac>
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
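      As a usage illustration (not part of the commit itself): a minimal sketch of the
      hot-swapping flow with a compiled denoiser. The LoRA repository ids are placeholders,
      and the target_rank argument reflects my reading of enable_lora_hotswap (it sizes the
      adapter slots so differently-ranked adapters can be swapped in); check the docs for
      the exact signature. Note the text encoder is not supported for hot-swapping here.

      ```python
      import torch
      from diffusers import DiffusionPipeline

      pipe = DiffusionPipeline.from_pretrained(
          "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
      ).to("cuda")

      # Call before loading the first adapter so LoRA slots are sized for every
      # adapter we plan to swap in (assumed signature, see lead-in above).
      pipe.enable_lora_hotswap(target_rank=64)

      # First adapter: load it, then compile once.
      pipe.load_lora_weights("your-org/lora-style-a", adapter_name="default_0")
      pipe.unet = torch.compile(pipe.unet)
      image_a = pipe("an astronaut riding a horse").images[0]

      # Second adapter: swap the weights in place under the same adapter name,
      # which avoids recompiling the compiled unet.
      pipe.load_lora_weights(
          "your-org/lora-style-b", adapter_name="default_0", hotswap=True
      )
      image_b = pipe("an astronaut riding a horse").images[0]
      ```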
  7. 04 Apr, 2025 2 commits
  8. 26 Mar, 2025 2 commits
  9. 21 Mar, 2025 1 commit
  10. 20 Mar, 2025 1 commit
  11. 19 Mar, 2025 1 commit
  12. 14 Mar, 2025 1 commit
  13. 12 Mar, 2025 1 commit
  14. 11 Mar, 2025 3 commits
  15. 10 Mar, 2025 3 commits
  16. 08 Mar, 2025 1 commit
  17. 07 Mar, 2025 2 commits
  18. 06 Mar, 2025 2 commits
  19. 04 Mar, 2025 3 commits
  20. 03 Mar, 2025 1 commit
    • Fix SD2.X clip single file load projection_dim (#10770) · 9e910c46
      Teriks authored
      
      
      * Fix SD2.X clip single file load projection_dim
      
      Infer projection_dim from the checkpoint before loading
      from pretrained, override any incorrect hub config.
      
      Hub configuration for SD2.X specifies projection_dim=512,
      which is incorrect for SD2.X checkpoints loaded from civitai
      and similar sources.
      
      An exception was previously thrown when attempting
      load_model_dict_into_meta for SD2.X single file checkpoints.
      
      Such LDM models usually require projection_dim=1024.
      
      * convert_open_clip_checkpoint: use hidden_size for text_proj_dim
      
      * convert_open_clip_checkpoint: revert checkpoint[text_proj_key].shape[1] -> [0]
        (the values are identical)
      
      ---------
      Co-authored-by: Teriks <Teriks@users.noreply.github.com>
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
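      As an illustration of the idea (not the actual loader code): infer projection_dim
      from the checkpoint's text projection weight instead of trusting the hub config.
      The state-dict key below is an assumption about the LDM/open_clip layout, and the
      helper name is hypothetical.

      ```python
      import safetensors.torch

      def infer_projection_dim(ckpt_path, text_proj_key="cond_stage_model.model.text_projection"):
          # Load the single-file checkpoint and look up the text projection weight.
          state_dict = safetensors.torch.load_file(ckpt_path)
          if text_proj_key in state_dict:
              # The projection matrix is square for SD2.X, so shape[0] == shape[1]
              # (hence the shape[1] -> shape[0] revert noted in the commit).
              return state_dict[text_proj_key].shape[0]
          return None  # fall back to whatever the hub config says

      # e.g. override an incorrect hub value of 512 with the inferred 1024
      projection_dim = infer_projection_dim("sd2x-checkpoint.safetensors") or 512
      ```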
  21. 25 Feb, 2025 1 commit
  22. 24 Feb, 2025 2 commits
  23. 21 Feb, 2025 1 commit
  24. 20 Feb, 2025 2 commits