1. 25 Nov, 2025 1 commit
    • let's go Flux2 🚀 (#12711) · 5ffb73d4
      Sayak Paul authored
      
      
      * add vae
      
      * Initial commit for Flux 2 Transformer implementation
      
      * add pipeline part
      
      * small edits to the pipeline and conversion
      
      * update conversion script
      
      * fix
      
      * up up
      
      * finish pipeline
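
      A hedged usage sketch of the finished pipeline. The class name follows the new pipeline_flux2.py module added in this PR; the checkpoint id is a placeholder, not a confirmed repo:

          # Minimal text-to-image sketch; "<flux2-checkpoint>" is a placeholder id.
          import torch
          from diffusers import Flux2Pipeline

          pipe = Flux2Pipeline.from_pretrained(
              "<flux2-checkpoint>", torch_dtype=torch.bfloat16
          )
          pipe.enable_model_cpu_offload()  # keep memory use down on consumer GPUs
          image = pipe("a cat wearing a tiny wizard hat").images[0]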
      
      * Remove Flux IP Adapter logic for now
      
      * Remove deprecated 3D id logic
      
      * Remove ControlNet logic for now
      
      * Add link to ViT-22B paper as reference for parallel transformer blocks such as the Flux 2 single stream block
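
      For context, a minimal sketch of the ViT-22B-style parallel layout, in which the attention and MLP branches read the same normalized input and are summed rather than applied sequentially. Module and argument names are illustrative, not the actual Flux 2 single stream block:

          import torch
          import torch.nn as nn

          class ParallelBlock(nn.Module):
              # Parallel transformer block: one shared norm feeds both branches.
              def __init__(self, dim: int, num_heads: int, mlp_ratio: float = 4.0):
                  super().__init__()
                  self.norm = nn.LayerNorm(dim)
                  self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
                  self.mlp = nn.Sequential(
                      nn.Linear(dim, int(dim * mlp_ratio)),
                      nn.GELU(),
                      nn.Linear(int(dim * mlp_ratio), dim),
                  )

              def forward(self, x: torch.Tensor) -> torch.Tensor:
                  h = self.norm(x)
                  attn_out, _ = self.attn(h, h, h)   # attention branch
                  mlp_out = self.mlp(h)              # MLP branch, same input
                  return x + attn_out + mlp_out      # branches summed in parallel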
      
      * update pipeline
      
      * Don't use biases for input projs and output AdaNorm
      
      * up
      
      * Remove bias for double stream block text QKV projections
      
      * Add script to convert Flux 2 transformer to diffusers
      
      * make style and make quality
      
      * fix a few things.
      
      * allow sft files to go.
      
      * fix image processor
      
      * fix batch
      
      * style a bit
      
      * Fix some bugs in Flux 2 transformer implementation
      
      * Fix dummy input preparation and fix some test bugs
      
      * fix dtype casting in timestep guidance module.
      
      * resolve conflicts.
      
      * remove ip adapter stuff.
      
      * Fix Flux 2 transformer consistency test
      
      * Fix bug in Flux2TransformerBlock (double stream block)
      
      * Get remaining Flux 2 transformer tests passing
      
      * make style; make quality; make fix-copies
      
      * remove stuff.
      
      * fix type annotation.
      
      * remove unneeded stuff from tests
      
      * tests
      
      * up
      
      * up
      
      * add sf support
      
      * Remove unused IP Adapter and ControlNet logic from transformer (#9)
      
      * copied from
      
      * Apply suggestions from code review
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      Co-authored-by: apolinário <joaopaulo.passos@gmail.com>
      
      * up
      
      * Refactor Flux2Attention into separate classes for double stream and single stream attention
      
      * Add _supports_qkv_fusion to AttentionModuleMixin to allow subclasses to disable QKV fusion
      
      * Have Flux2ParallelSelfAttention inherit from AttentionModuleMixin with _supports_qkv_fusion=False
      
      * Log debug message when calling fuse_projections on an AttentionModuleMixin subclass that does not support QKV fusion
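
      The flag can be sketched roughly as below. _supports_qkv_fusion and fuse_projections are the names used in these commits; the body is an assumed simplification of the real mixin, not its actual code:

          import logging

          import torch
          import torch.nn as nn

          logger = logging.getLogger(__name__)

          class AttentionModuleMixin:
              # Subclasses set this to False to opt out of QKV fusion.
              _supports_qkv_fusion = True

              def fuse_projections(self):
                  if not self._supports_qkv_fusion:
                      logger.debug(
                          "%s does not support QKV fusion; skipping.",
                          self.__class__.__name__,
                      )
                      return
                  # Concatenate separate Q/K/V weights into one fused projection.
                  w = torch.cat(
                      [self.to_q.weight.data, self.to_k.weight.data, self.to_v.weight.data]
                  )
                  self.to_qkv = nn.Linear(w.shape[1], w.shape[0], bias=False)
                  self.to_qkv.weight.data.copy_(w)

          class Flux2ParallelSelfAttention(AttentionModuleMixin, nn.Module):
              # The parallel self-attention already projects QKV jointly,
              # so fusion is disabled rather than applied a second time.
              _supports_qkv_fusion = False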
      
      * Address review comments
      
      * Update src/diffusers/pipelines/flux2/pipeline_flux2.py
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      
      * up
      
      * Remove maybe_allow_in_graph decorators for Flux 2 transformer blocks (#12)
      
      * up
      
      * support ostris loras. (#13)
      
      * up
      
      * update schedule
      
      * up
      
      * up (#17)
      
      * add training scripts (#16)
      
      * add training scripts
      Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com>
      
      * model cpu offload in validation.
      
      * add flux.2 readme
      
      * add img2img and tests
      
      * cpu offload in log validation
      
      * Apply suggestions from code review
      
      * fix
      
      * up
      
      * fixes
      
      * remove i2i training tests for now.
      
      ---------
      Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com>
      Co-authored-by: linoytsaban <linoy@huggingface.co>
      
      * up
      
      ---------
      Co-authored-by: yiyixuxu <yixu310@gmail.com>
      Co-authored-by: Daniel Gu <dgu8957@gmail.com>
      Co-authored-by: yiyi@huggingface.co <yiyi@ip-10-53-87-203.ec2.internal>
      Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      Co-authored-by: apolinário <joaopaulo.passos@gmail.com>
      Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal>
      Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com>
      Co-authored-by: linoytsaban <linoy@huggingface.co>
  2. 07 Oct, 2025 1 commit
    • [Qwen LoRA training] fix bug when offloading (#12440) · 1066de8c
      Linoy Tsaban authored
      * fix bug when offload and cache_latents both enabled
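
      The commits above don't show the fix itself; the sketch below only illustrates the interaction they describe, with made-up names: when latents are cached, the VAE must still be on the accelerator at encode time and can only be offloaded afterwards.

          import torch

          def cache_latents_then_offload(vae, dataloader, device, offload):
              # Encode all training images once, while the VAE is on-device.
              vae.to(device)
              cache = []
              with torch.no_grad():
                  for batch in dataloader:
                      pixels = batch["pixel_values"].to(device, dtype=vae.dtype)
                      cache.append(vae.encode(pixels).latent_dist)
              if offload:
                  vae.to("cpu")  # safe only after caching has finished
              return cache
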
  3. 03 Oct, 2025 1 commit
  4. 19 Aug, 2025 1 commit
  5. 05 Aug, 2025 1 commit
  6. 29 Jul, 2025 1 commit
  7. 16 Jul, 2025 1 commit
  8. 26 Jun, 2025 1 commit
  9. 18 Jun, 2025 1 commit
  10. 08 May, 2025 1 commit
  11. 05 May, 2025 1 commit
  12. 01 May, 2025 1 commit
  13. 28 Apr, 2025 1 commit
  14. 24 Apr, 2025 1 commit
    • [HiDream LoRA] optimizations + small updates (#11381) · edd78804
      Linoy Tsaban authored
      
      
      * 1. add pre-computation of prompt embeddings when custom prompts are used as well (see the sketch below)
      2. save model card even if model is not pushed to hub
      3. remove scheduler initialization from code example - not necessary anymore (it's now in the base model's config)
      4. add skip_final_inference - to allow running with validation, but skip the final loading of the pipeline with the lora weights to reduce memory reqs
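
      A rough sketch of the pre-computation idea, with illustrative names (the script's actual helpers differ): encode every prompt once up front, then free the text encoder so it costs no memory during training.

          import gc

          import torch

          def precompute_prompt_embeddings(text_encoder, tokenizer, prompts, device):
              cache = {}
              text_encoder.to(device)
              with torch.no_grad():
                  for prompt in prompts:
                      ids = tokenizer(
                          prompt, padding="max_length", truncation=True,
                          return_tensors="pt",
                      ).input_ids.to(device)
                      # [0] is the encoder's last_hidden_state; keep it on CPU.
                      cache[prompt] = text_encoder(ids)[0].cpu()
              # The encoder is no longer needed once every prompt is cached.
              del text_encoder
              gc.collect()
              torch.cuda.empty_cache()
              return cache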
      
      * pre encode validation prompt as well
      
      * Update examples/dreambooth/train_dreambooth_lora_hidream.py
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/dreambooth/train_dreambooth_lora_hidream.py
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/dreambooth/train_dreambooth_lora_hidream.py
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * pre encode validation prompt as well
      
      * Apply style fixes
      
      * empty commit
      
      * change default trained modules
      
      * empty commit
      
      * address comments + change encoding of validation prompt (before, it was only pre-encoded if custom prompts were provided, but it should be pre-encoded either way)
      
      * Apply style fixes
      
      * empty commit
      
      * fix validation_embeddings definition
      
      * fix final inference condition
      
      * fix pipeline deletion in last inference
      
      * Apply style fixes
      
      * empty commit
      
      * layers
      
      * remove readme remarks on only pre-computing when instance prompt is provided and change example to 3d icons
      
      * smol fix
      
      * empty commit
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
  15. 22 Apr, 2025 1 commit
    • [LoRA] add LoRA support to HiDream and fine-tuning script (#11281) · e30d3bf5
      Linoy Tsaban authored
      
      
      * initial commit
      
      * Update examples/dreambooth/train_dreambooth_lora_hidream.py
      Co-authored-by: Bagheera <59658056+bghira@users.noreply.github.com>
      
      * move prompt embeds, pooled embeds outside
      
      * Update examples/dreambooth/train_dreambooth_lora_hidream.py
      Co-authored-by: hlky <hlky@hlky.ac>
      
      * Update examples/dreambooth/train_dreambooth_lora_hidream.py
      Co-authored-by: hlky <hlky@hlky.ac>
      
      * fix import
      
      * fix import and tokenizer 4, text encoder 4 loading
      
      * te
      
      * prompt embeds
      
      * fix naming
      
      * shapes
      
      * initial commit to add HiDreamImageLoraLoaderMixin
      
      * fix init
      
      * add tests
      
      * loader
      
      * fix model input
      
      * add code example to readme
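
      The README snippet presumably looks roughly like this; the checkpoint and LoRA ids are placeholders, and loading goes through the HiDreamImageLoraLoaderMixin added in this PR:

          import torch
          from diffusers import HiDreamImagePipeline

          pipe = HiDreamImagePipeline.from_pretrained(
              "<hidream-checkpoint>", torch_dtype=torch.bfloat16
          )
          pipe.load_lora_weights("<your-lora-repo>")  # via HiDreamImageLoraLoaderMixin
          image = pipe("a photo of sks dog").images[0]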
      
      * fix default max length of text encoders
      
      * prints
      
      * nullify training condition in unpatchify as a temporary fix for incompatible shaping of the transformer output during training
      
      * smol fix
      
      * unpatchify
      
      * unpatchify
      
      * fix validation
      
      * flip pred and loss
      
      * fix shift!!!
      
      * revert unpatchify changes (for now)
      
      * smol fix
      
      * Apply style fixes
      
      * workaround moe training
      
      * workaround moe training
      
      * remove prints
      
      * to reduce memory, keep the vae in `weight_dtype`, same as we do for flux (it's the same vae):
      https://github.com/huggingface/diffusers/blob/bbd0c161b55ba2234304f1e6325832dd69c60565/examples/dreambooth/train_dreambooth_lora_flux.py#L1207
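
      A minimal sketch of that saving, assuming the standard diffusers loading API (the checkpoint id is a placeholder): load the VAE directly in the training weight dtype instead of fp32.

          import torch
          from diffusers import AutoencoderKL

          weight_dtype = torch.bfloat16
          vae = AutoencoderKL.from_pretrained(
              "<hidream-checkpoint>", subfolder="vae", torch_dtype=weight_dtype
          )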
      
      
      
      * refactor to align with HiDream refactor
      
      * refactor to align with HiDream refactor
      
      * refactor to align with HiDream refactor
      
      * add support for cpu offloading of text encoders
      
      * Apply style fixes
      
      * adjust lr and rank for train example
      
      * fix copies
      
      * Apply style fixes
      
      * update README
      
      * update README
      
      * update README
      
      * fix license
      
      * keep prompt2,3,4 as None in validation
      
      * remove reverse ode comment
      
      * Update examples/dreambooth/train_dreambooth_lora_hidream.py
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * Update examples/dreambooth/train_dreambooth_lora_hidream.py
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * vae offload change
      
      * fix text encoder offloading
      
      * Apply style fixes
      
      * cleaner to_kwargs
      
      * fix module name in copied from
      
      * add requirements
      
      * fix offloading
      
      * fix offloading
      
      * fix offloading
      
      * update transformers version in reqs
      
      * try AutoTokenizer
      
      * try AutoTokenizer
      
      * Apply style fixes
      
      * empty commit
      
      * Delete tests/lora/test_lora_layers_hidream.py
      
      * change tokenizer_4 to load with AutoTokenizer as well
      
      * make text_encoder_four and tokenizer_four configurable
      
      * save model card
      
      * save model card
      
      * revert T5
      
      * fix test
      
      * remove non diffusers lumina2 conversion
      
      ---------
      Co-authored-by: Bagheera <59658056+bghira@users.noreply.github.com>
      Co-authored-by: hlky <hlky@hlky.ac>
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
  16. 15 Apr, 2025 1 commit
  17. 09 Apr, 2025 1 commit
  18. 04 Mar, 2025 1 commit
  19. 20 Feb, 2025 1 commit
  20. 06 Feb, 2025 1 commit
    • [bugfix] NPU Adaption for Sana (#10724) · cd0a4a82
      Leo Jiang authored
      
      
      * NPU Adaption for Sana
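
      The commits don't show the diff, but NPU (Ascend) adaptations of this kind typically route device selection through a guard like the following sketch; torch_npu must be installed for the npu branch to exist:

          import torch

          def pick_device() -> torch.device:
              # torch.npu is provided by the torch_npu plugin on Ascend NPUs.
              if hasattr(torch, "npu") and torch.npu.is_available():
                  return torch.device("npu")
              if torch.cuda.is_available():
                  return torch.device("cuda")
              return torch.device("cpu")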
      
      ---------
      Co-authored-by: J石页 <jiangshuo9@h-partners.com>
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
  21. 24 Jan, 2025 1 commit
  22. 21 Jan, 2025 1 commit
  23. 15 Jan, 2025 1 commit
  24. 23 Dec, 2024 2 commits
  25. 18 Dec, 2024 1 commit
    • [LoRA] feat: lora support for SANA. (#10234) · 9408aa2d
      Sayak Paul authored
      
      
      * feat: lora support for SANA.
      
      * make fix-copies
      
      * rename test class.
      
      * attention_kwargs -> cross_attention_kwargs.
      
      * Revert "attention_kwargs -> cross_attention_kwargs."
      
      This reverts commit 23433bf9bccc12e0f2f55df26bae58a894e8b43b.
      
      * exhaust 119 max line limit
      
      * sana lora fine-tuning script.
      
      * readme
      
      * add a note about the supported models.
      
      * Apply suggestions from code review
      Co-authored-by: Aryan <aryan@huggingface.co>
      
      * style
      
      * docs for attention_kwargs.
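
      At inference, the LoRA scale travels through attention_kwargs (the name this PR keeps after reverting the rename). A hedged usage sketch, with placeholder checkpoint and LoRA ids:

          import torch
          from diffusers import SanaPipeline

          pipe = SanaPipeline.from_pretrained(
              "<sana-checkpoint>", torch_dtype=torch.bfloat16
          )
          pipe.load_lora_weights("<your-sana-lora>")
          image = pipe(
              "a photo of sks dog",
              attention_kwargs={"scale": 0.9},  # scales the LoRA contribution
          ).images[0]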
      
      * remove lora_scale from pag pipeline.
      
      * copy fix
      
      ---------
      Co-authored-by: Aryan <aryan@huggingface.co>
  26. 19 Nov, 2024 1 commit
  27. 01 Nov, 2024 2 commits
  28. 31 Oct, 2024 1 commit
  29. 28 Oct, 2024 2 commits
  30. 25 Oct, 2024 1 commit
  31. 22 Oct, 2024 1 commit
  32. 15 Oct, 2024 1 commit
  33. 28 Sep, 2024 1 commit
  34. 15 Sep, 2024 1 commit
  35. 14 Sep, 2024 1 commit
  36. 11 Sep, 2024 1 commit
  37. 14 Aug, 2024 1 commit