Commits · a4df8dbc40e170ff828f8d8f79c2c861c9f1748d · renzhc / diffusers_dcu

19 Jun, 2025 1 commit
- Update more licenses to 2025 (#11746) · a4df8dbc
  Aryan authored Jun 19, 2025
```
update
```
  a4df8dbc
19 May, 2025 1 commit

Quentin Gallouédec authored May 19, 2025



* Use HF Papers

* Apply style fixes

---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

c8bb1ff5

01 May, 2025 1 commit

Fix typos in docs and comments (#11416) · 86294d3c

co63oc authored May 01, 2025



* Fix typos in docs and comments

* Apply style fixes

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

86294d3c

24 Apr, 2025 1 commit
- Fix typos in strings and comments (#11407) · f00a9957
  co63oc authored Apr 25, 2025
  
  f00a9957
09 Apr, 2025 1 commit
- Update Ruff to latest Version (#10919) · edc154da
  Dhruv Nair authored Apr 09, 2025
```
* update

* update

* update

* update
```
  edc154da
22 Feb, 2025 1 commit

Comprehensive type checking for `from_pretrained` kwargs (#10758) · 9c7e2051

Daniel Regado authored Feb 22, 2025



* More robust from_pretrained init_kwargs type checking

* Corrected for Python 3.10

* Type checks subclasses and fixed type warnings

* More type corrections and skip tokenizer type checking

* make style && make quality

* Updated docs and types for Lumina pipelines

* Fixed check for empty signature

* changed location of helper functions

* make style

---------
Co-authored-by: hlky <hlky@hlky.ac>

9c7e2051

20 Feb, 2025 1 commit

[tests] test `encode_prompt()` in isolation (#10438) · b2ca39c8

Sayak Paul authored Feb 20, 2025

* poc encode_prompt() tests

* fix

* updates.

* fixes

* fixes

* updates

* updates

* updates

* revert

* updates

* updates

* updates

* updates

* remove SDXLOptionalComponentsTesterMixin.

* remove tests that directly leveraged encode_prompt() in some way or the other.

* fix imports.

* remove _save_load

* fixes

* fixes

* fixes

* fixes

b2ca39c8

20 Jan, 2025 2 commits

bugfix for npu not support float64 (#10123) · 75a636da

baymax591 authored Jan 21, 2025



* bugfix for npu not support float64

* is_mps is_npu

---------
Co-authored-by: 白超 <baichao19@huawei.com>
Co-authored-by: hlky <hlky@hlky.ac>

75a636da

chore: remove redundant words (#10609) · 4842f5d8
sunxunle authored Jan 21, 2025
```
Signed-off-by: sunxunle <sunxunle@ampere.tech>
```
4842f5d8

14 Jan, 2025 1 commit

[Sana-4K] (#10537) · 3d707773

Junsong Chen authored Jan 15, 2025



* [Sana 4K]
add 4K support for Sana

* [Sana-4K] fix SanaPAGPipeline

* add VAE automatically tiling function;

* set clean_caption to False;

* add warnings for VAE OOM.

* style

---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>

3d707773

12 Jan, 2025 1 commit

[Docs] Add negative prompt docs to FluxPipeline (#10531) · 0785dba4

Sayak Paul authored Jan 12, 2025



* add negative_prompt documentation.

* add proper docs for negative prompts

* fix-copies

* remove comment.

* Apply suggestions from code review
Co-authored-by: hlky <hlky@hlky.ac>

* fix-copies

---------
Co-authored-by: hlky <hlky@hlky.ac>

0785dba4

11 Jan, 2025 1 commit

[DC-AE] support tiling for DC-AE (#10510) · e7db062e

Junyu Chen authored Jan 11, 2025



* autoencoder_dc tiling

* add tiling and slicing support in SANA pipelines

* create variables for padding length because the line becomes too long

* add tiling and slicing support in pag SANA pipelines

* revert changes to tile size

* make style

* add vae tiling test

---------
Co-authored-by: Aryan <aryan@huggingface.co>

e7db062e

10 Jan, 2025 1 commit

Use Pipelines without unet (#10440) · 12fbe3f7

hlky authored Jan 10, 2025



* Use Pipelines without unet

* unet.config.in_channels

* default_sample_size

* is_unet_version_less_0_9_0

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

12fbe3f7

08 Jan, 2025 2 commits

PyTorch/XLA support (#10498) · 95c5ce4e
hlky authored Jan 08, 2025
```
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
95c5ce4e

fix for #7365, prevent pipelines from overriding provided prompt embeds (#7926) · a0acbdc9

Bagheera authored Jan 08, 2025



* fix for #7365, prevent pipelines from overriding provided prompt embeds

* fix-copies

* fix implementation

* update

---------
Co-authored-by: bghira <bghira@users.github.com>
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>

a0acbdc9

07 Jan, 2025 2 commits

Use pipelines without vae (#10441) · ee7e141d

hlky authored Jan 07, 2025



* Use pipelines without vae

* getattr

* vqvae

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

ee7e141d

Use Pipelines without scheduler (#10439) · 628f2c54
hlky authored Jan 07, 2025
```
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
628f2c54

23 Dec, 2024 1 commit

[Sana bug] bug fix for 2K model config (#10340) · b58868e6

Junsong Chen authored Dec 23, 2024



* fix the Positinoal Embedding bug in 2K model;

* Change the default model to the BF16 one for more stable training and output

* make style

* substract buffer size

* add compute_module_persistent_sizes

---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>

b58868e6

18 Dec, 2024 2 commits

Use `torch` in `get_2d_rotary_pos_embed` (#10155) · 0ac52d6f
hlky authored Dec 18, 2024
```
* Use `torch` in `get_2d_rotary_pos_embed`

* Add deprecation
```
0ac52d6f

[LoRA] feat: lora support for SANA. (#10234) · 9408aa2d

Sayak Paul authored Dec 18, 2024



* feat: lora support for SANA.

* make fix-copies

* rename test class.

* attention_kwargs -> cross_attention_kwargs.

* Revert "attention_kwargs -> cross_attention_kwargs."

This reverts commit 23433bf9bccc12e0f2f55df26bae58a894e8b43b.

* exhaust 119 max line limit

* sana lora fine-tuning script.

* readme

* add a note about the supported models.

* Apply suggestions from code review
Co-authored-by: Aryan <aryan@huggingface.co>

* style

* docs for attention_kwargs.

* remove lora_scale from pag pipeline.

* copy fix

---------
Co-authored-by: Aryan <aryan@huggingface.co>

9408aa2d

16 Dec, 2024 1 commit
- Use `t` instead of `timestep` in `_apply_perturbed_attention_guidance` (#10243) · 672bd495
  hlky authored Dec 16, 2024
  
  672bd495
15 Dec, 2024 1 commit

[Sana] Add Sana, including `SanaPipeline`, `SanaPAGPipeline`,... · 5a196e3d

Junsong Chen authored Dec 16, 2024


[Sana] Add Sana, including `SanaPipeline`, `SanaPAGPipeline`, `LinearAttentionProcessor`, `Flow-based DPM-sovler` and so on. (#9982)

* first add a script for DC-AE;

* DC-AE init

* replace triton with custom implementation

* 1. rename file and remove un-used codes;

* no longer rely on omegaconf and dataclass

* replace custom activation with diffuers activation

* remove dc_ae attention in attention_processor.py

* iinherit from ModelMixin

* inherit from ConfigMixin

* dc-ae reduce to one file

* update downsample and upsample

* clean code

* support DecoderOutput

* remove get_same_padding and val2tuple

* remove autocast and some assert

* update ResBlock

* remove contents within super().__init__

* Update src/diffusers/models/autoencoders/dc_ae.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* remove opsequential

* update other blocks to support the removal of build_norm

* remove build encoder/decoder project in/out

* remove inheritance of RMSNorm2d from LayerNorm

* remove reset_parameters for RMSNorm2d
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* remove device and dtype in RMSNorm2d __init__
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/dc_ae.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/dc_ae.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/dc_ae.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* remove op_list & build_block

* remove build_stage_main

* change file name to autoencoder_dc

* move LiteMLA to attention.py

* align with other vae decode output;

* add DC-AE into init files;

* update

* make quality && make style;

* quick push before dgx disappears again

* update

* make style

* update

* update

* fix

* refactor

* refactor

* refactor

* update

* possibly change to nn.Linear

* refactor

* make fix-copies

* replace vae with ae

* replace get_block_from_block_type to get_block

* replace downsample_block_type from Conv to conv for consistency

* add scaling factors

* incorporate changes for all checkpoints

* make style

* move mla to attention processor file; split qkv conv to linears

* refactor

* add tests

* from original file loader

* add docs

* add standard autoencoder methods

* combine attention processor

* fix tests

* update

* minor fix

* minor fix

* minor fix & in/out shortcut rename

* minor fix

* make style

* fix paper link

* update docs

* update single file loading

* make style

* remove single file loading support; todo for DN6

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* add abstract

* 1. add DCAE into diffusers;
2. make style and make quality;

* add DCAE_HF into diffusers;

* bug fixed;

* add SanaPipeline, SanaTransformer2D into diffusers;

* add sanaLinearAttnProcessor2_0;

* first update for SanaTransformer;

* first update for SanaPipeline;

* first success run SanaPipeline;

* model output finally match with original model with the same intput;

* code update;

* code update;

* add a flow dpm-solver scripts

* 🎉[important update]
1. Integrate flow-dpm-sovler into diffusers;
2. finally run successfully on both `FlowMatchEulerDiscreteScheduler` and `FlowDPMSolverMultistepScheduler`;

* 🎉🔧

[important update & fix huge bugs!!]
1. add SanaPAGPipeline & several related Sana linear attention operators;
2. `SanaTransformer2DModel` not supports multi-resolution input;
2. fix the multi-scale HW bugs in SanaPipeline and SanaPAGPipeline;
3. fix the flow-dpm-solver set_timestep() init `model_output` and `lower_order_nums` bugs;

* remove prints;

* add convert sana official checkpoint to diffusers format Safetensor.

* Update src/diffusers/models/transformers/sana_transformer_2d.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/models/transformers/sana_transformer_2d.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/models/transformers/sana_transformer_2d.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/pipelines/pag/pipeline_pag_sana.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/models/transformers/sana_transformer_2d.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/models/transformers/sana_transformer_2d.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/pipelines/sana/pipeline_sana.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/pipelines/sana/pipeline_sana.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* update Sana for DC-AE's recent commit;

* make style && make quality

* Add StableDiffusion3PAGImg2Img Pipeline + Fix SD3 Unconditional PAG (#9932)

* fix progress bar updates in SD 1.5 PAG Img2Img pipeline

---------
Co-authored-by: Vinh H. Pham <phamvinh257@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* make the vae can be None in `__init__` of `SanaPipeline`

* Update src/diffusers/models/transformers/sana_transformer_2d.py
Co-authored-by: hlky <hlky@hlky.ac>

* change the ae related code due to the latest update of DCAE branch;

* change the ae related code due to the latest update of DCAE branch;

* 1. change code based on AutoencoderDC;
2. fix the bug of new GLUMBConv;
3. run success;

* update for solving conversation.

* 1. fix bugs and run convert script success;
2. Downloading ckpt from hub automatically;

* make style && make quality;

* 1. remove un-unsed parameters in init;
2. code update;

* remove test file

* refactor; add docs; add tests; update conversion script

* make style

* make fix-copies

* refactor

* udpate pipelines

* pag tests and refactor

* remove sana pag conversion script

* handle weight casting in conversion script

* update conversion script

* add a processor

* 1. add bf16 pth file path;
2. add complex human instruct in pipeline;

* fix fast \tests

* change gemma-2-2b-it ckpt to a non-gated repo;

* fix the pth path bug in conversion script;

* change grad ckpt to original; make style

* fix the complex_human_instruct bug and typo;

* remove dpmsolver flow scheduler

* apply review suggestions

* change the `FlowMatchEulerDiscreteScheduler` to default `DPMSolverMultistepScheduler` with flow matching scheduler.

* fix the tokenizer.padding_side='right' bug;

* update docs

* make fix-copies

* fix imports

* fix docs

* add integration test

* update docs

* update examples

* fix convert_model_output in schedulers

* fix failing tests

---------
Co-authored-by: Junyu Chen <chenjydl2003@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: chenjy2003 <70215701+chenjy2003@users.noreply.github.com>
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: hlky <hlky@hlky.ac>

5a196e3d

12 Dec, 2024 1 commit

update StableDiffusion3Img2ImgPipeline.add image size validation (#10166) · bdbaea8f

Bios authored Dec 13, 2024



* update StableDiffusion3Img2ImgPipeline.add image size validation

---------
Co-authored-by: hlky <hlky@hlky.ac>

bdbaea8f

10 Dec, 2024 1 commit

Add PAG Support for Stable Diffusion Inpaint Pipeline (#9386) · 65b98b5d

Darshil Jariwala authored Dec 11, 2024



* using sd inpaint pipeline and sdxl pag inpaint pipeline to add changes

* using sd inpaint pipeline and sdxl pag inpaint pipeline to add changes

* finished the call function

* added auto pipeline

* merging diffusers

* ready to test

* ready to test

* added copied from and removed unnecessary tests

* make style changes

* doc changes

* updating example doc string

* style fix

* init

* adding imports

* quality

* Update src/diffusers/pipelines/pag/pipeline_pag_sd_inpaint.py

* make

* Update tests/pipelines/pag/test_pag_sd_inpaint.py

* slice and size

* slice

---------
Co-authored-by: Darshil Jariwala <darshiljariwala@Darshils-MacBook-Air.local>
Co-authored-by: Darshil Jariwala <jariwala.darshil2002@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: hlky <hlky@hlky.ac>

65b98b5d

04 Dec, 2024 1 commit
- Add `sigmas` to pipelines using FlowMatch (#10116) · a2d424eb
  hlky authored Dec 04, 2024
  
  a2d424eb
03 Dec, 2024 2 commits

Fix multi-prompt inference (#10103) · 6a51427b

hlky authored Dec 03, 2024



* Fix multi-prompt inference

Fix generation of multiple images with multiple prompts, e.g len(prompts)>1, num_images_per_prompt>1

* make

* fix copies

---------
Co-authored-by: Nikita Balabin <nikita@mxl.ru>

6a51427b

Add StableDiffusion3PAGImg2Img Pipeline + Fix SD3 Unconditional PAG (#9932) · 63b631f3

Benjamin Paine authored Dec 03, 2024



* fix progress bar updates in SD 1.5 PAG Img2Img pipeline



---------
Co-authored-by: Vinh H. Pham <phamvinh257@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

63b631f3

20 Nov, 2024 1 commit
- fix controlnet module refactor (#9968) · e564abe2
  YiYi Xu authored Nov 20, 2024
```
* fix
```
  e564abe2
14 Nov, 2024 1 commit
- Fix Progress Bar Updates in SD 1.5 PAG Img2Img pipeline (#9925) · d74483c4
  Benjamin Paine authored Nov 14, 2024
```
fix progress bar updates in SD 1.5 PAG Img2Img pipeline
```
  d74483c4
21 Oct, 2024 1 commit

[docs] add docstrings in `pipline_stable_diffusion.py` (#9590) · bcd61fd3

timdalxx authored Oct 22, 2024



* fix the issue on flux dreambooth lora training

* update : origin main code

* docs: update pipeline_stable_diffusion docstring

* docs: update pipeline_stable_diffusion docstring

* Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* fix: style

* fix: style

* fix: copies

* make fix-copies

* remove extra newline

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

bcd61fd3

09 Oct, 2024 1 commit

add PAG support for SD Img2Img (#9463) · af28ae2d

SahilCarterr authored Oct 10, 2024



* added pag to sd img2img pipeline


---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

af28ae2d

08 Oct, 2024 1 commit

Fixed noise_pred_text referenced before assignment. (#9537) · 86bd991e

v2ray authored Oct 09, 2024

* Fixed local variable noise_pred_text referenced before assignment when using PAG with guidance scale and guidance rescale at the same time.

* Fixed style.

* Made returning text pred noise an argument.

86bd991e

03 Oct, 2024 1 commit
- [sd3] make sure height and size are divisible by `16` (#9573) · 99f60821
  YiYi Xu authored Oct 03, 2024
```
* check size

* up
```
  99f60821
01 Oct, 2024 1 commit
- Add PAG support to StableDiffusionControlNetPAGInpaintPipeline (#8875) · 33fafe3d
  JuanCarlosPi authored Oct 01, 2024
```
* Add pag to controlnet inpainting pipeline


---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
```
  33fafe3d
09 Sep, 2024 1 commit
- refactor `get_timesteps` for SDXL img2img + add set_begin_index (#9375) · 485b8bb0
  YiYi Xu authored Sep 09, 2024
```
* refator + add begin_index

* add kolors img2img to doc
```
  485b8bb0
28 Aug, 2024 1 commit

AnimateDiff prompt travel (#9231) · cbc2ec8f

Aryan authored Aug 28, 2024

* update

* implement prompt interpolation

* make style

* resnet memory optimizations

* more memory optimizations; todo: refactor

* update

* update animatediff controlnet with latest changes

* refactor chunked inference changes

* remove print statements

* undo memory optimization changes

* update docstrings

* fix tests

* fix pia tests

* apply suggestions from review

* add tests

* update comment

cbc2ec8f

21 Aug, 2024 1 commit

Add StableDiffusionXLControlNetPAGImg2ImgPipeline (#8990) · 9003d75f

satani99 authored Aug 21, 2024



* Added pad controlnet sdxl img2img pipeline

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

9003d75f

20 Aug, 2024 1 commit
- Fix StableDiffusionXLPAGInpaintPipeline (#9128) · 16a3dad4
  Sangwon Lee authored Aug 21, 2024
  
  16a3dad4
07 Aug, 2024 2 commits

Freenoise change `vae_batch_size` to `decode_chunk_size` (#9110) · e3568d14
Dhruv Nair authored Aug 07, 2024
```
* update

* update
```
e3568d14

[core] FreeNoise (#8948) · 16a93f1a

Aryan authored Aug 07, 2024



* initial work draft for freenoise; needs massive cleanup

* fix freeinit bug

* add animatediff controlnet implementation

* revert attention changes

* add freenoise

* remove old helper functions

* add decode batch size param to all pipelines

* make style

* fix copied from comments

* make fix-copies

* make style

* copy animatediff controlnet implementation from #8972

* add experimental support for num_frames not perfectly fitting context length, ocntext stride

* make unet motion model lora work again based on #8995

* copy load video utils from #8972

* copied from AnimateDiff::prepare_latents

* address the case where last batch of frames does not match length of indices in prepare latents

* decode_batch_size->vae_batch_size; batch vae encode support in animatediff vid2vid

* revert sparsectrl and sdxl freenoise changes

* revert pia

* add freenoise tests

* make fix-copies

* improve docstrings

* add freenoise tests to animatediff controlnet

* update tests

* Update src/diffusers/models/unets/unet_motion_model.py

* add freenoise to animatediff pag

* address review comments

* make style

* update tests

* make fix-copies

* fix error message

* remove copied from comment

* fix imports in tests

* update

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

16a93f1a