Commits · 82188cef0487837b8c70fc3f36ea63c05c85f341 · renzhc / diffusers_dcu

15 Mar, 2025 1 commit

CogView4 Control Block (#10809) · 82188cef

Yuxuan Zhang authored Mar 16, 2025




* cogview4 control training


---------
Co-authored-by: OleehyO <leehy0357@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>

82188cef

13 Mar, 2025 1 commit

Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline (#10827) · 5551506b

hlky authored Mar 13, 2025



* Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline


---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

5551506b

24 Feb, 2025 1 commit
- Add SD3 ControlNet to AutoPipeline (#10888) · aba4a579
  hlky authored Feb 24, 2025
```
Co-authored-by: puhuk <wetr235@gmail.com>
```
  aba4a579
15 Feb, 2025 1 commit

CogView4 (supports different length c and uc) (#10649) · d90cd362

Yuxuan Zhang authored Feb 16, 2025



* init

* encode with glm

* draft schedule

* feat(scheduler): Add CogView scheduler implementation

* feat(embeddings): add CogView 2D rotary positional embedding

* 1

* Update pipeline_cogview4.py

* fix the timestep init and sigma

* update latent

* draft patch (not working)

* fix

* [WIP][cogview4]: implement initial CogView4 pipeline

Implement the basic CogView4 pipeline structure with the following changes:
- Add CogView4 pipeline implementation
- Implement DDIM scheduler for CogView4
- Add CogView3Plus transformer architecture
- Update embedding models

Current limitations:
- CFG implementation uses padding for sequence length alignment
- Need to verify transformer inference alignment with Megatron

TODO:
- Consider separate forward passes for condition/uncondition
  instead of padding approach

* [WIP][cogview4][refactor]: Split condition/uncondition forward pass in CogView4 pipeline

Split the forward pass for conditional and unconditional predictions in the CogView4 pipeline to match the original implementation. The noise prediction is now done separately for each case before combining them for guidance. However, the results still need improvement.

This is a work in progress as the generated images are not yet matching expected quality.
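
The split described above can be sketched as two independent forward passes whose outputs are combined with the usual classifier-free guidance formula. This is a minimal sketch, not the pipeline's code: `toy_predict` is a hypothetical stand-in for the transformer, and plain lists stand in for tensors. Because each pass gets its own embeddings, the conditional and unconditional prompts can have different lengths without padding.

```python
from typing import Callable, List

Vector = List[float]


def guided_noise(
    predict: Callable[[Vector, Vector], Vector],
    latents: Vector,
    cond_embeds: Vector,
    uncond_embeds: Vector,
    guidance_scale: float,
) -> Vector:
    # Two separate forward passes: no padding is needed to align lengths.
    noise_cond = predict(latents, cond_embeds)
    noise_uncond = predict(latents, uncond_embeds)
    # Classifier-free guidance: move from the unconditional prediction
    # toward the conditional one by guidance_scale.
    return [u + guidance_scale * (c - u) for c, u in zip(noise_cond, noise_uncond)]


def toy_predict(latents: Vector, embeds: Vector) -> Vector:
    # Hypothetical model: shifts the latents by the mean of the embeddings.
    bias = sum(embeds) / len(embeds)
    return [x + bias for x in latents]


latents = [0.0, 1.0]
cond = [1.0, 2.0, 3.0]  # conditional embeddings, length 3
uncond = [0.0]          # unconditional embeddings, length 1
result = guided_noise(toy_predict, latents, cond, uncond, guidance_scale=2.0)
```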

* use with -2 hidden state

* remove text_projector

* 1

* [WIP] Add tensor-reload to align input from transformer block

* [WIP] for older glm

* run the cogview4 transformer forward twice, for c and uc

* Update convert_cogview4_to_diffusers.py

* remove this

* use main example

* change back

* reset

* setback

* back

* back 4

* Fix qkv conversion logic for CogView4 to Diffusers format

* back5

* revert to sat to cogview4 version

* update a new convert from megatron

* [WIP][cogview4]: implement CogView4 attention processor

Add CogView4AttnProcessor class for implementing scaled dot-product attention
with rotary embeddings for the CogView4 model. This processor concatenates
encoder and hidden states, applies QKV projections and RoPE, but does not
include spatial normalization.

TODO:
- Fix incorrect QKV projection weights
- Resolve ~25% error in RoPE implementation compared to Megatron

* [cogview4] implement CogView4 transformer block

Implement CogView4 transformer block following the Megatron architecture:
- Add multi-modulate and multi-gate mechanisms for adaptive layer normalization
- Implement dual-stream attention with encoder-decoder structure
- Add feed-forward network with GELU activation
- Support rotary position embeddings for image tokens

The implementation follows the original CogView4 architecture while adapting
it to work within the diffusers framework.

* with new attn

* [bugfix] fix dimension mismatch in CogView4 attention

* [cogview4][WIP]: update final normalization in CogView4 transformer

Refactored the final normalization layer in CogView4 transformer to use separate layernorm and AdaLN operations instead of combined AdaLayerNormContinuous. This matches the original implementation but needs validation.

Needs verification against reference implementation.
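
As an illustration of what "separate layernorm and AdaLN operations" can look like, here is a minimal sketch under stated assumptions: a parameter-free layer norm followed by the common `x * (1 + scale) + shift` modulation. The `shift` and `scale` values are hypothetical constants; in a real model they would be projected from the conditioning embedding. This is not the code from the commit.

```python
import math
from typing import List


def layer_norm(x: List[float], eps: float = 1e-5) -> List[float]:
    # Normalize to zero mean and unit variance, with no learned affine parameters.
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def modulate(normed: List[float], shift: List[float], scale: List[float]) -> List[float]:
    # AdaLN-style modulation applied as its own step after the norm.
    return [n * (1.0 + s) + b for n, s, b in zip(normed, scale, shift)]


hidden = [1.0, 2.0, 3.0, 4.0]
normed = layer_norm(hidden)
# Hypothetical conditioning-derived values.
out = modulate(normed, shift=[0.5] * 4, scale=[1.0] * 4)
```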

* 1

* put back

* Update transformer_cogview4.py

* change time_shift

* Update pipeline_cogview4.py

* change timesteps

* fix

* change text_encoder_id

* [cogview4][rope] align RoPE implementation with Megatron

- Implement apply_rope method in attention processor to match Megatron's implementation
- Update position embeddings to ensure compatibility with Megatron-style rotary embeddings
- Ensure consistent rotary position encoding across attention layers

This change improves compatibility with Megatron-based models and provides
better alignment with the original implementation's positional encoding approach.
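
To show why two RoPE implementations may need explicit alignment, the sketch below applies rotary embeddings with two common channel-pairing conventions (adjacent pairs versus first half paired with second half). Which convention Megatron or diffusers uses is not stated in this commit message, so the function names and the pairing choice are assumptions made for illustration only.

```python
import math
from typing import List, Tuple


def rope_angles(position: int, dim: int, base: float = 10000.0) -> List[float]:
    # One rotation angle per channel pair; earlier pairs rotate faster.
    return [position / (base ** (2 * i / dim)) for i in range(dim // 2)]


def rotate_pairs(x: List[float], position: int, pairs: List[Tuple[int, int]]) -> List[float]:
    # Rotate each (i, j) channel pair by its angle.
    out = list(x)
    for (i, j), theta in zip(pairs, rope_angles(position, len(x))):
        c, s = math.cos(theta), math.sin(theta)
        out[i] = x[i] * c - x[j] * s
        out[j] = x[i] * s + x[j] * c
    return out


def rope_interleaved(x: List[float], position: int) -> List[float]:
    # Pairs are (0, 1), (2, 3), ...
    return rotate_pairs(x, position, [(2 * i, 2 * i + 1) for i in range(len(x) // 2)])


def rope_half_split(x: List[float], position: int) -> List[float]:
    # Pairs are (0, d/2), (1, d/2 + 1), ...
    half = len(x) // 2
    return rotate_pairs(x, position, [(i, i + half) for i in range(half)])


def dot(a: List[float], b: List[float]) -> float:
    return sum(p * q for p, q in zip(a, b))


query = [1.0, 2.0, 3.0, 4.0]
key = [0.5, -1.0, 2.0, 0.0]
interleaved = rope_interleaved(query, position=1)
half_split = rope_half_split(query, position=1)
```

Both variants preserve vector norms and make attention scores depend only on relative position, yet they produce different outputs for the same input, so weights converted from one convention will not match a model that assumes the other.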

* [cogview4][bugfix] apply silu activation to time embeddings in CogView4

Applied silu activation to time embeddings before splitting into conditional
and unconditional parts in CogView4Transformer2DModel. This matches the
original implementation and helps ensure correct time conditioning behavior.
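
A minimal sketch of the order of operations described above: apply SiLU first, then split. How the real model splits the embedding is not shown in this commit message, so the two-way split into halves and the sample values are illustrative assumptions.

```python
import math
from typing import List


def silu(x: float) -> float:
    # SiLU (swish): x * sigmoid(x).
    return x / (1.0 + math.exp(-x))


# Hypothetical time embedding.
time_embedding: List[float] = [-1.0, 0.0, 1.0, 2.0]

# Activation is applied to the whole embedding before it is split.
activated = [silu(v) for v in time_embedding]
half = len(activated) // 2
first_part, second_part = activated[:half], activated[half:]
```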

* [cogview4][chore] clean up pipeline code

- Remove commented out code and debug statements
- Remove unused retrieve_timesteps function
- Clean up code formatting and documentation

This commit focuses on code cleanup in the CogView4 pipeline implementation, removing unnecessary commented code and improving readability without changing functionality.

* [cogview4][scheduler] Implement CogView4 scheduler and pipeline

* now it works

* add timestep

* batch

* change convert script

* refactor pt. 1; make style

* refactor pt. 2

* refactor pt. 3

* add tests

* make fix-copies

* update toctree.yml

* use flow match scheduler instead of custom

* remove scheduling_cogview.py

* add tiktoken to test dependencies

* Update src/diffusers/models/embeddings.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* apply suggestions from review

* use diffusers apply_rotary_emb

* update flow match scheduler to accept timesteps

* fix comment

* apply review suggestions

* Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

---------
Co-authored-by: 三洋三洋 <1258009915@qq.com>
Co-authored-by: OleehyO <leehy0357@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

d90cd362

11 Feb, 2025 1 commit

Add support for lumina2 (#10642) · 81440fd4

Le Zhuo authored Feb 12, 2025



* Add support for lumina2


---------
Co-authored-by: csuhan <hanjiaming@whu.edu.cn>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: hlky <hlky@hlky.ac>

81440fd4

21 Jan, 2025 1 commit
- [chore] change licensing to 2025 from 2024. (#10615) · 4ace7d04
  Sayak Paul authored Jan 21, 2025
```
change licensing to 2025 from 2024.
```
  4ace7d04
13 Jan, 2025 1 commit
- [Sana] add Sana to auto-text2image-pipeline; (#10538) · ae019da9
  Junsong Chen authored Jan 14, 2025
```
add Sana to auto-text2image-pipeline;
```
  ae019da9
06 Jan, 2025 1 commit

Change the RunwayML path for V1.5 to... · 4f5e3e35

Ameer Azam authored Jan 07, 2025

Change the RunwayML path for V1.5 to stable-diffusion-v1-5/[stable-diffusion-v1-5/ stable-diffusion-inpainting] (#10476)

* Update pipeline_controlnet.py

* Update pipeline_controlnet_img2img.py

runwayml take-down, so change all references to
stable-diffusion-v1-5/stable-diffusion-v1-5

* Update pipeline_controlnet_inpaint.py

* runwayml take-down make change to sd-legacy

* runwayml take-down make change to sd-legacy

* runwayml take-down make change to sd-legacy

* runwayml take-down make change to sd-legacy

* Update convert_blipdiffusion_to_diffusers.py

style change
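
The kind of path change this commit makes can be sketched as a lookup from old repository IDs to new ones. The mapping below is an assumption based on a reading of the commit title (two repositories moved under the `stable-diffusion-v1-5` namespace); `RENAMED_REPOS` and `resolve_repo_id` are hypothetical names and not part of diffusers.

```python
# Hypothetical mapping inferred from the commit title; not a diffusers API.
RENAMED_REPOS = {
    "runwayml/stable-diffusion-v1-5": "stable-diffusion-v1-5/stable-diffusion-v1-5",
    "runwayml/stable-diffusion-inpainting": "stable-diffusion-v1-5/stable-diffusion-inpainting",
}


def resolve_repo_id(repo_id: str) -> str:
    # Return the new ID for a renamed repository, or the ID unchanged otherwise.
    return RENAMED_REPOS.get(repo_id, repo_id)


resolved = resolve_repo_id("runwayml/stable-diffusion-v1-5")
untouched = resolve_repo_id("some-org/some-model")
```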

4f5e3e35

02 Jan, 2025 1 commit

Fix AutoPipeline `from_pipe` where source pipeline is missing target... · c28db0aa

hlky authored Jan 02, 2025


Fix AutoPipeline `from_pipe` where source pipeline is missing target pipeline's optional components (#10400)

* Optional components in AutoPipeline

* missing_modules

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
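
The idea behind the fix can be sketched with plain dictionaries: when the source pipeline lacks a component that the target pipeline treats as optional, pass `None` instead of failing. This is a hypothetical illustration, not the diffusers implementation; `components_for_target` and the component names are made up for the example.

```python
from typing import Any, Dict, Iterable


def components_for_target(
    source_components: Dict[str, Any],
    expected: Iterable[str],
    optional: Iterable[str],
) -> Dict[str, Any]:
    expected = list(expected)
    # Components the target expects but the source does not provide.
    missing_modules = set(expected) - set(source_components)
    missing_required = missing_modules - set(optional)
    if missing_required:
        raise ValueError(f"missing required components: {sorted(missing_required)}")
    # Missing optional components are filled with None.
    return {name: source_components.get(name) for name in expected}


source = {"unet": "unet-object", "vae": "vae-object"}
target_kwargs = components_for_target(
    source,
    expected=["unet", "vae", "image_encoder"],
    optional=["image_encoder"],
)
```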

c28db0aa

19 Dec, 2024 1 commit
- Add Flux Control to AutoPipeline (#10292) · 4450d26b
  hlky authored Dec 19, 2024
  
  4450d26b
16 Dec, 2024 1 commit
- Add ControlNetUnion to AutoPipeline from_pretrained (#10219) · 5ed761a6
  hlky authored Dec 16, 2024
  
  5ed761a6
10 Dec, 2024 1 commit

Add PAG Support for Stable Diffusion Inpaint Pipeline (#9386) · 65b98b5d

Darshil Jariwala authored Dec 11, 2024



* using sd inpaint pipeline and sdxl pag inpaint pipeline to add changes

* using sd inpaint pipeline and sdxl pag inpaint pipeline to add changes

* finished the call function

* added auto pipeline

* merging diffusers

* ready to test

* ready to test

* added copied from and removed unnecessary tests

* make style changes

* doc changes

* updating example doc string

* style fix

* init

* adding imports

* quality

* Update src/diffusers/pipelines/pag/pipeline_pag_sd_inpaint.py

* make

* Update tests/pipelines/pag/test_pag_sd_inpaint.py

* slice and size

* slice

---------
Co-authored-by: Darshil Jariwala <darshiljariwala@Darshils-MacBook-Air.local>
Co-authored-by: Darshil Jariwala <jariwala.darshil2002@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: hlky <hlky@hlky.ac>

65b98b5d

03 Dec, 2024 1 commit

Add StableDiffusion3PAGImg2Img Pipeline + Fix SD3 Unconditional PAG (#9932) · 63b631f3

Benjamin Paine authored Dec 03, 2024



* fix progress bar updates in SD 1.5 PAG Img2Img pipeline



---------
Co-authored-by: Vinh H. Pham <phamvinh257@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

63b631f3

14 Oct, 2024 1 commit

CogView3Plus DiT (#9570) · 8d81564b

Yuxuan.Zhang authored Oct 14, 2024

* merge 9588

* max_shard_size="5GB" for colab running

* conversion script updates; modeling test; refactor transformer

* make fix-copies

* Update convert_cogview3_to_diffusers.py

* initial pipeline draft

* make style

* fight bugs 🐛

🪳

* add example

* add tests; refactor

* make style

* make fix-copies

* add co-author

YiYi Xu <yixu310@gmail.com>

* remove files

* add docs

* add co-author
Co-Authored-By: YiYi Xu <yixu310@gmail.com>

* fight docs

* address reviews

* make style

* make model work

* remove qkv fusion

* remove qkv fusion tests

* address review comments

* fix make fix-copies error

* remove None and TODO

* for FP16(draft)

* make style

* remove dynamic cfg

* remove pooled_projection_dim as a parameter

* fix tests

---------
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

8d81564b

09 Oct, 2024 1 commit

add PAG support for SD Img2Img (#9463) · af28ae2d

SahilCarterr authored Oct 10, 2024



* added pag to sd img2img pipeline


---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

af28ae2d

01 Oct, 2024 1 commit
- Add PAG support to StableDiffusionControlNetPAGInpaintPipeline (#8875) · 33fafe3d
  JuanCarlosPi authored Oct 01, 2024
```
* Add pag to controlnet inpainting pipeline


---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
```
  33fafe3d
20 Sep, 2024 1 commit
- Several fixes to Flux ControlNet pipelines (#9472) · 14a1b86f
  Vladimir Mandic authored Sep 19, 2024
```
* fix flux controlnet pipelines

---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>
```
  14a1b86f
06 Sep, 2024 1 commit
- add flux inpaint + img2img + controlnet to auto pipeline (#9367) · 8cdcdd9e
  YiYi Xu authored Sep 06, 2024
  
  8cdcdd9e
21 Aug, 2024 1 commit

Add StableDiffusionXLControlNetPAGImg2ImgPipeline (#8990) · 9003d75f

satani99 authored Aug 21, 2024



* Added pag controlnet sdxl img2img pipeline

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

9003d75f

20 Aug, 2024 1 commit
- Fix StableDiffusionXLPAGInpaintPipeline (#9128) · 16a3dad4
  Sangwon Lee authored Aug 21, 2024
  
  16a3dad4
19 Aug, 2024 1 commit
- fix autopipeline for kolors img2img (#9212) · 67f5cce2
  YiYi Xu authored Aug 19, 2024
```
fix
```
  67f5cce2
17 Aug, 2024 1 commit
- Add Lumina T2I Auto Pipe Mapping (#8962) · b3825500
  Beinsezii authored Aug 17, 2024
  
  b3825500
07 Aug, 2024 1 commit

[Kolors] Add PAG (#8934) · 39e1f7ea

Álvaro Somoza authored Aug 06, 2024



* txt2img pag added

* autopipe added, fixed case

* style

* apply suggestions

* added fast tests, added todo tests

* revert dummy objects for kolors

* fix pag dummies

* fix test imports

* update pag tests

* add kolors pag to docs

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

39e1f7ea

06 Aug, 2024 1 commit

add PAG support for Stable Diffusion 3 (#8861) · 926daa30

Ahn Donghoon (안동훈 / suno) authored Aug 07, 2024



add pag sd3


---------
Co-authored-by: HyoungwonCho <jhw9811@korea.ac.kr>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: crepejung00 <jaewoojung00@naver.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>

926daa30

05 Aug, 2024 2 commits

add sentencepiece as a soft dependency (#9065) · bc3c73ad

YiYi Xu authored Aug 05, 2024



* add sentencepiece as a soft dependency for kolors

* up

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

bc3c73ad

PAG variant for HunyuanDiT, PAG refactor (#8936) · b7058d14

Aryan authored Aug 05, 2024



* copy hunyuandit pipeline

* pag variant of hunyuan dit

* add tests

* update docs

* make style

* make fix-copies

* Update src/diffusers/pipelines/pag/pag_utils.py

* remove incorrect copied from

* remove pag hunyuan attn procs to resolve conflicts

* add pag attn procs again

* new implementation for pag_utils

* revert pag changes

* add pag refactor back; update pixart sigma

* update pixart pag tests

* apply suggestions from review

Co-Authored-By: yixu310@gmail.com

* make style

* update docs, fix tests

* fix tests

* fix test_components_function since list not accepted as valid __init__ param

* apply patch to fix broken tests
Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com>

* make style

* fix hunyuan tests

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

b7058d14

02 Aug, 2024 1 commit

[Core] Add PAG support for PixArtSigma (#8921) · 7b98c4cc

Sayak Paul authored Aug 02, 2024

* feat: add pixart sigma pag.

* inits.

* fixes

* fix

* remove print.

* copy paste methods to the pixart pag mixin

* fix-copies

* add documentation.

* add tests.

* remove correction file.

* remove pag_applied_layers

* empty

7b98c4cc

01 Aug, 2024 1 commit

Flux pipeline (#9043) · 27637a54

Sayak Paul authored Aug 02, 2024



add flux!
Signed-off-by: Adrien <adrien@huggingface.co>
Co-authored-by: Adrien <adrien.69740@gmail.com>
Co-authored-by: Anatoly Belikov <abelikov@singularitynet.io>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>

27637a54

18 Jul, 2024 1 commit
- [Core] remove `resume_download` from Hub related stuff (#8648) · e02ec27e
  Sayak Paul authored Jul 18, 2024
```
* remove resume_download

* fix: _fetch_index_file call.

* remove resume_download from docs.
```
  e02ec27e
17 Jul, 2024 1 commit

Add AuraFlowPipeline and KolorsPipeline to auto map (#8849) · e15a8e7f

Beinsezii authored Jul 16, 2024



* Add AuraFlowPipeline and KolorsPipeline to auto map

Just T2I. Validated using `quickdif`

* Add Kolors I2I and SD3 Inpaint auto maps

* style

---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>

e15a8e7f

12 Jul, 2024 1 commit

add PAG support sd15 controlnet (#8820) · d704b3bf

Nguyễn Công Tú Anh authored Jul 12, 2024



* add pag support sd15 controlnet

* fix quality import

* remove unnecessary import

* remove if state

* fix tests

* remove useless function

* add sd1.5 controlnet pag docs

---------
Co-authored-by: anhnct8 <anhnct8@fpt.com>

d704b3bf

29 Jun, 2024 1 commit
- add PAG support for SD architecture (#8725) · 8690e8b9
  Shauray Singh authored Jun 30, 2024
```
* add pag to sd pipelines
```
  8690e8b9
25 Jun, 2024 1 commit

add PAG support (#7944) · 540399f5

YiYi Xu authored Jun 25, 2024



* first draft


---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>
Co-authored-by: Junhwa Song <ethan9867@gmail.com>
Co-authored-by: Ahn Donghoon (안동훈 / suno) <suno.vivid@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

540399f5

13 Jun, 2024 1 commit
- Add Hunyuan AutoPipe mapping (#8505) · 7f51f286
  Beinsezii authored Jun 12, 2024
  
  7f51f286
12 Jun, 2024 1 commit
- Add SD3 AutoPipeline mappings (#8489) · 24bdf4b2
  Beinsezii authored Jun 12, 2024
  
  24bdf4b2
03 May, 2024 1 commit

Respect `resume_download` deprecation (#7843) · 6a479588

Lucain authored May 03, 2024



* Deprecate resume_download

* align docstring with transformers

* style

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

6a479588

26 Apr, 2024 2 commits
- Add PixArtSigmaPipeline to AutoPipeline mapping (#7783) · 0d2d424f
  Beinsezii authored Apr 26, 2024
  
  0d2d424f
- [docs] Fix AutoPipeline docstring (#7779) · e24e54fd
  Steven Liu authored Apr 26, 2024
```
fix
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  e24e54fd
18 Mar, 2024 1 commit

Add Cascade to Auto T2I + Decoder mappings (#7362) · ad0308b3

Beinsezii authored Mar 18, 2024



* Add Cascade to Auto T2I + Decoder mappings

* ruff autofix

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

ad0308b3

04 Mar, 2024 1 commit

Fix typos (#7181) · f4977abc

M. Tolga Cangöz authored Mar 04, 2024

* Fix typos

* Fix typos

* Fix typos and update documentation in lora.md

f4977abc