Commits · 27637a54021e045d1eb3568a4ce76eaa39efcb26 · renzhc / diffusers_dcu

"vscode:/vscode.git/clone" did not exist on "cccde032f42f9351bc7b7cb4e36928c212f5c7ce"

01 Aug, 2024 1 commit

Sayak Paul authored Aug 02, 2024



add flux!
Signed-off-by: Adrien <adrien@huggingface.co>
Co-authored-by: Adrien <adrien.69740@gmail.com>
Co-authored-by: Anatoly Belikov <abelikov@singularitynet.io>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>

27637a54

30 Jul, 2024 2 commits

Stable Audio integration (#8716) · 69e72b1d

Yoach Lacombe authored Jul 30, 2024



* WIP modeling code and pipeline

* add custom attention processor + custom activation + add to init

* correct ProjectionModel forward

* add stable audio to __initèè

* add autoencoder and update pipeline and modeling code

* add half Rope

* add partial rotary v2

* add temporary modfis to scheduler

* add EDM DPM Solver

* remove TODOs

* clean GLU

* remove att.group_norm to attn processor

* revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py

* refactor GLU -> SwiGLU

* remove redundant args

* add channel multiples in autoencoder docstrings

* changes in docsrtings and copyright headers

* clean pipeline

* further cleaning

* remove peft and lora and fromoriginalmodel

* Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace

* make style

* dummy models

* fix copied from

* add fast oobleck tests

* add brownian tree

* oobleck autoencoder slow tests

* remove TODO

* fast stable audio pipeline tests

* add slow tests

* make style

* add first version of docs

* wrap is_torchsde_available to the scheduler

* fix slow test

* test with input waveform

* add input waveform

* remove some todos

* create stableaudio gaussian projection + make style

* add pipeline to toctree

* fix copied from

* make quality

* refactor timestep_features->time_proj

* refactor joint_attention_kwargs->cross_attention_kwargs

* remove forward_chunk

* move StableAudioDitModel to transformers folder

* correct convert + remove partial rotary embed

* apply suggestions from yiyixuxu -> removing attn.kv_heads

* remove temb

* remove cross_attention_kwargs

* further removal of cross_attention_kwargs

* remove text encoder autocast to fp16

* continue removing autocast

* make style

* refactor how text and audio are embedded

* add paper

* update example code

* make style

* unify projection model forward + fix device placement

* make style

* remove fuse qkv

* apply suggestions from review

* Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* make style

* smaller models in fast tests

* pass sequential offloading fast tests

* add docs for vae and autoencoder

* make style and update example

* remove useless import

* add cosine scheduler

* dummy classes

* cosine scheduler docs

* better description of scheduler

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

69e72b1d

[LoRA] fix: animate diff lora stuff. (#8995) · 8c4856cd
Sayak Paul authored Jul 30, 2024
```
* fix: animate diff lora stuff.

* fix scaling function for UNetMotionModel

* emoty
```
8c4856cd

26 Jul, 2024 5 commits

[Kolors] Add IP Adapter (#8901) · 73acebb8

Álvaro Somoza authored Jul 26, 2024

* initial draft

* apply suggestions

* fix failing test

* added ipa to img2img

* add docs

* apply suggestions

73acebb8

remove unused code from pag attn procs (#8928) · ca0747a0
Aryan authored Jul 26, 2024

ca0747a0

[core] AnimateDiff SparseCtrl (#8897) · 5c53ca5e

Aryan authored Jul 26, 2024

* initial sparse control model draft

* remove unnecessary implementation

* copy animatediff pipeline

* remove deprecated callbacks

* update

* update pipeline implementation progress

* make style

* make fix-copies

* update progress

* add partially working pipeline

* remove debug prints

* add model docs

* dummy objects

* improve motion lora conversion script

* fix bugs

* update docstrings

* remove unnecessary model params; docs

* address review comment

* add copied from to zero_module

* copy animatediff test

* add fast tests

* update docs

* update

* update pipeline docs

* fix expected slice values

* fix license

* remove get_down_block usage

* remove temporal_double_self_attention from get_down_block

* update

* update docs with org and documentation images

* make from_unet work in sparsecontrolnetmodel

* add latest freeinit test from #8969

* make fix-copies

* LoraLoaderMixin -> StableDiffsuionLoraLoaderMixin

5c53ca5e

[Chore] add `LoraLoaderMixin` to the inits (#8981) · d87fe95f

Sayak Paul authored Jul 26, 2024



* introduce  to promote reusability.

* up

* add more tests

* up

* remove comments.

* fix fuse_nan test

* clarify the scope of fuse_lora and unfuse_lora

* remove space

* rewrite fuse_lora a bit.

* feedback

* copy over load_lora_into_text_encoder.

* address dhruv's feedback.

* fix-copies

* fix issubclass.

* num_fused_loras

* fix

* fix

* remove mapping

* up

* fix

* style

* fix-copies

* change to SD3TransformerLoRALoadersMixin

* Apply suggestions from code review
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* up

* handle wuerstchen

* up

* move lora to lora_pipeline.py

* up

* fix-copies

* fix documentation.

* comment set_adapters().

* fix-copies

* fix set_adapters() at the model level.

* fix?

* fix

* loraloadermixin.

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

d87fe95f

[Chore] remove all is from auraflow. (#8980) · 50e66f2f
Sayak Paul authored Jul 26, 2024
```
remove all is from auraflow.
```
50e66f2f

25 Jul, 2024 3 commits

Revert "[LoRA] introduce LoraBaseMixin to promote reusability." (#8976) · 62863bb1
YiYi Xu authored Jul 25, 2024
```
Revert "[LoRA] introduce LoraBaseMixin to promote reusability. (#8774)"

This reverts commit 527430d0.
```
62863bb1

[LoRA] introduce LoraBaseMixin to promote reusability. (#8774) · 527430d0

Sayak Paul authored Jul 25, 2024



* introduce  to promote reusability.

* up

* add more tests

* up

* remove comments.

* fix fuse_nan test

* clarify the scope of fuse_lora and unfuse_lora

* remove space

* rewrite fuse_lora a bit.

* feedback

* copy over load_lora_into_text_encoder.

* address dhruv's feedback.

* fix-copies

* fix issubclass.

* num_fused_loras

* fix

* fix

* remove mapping

* up

* fix

* style

* fix-copies

* change to SD3TransformerLoRALoadersMixin

* Apply suggestions from code review
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* up

* handle wuerstchen

* up

* move lora to lora_pipeline.py

* up

* fix-copies

* fix documentation.

* comment set_adapters().

* fix-copies

* fix set_adapters() at the model level.

* fix?

* fix

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

527430d0

[tests] speed up animatediff tests (#8846) · 3ae0ee88

Aryan authored Jul 25, 2024



* speed up animatediff tests

* fix pia test_ip_adapter_single

* fix tests/pipelines/pia/test_pia.py::PIAPipelineFastTests::test_dict_tuple_outputs_equivalent

* update

* fix ip adapter tests

* skip test_from_pipe_consistent_config tests

* fix prompt_embeds test

* update test_from_pipe_consistent_config tests

* fix expected_slice values

* remove temporal_norm_num_groups from UpBlockMotion

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

3ae0ee88

24 Jul, 2024 2 commits

remove residual i from auraflow. (#8949) · 41b705f4
Sayak Paul authored Jul 24, 2024
```
* remove residual i.

* rename to aura_flow in pipeline test
```
41b705f4

[Core] fix QKV fusion for attention (#8829) · 50d21f7c

Sayak Paul authored Jul 24, 2024

* start debugging the problem,

* start

* fix

* fix

* fix imports.

* handle hunyuan

* remove residuals.

* add a check for making sure there's appropriate procs.

* add more rigor to the tests.

* fix test

* remove redundant check

* fix-copies

* move check_qkv_fusion_matches_attn_procs_length and check_qkv_fusion_processors_exist.

50d21f7c

23 Jul, 2024 1 commit

Add attentionless VAE support (#8769) · 77c5de2e

Vishnu V Jaddipal authored Jul 23, 2024



* Add attentionless VAE support

* make style and quality, fix-copies

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

77c5de2e

20 Jul, 2024 2 commits
- [fix code annotation] Adjust the dimensions of the rotary positional embedding. (#8890) · 461efc57
  王奇勋 authored Jul 20, 2024
```
* 2d rotary pos emb dim

* make style

---------
Co-authored-by: haofanwang <haofanwang.ai@gmail.com>
```
  461efc57
- fix loop bug in SlicedAttnProcessor (#8836) · 3b04cdc8
  shinetzh authored Jul 20, 2024
```
* fix loop bug in SlicedAttnProcessor


---------
Co-authored-by: neoshang <neoshang@tencent.com>
```
  3b04cdc8
18 Jul, 2024 2 commits
- [Core] remove `resume_download` from Hub related stuff (#8648) · e02ec27e
  Sayak Paul authored Jul 18, 2024
```
* remove resume_download

* fix: _fetch_index_file call.

* remove resume_download from docs.
```
  e02ec27e
- [Chore] add disable forward chunking to SD3 transformer. (#8838) · a41e4c50
  Sayak Paul authored Jul 18, 2024
```
add disable forward chunking to SD3 transformer.
```
  a41e4c50
12 Jul, 2024 2 commits
- [Docs] add AuraFlow docs (#8851) · 973a62d4
  Sayak Paul authored Jul 12, 2024
```
* add pipeline documentation.

* add api spec for pipeline

* model documentation

* model spec
```
  973a62d4
- Add single file loading support for AnimateDiff (#8819) · 11d18f32
  Dhruv Nair authored Jul 12, 2024
```
* update

* update

* update

* update
```
  11d18f32
11 Jul, 2024 4 commits

Add VAE tiling option for SD3 (#8791) · d2df40c6
Dhruv Nair authored Jul 12, 2024
```
update
```
d2df40c6

[Core] Add AuraFlow (#8796) · 2261510b

Sayak Paul authored Jul 11, 2024



* add lavender flow transformer

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

2261510b

Latte: Latent Diffusion Transformer for Video Generation (#8404) · b8cf84a3

Xin Ma authored Jul 11, 2024



* add Latte to diffusers

* remove print

* remove print

* remove print

* remove unuse codes

* remove layer_norm_latte and add a flag

* remove layer_norm_latte and add a flag

* update latte_pipeline

* update latte_pipeline

* remove unuse squeeze

* add norm_hidden_states.ndim == 2: # for Latte

* fixed test latte pipeline bugs

* fixed test latte pipeline bugs

* delete sh

* add doc for latte

* add licensing

* Move Transformer3DModelOutput to modeling_outputs

* give a default value to sample_size

* remove the einops dependency

* change norm2 for latte

* modify pipeline of latte

* update test for Latte

* modify some codes for latte

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* modify for Latte pipeline

* video_length -> num_frames; update prepare_latents copied from

* make fix-copies

* make style

* typo: videe -> video

* update

* modify for Latte pipeline

* modify latte pipeline

* modify latte pipeline

* modify latte pipeline

* modify latte pipeline

* modify for Latte pipeline

* Delete .vscode directory

* make style

* make fix-copies

* add latte transformer 3d to docs _toctree.yml

* update example

* reduce frames for test

* fixed bug of _text_preprocessing

* set num frame to 1 for testing

* remove unuse print

* add text = self._clean_caption(text) again

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>

b8cf84a3

Reformat docstring for `get_timestep_embedding` (#8811) · 673eb60f

Alan Du authored Jul 10, 2024



* Reformat docstring for `get_timestep_embedding`


---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

673eb60f

08 Jul, 2024 2 commits

Remove unnecessary lines (#8569) · 57084dac

Tolga Cangöz authored Jul 08, 2024



* Remove unused line


---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

57084dac

[Alpha-VLLM Team] Add Lumina-T2X to diffusers (#8652) · 98388670

PommesPeter authored Jul 08, 2024




---------
Co-authored-by: zhuole1025 <zhuole1025@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

98388670

06 Jul, 2024 1 commit

fix loading sharded checkpoints from subfolder (#8798) · 9e9ed353

YiYi Xu authored Jul 06, 2024



* fix load sharded checkpoints from subfolder{

* style

* os.path.join

* add a small test

---------
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>

9e9ed353

04 Jul, 2024 1 commit
- [Tests] fix sharding tests (#8764) · 31adeb41
  Sayak Paul authored Jul 04, 2024
```
fix sharding tests
```
  31adeb41
03 Jul, 2024 4 commits

[Tencent Hunyuan Team] Add checkpoint conversion scripts and changed controlnet (#8783) · 6b6b4bcf

XCL authored Jul 04, 2024



* add conversion files; changed controlnet for hunyuandit

* style

---------
Co-authored-by: xingchaoliu <xingchaoliu@tencent.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>

6b6b4bcf

[Chore] add dummy lora attention processors to prevent failures in other libs (#8777) · 06ee4db3
Sayak Paul authored Jul 03, 2024
```
add dummy lora attention processors to prevent failures in other libs
```
06ee4db3
Revert "[LoRA] introduce `LoraBaseMixin` to promote reusability." (#8773) · 984d3405
Sayak Paul authored Jul 03, 2024
```
Revert "[LoRA] introduce `LoraBaseMixin` to promote reusability. (#8670)"

This reverts commit a2071a18.
```
984d3405

[LoRA] introduce `LoraBaseMixin` to promote reusability. (#8670) · a2071a18

Sayak Paul authored Jul 03, 2024

* introduce  to promote reusability.

* up

* add more tests

* up

* remove comments.

* fix fuse_nan test

* clarify the scope of fuse_lora and unfuse_lora

* remove space

a2071a18

02 Jul, 2024 3 commits

correct `attention_head_dim` for `JointTransformerBlock` (#8608) · d9f71ab3

YiYi Xu authored Jul 02, 2024



* add

* update sd3 controlnet

* Update src/diffusers/models/controlnet_sd3.py

---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

d9f71ab3

Fix warning in UNetMotionModel (#8756) · c104482b

Dhruv Nair authored Jul 02, 2024



* update

* Update src/diffusers/models/unets/unet_motion_model.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

c104482b

[hunyuan-dit] refactor `HunyuanCombinedTimestepTextSizeStyleEmbedding` (#8761) · 8b1e3ec9
YiYi Xu authored Jul 01, 2024
```
up
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
8b1e3ec9

01 Jul, 2024 2 commits

Allow from_transformer in SD3ControlNetModel (#8749) · 0bae6e44
Haofan Wang authored Jul 02, 2024
```
* Update controlnet_sd3.py

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
```
0bae6e44

[Tencent Hunyuan Team] Add HunyuanDiT-v1.2 Support (#8747) · a3904d7e

XCL authored Jul 01, 2024



* add v1.2 support

---------
Co-authored-by: xingchaoliu <xingchaoliu@tencent.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>

a3904d7e

27 Jun, 2024 2 commits

[Chore] perform better deprecation for vqmodeloutput (#8719) · d5dd8df3
Sayak Paul authored Jun 27, 2024
```
perform better deprecation for vqmodeloutput
```
d5dd8df3

Motion Model / Adapter versatility (#8301) · 3e0d128d

Mathis Koroglu authored Jun 27, 2024

* Motion Model / Adapter versatility

- allow to use a different number of layers per block
- allow to use a different number of transformer per layers per block
- allow a different number of motion attention head per block
- use dropout argument in get_down/up_block in 3d blocks

* Motion Model added arguments renamed & refactoring

* Add test for asymmetric UNetMotionModel

3e0d128d

26 Jun, 2024 1 commit
- [Chore] remove deprecation from transformer2d regarding the output class. (#8698) · 10b4e354
  Sayak Paul authored Jun 26, 2024
```
* remove deprecation from transformer2d regarding the output class.

* up

* deprecate more
```
  10b4e354