Commits · 7d0b9c4d4ee4ef08908ccc77ee91104d5498feb3 · renzhc / diffusers_dcu

19 Nov, 2024 1 commit
- [LoRA] feat: `save_lora_adapter()` (#9862) · 7d0b9c4d
  Sayak Paul authored Nov 19, 2024
```
* feat: save_lora_adapter.
```
  7d0b9c4d
18 Nov, 2024 1 commit

Yuxuan.Zhang authored Nov 19, 2024



* CogVideoX1_1PatchEmbed test

* 1360 * 768

* refactor

* make style

* update docs

* add modeling tests for cogvideox 1.5

* update

* make fix-copies

* add ofs embed(for convert)

* add ofs embed(for convert)

* more resolution for cogvideox1.5-5b-i2v

* use even number of latent frames only

* update pipeline implementations

* make style

* set patch_size_t as None by default

* #skip frames 0

* refactor

* make style

* update docs

* fix ofs_embed

* update docs

* invert_scale_latents

* update

* fix

* Update docs/source/en/api/pipelines/cogvideox.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/cogvideox.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/cogvideox.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/cogvideox.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update src/diffusers/models/transformers/cogvideox_transformer_3d.py

* update conversion script

* remove copied from

* fix test

* Update docs/source/en/api/pipelines/cogvideox.md

* Update docs/source/en/api/pipelines/cogvideox.md

* Update docs/source/en/api/pipelines/cogvideox.md

* Update docs/source/en/api/pipelines/cogvideox.md

---------
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

3b283061

05 Nov, 2024 1 commit

[core] Mochi T2V (#9769) · 3f329a42

Aryan authored Nov 05, 2024



* update

* udpate

* update transformer

* make style

* fix

* add conversion script

* update

* fix

* update

* fix

* update

* fixes

* make style

* update

* update

* update

* init

* update

* update

* add

* up

* up

* up

* update

* mochi transformer

* remove original implementation

* make style

* update inits

* update conversion script

* docs

* Update src/diffusers/pipelines/mochi/pipeline_mochi.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Update src/diffusers/pipelines/mochi/pipeline_mochi.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* fix docs

* pipeline fixes

* make style

* invert sigmas in scheduler; fix pipeline

* fix pipeline num_frames

* flip proj and gate in swiglu

* make style

* fix

* make style

* fix tests

* latent mean and std fix

* update

* cherry-pick 1069d210e1b9e84a366cdc7a13965626ea258178

* remove additional sigma already handled by flow match scheduler

* fix

* remove hardcoded value

* replace conv1x1 with linear

* Update src/diffusers/pipelines/mochi/pipeline_mochi.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* framewise decoding and conv_cache

* make style

* Apply suggestions from code review

* mochi vae encoder changes

* rebase correctly

* Update scripts/convert_mochi_to_diffusers.py

* fix tests

* fixes

* make style

* update

* make style

* update

* add framewise and tiled encoding

* make style

* make original vae implementation behaviour the default; note: framewise encoding does not work

* remove framewise encoding implementation due to presence of attn layers

* fight test 1

* fight test 2

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>

3f329a42

31 Oct, 2024 1 commit
- [Tests] clean up and refactor gradient checkpointing tests (#9494) · 4adf6aff
  Sayak Paul authored Oct 31, 2024
```
* check.

* fixes

* fixes

* updates

* fixes

* fixes
```
  4adf6aff
29 Oct, 2024 1 commit

[core] Allegro T2V (#9736) · 0d1d267b

Aryan authored Oct 29, 2024



* update

* refactor transformer part 1

* refactor part 2

* refactor part 3

* make style

* refactor part 4; modeling tests

* make style

* refactor part 5

* refactor part 6

* gradient checkpointing

* pipeline tests (broken atm)

* update

* add coauthor
Co-Authored-By: Huan Yang <hyang@fastmail.com>

* refactor part 7

* add docs

* make style

* add coauthor
Co-Authored-By: YiYi Xu <yixu310@gmail.com>

* make fix-copies

* undo unrelated change

* revert changes to embeddings, normalization, transformer

* refactor part 8

* make style

* refactor part 9

* make style

* fix

* apply suggestions from review

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* update example

* remove attention mask for self-attention

* update

* copied from

* update

* update

---------
Co-authored-by: Huan Yang <hyang@fastmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

0d1d267b

21 Oct, 2024 1 commit

minor doc/test update (#9734) · e2d037bb

YiYi Xu authored Oct 21, 2024



* update some docs and tests!

---------
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: apolinário <joaopaulo.passos@gmail.com>

e2d037bb

14 Oct, 2024 1 commit

CogView3Plus DiT (#9570) · 8d81564b

Yuxuan.Zhang authored Oct 14, 2024

* merge 9588

* max_shard_size="5GB" for colab running

* conversion script updates; modeling test; refactor transformer

* make fix-copies

* Update convert_cogview3_to_diffusers.py

* initial pipeline draft

* make style

* fight bugs 🐛

🪳

* add example

* add tests; refactor

* make style

* make fix-copies

* add co-author

YiYi Xu <yixu310@gmail.com>

* remove files

* add docs

* add co-author
Co-Authored-By: YiYi Xu <yixu310@gmail.com>

* fight docs

* address reviews

* make style

* make model work

* remove qkv fusion

* remove qkv fusion tets

* address review comments

* fix make fix-copies error

* remove None and TODO

* for FP16(draft)

* make style

* remove dynamic cfg

* remove pooled_projection_dim as a parameter

* fix tests

---------
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

8d81564b

02 Oct, 2024 1 commit

Support bfloat16 for Upsample2D (#9480) · 61d37640

Darren Hsu authored Oct 01, 2024



* Support bfloat16 for Upsample2D

* Add test and use is_torch_version

* Resolve comments and add decorator

* Simplify require_torch_version_greater_equal decorator

* Run make style

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

61d37640

28 Sep, 2024 1 commit

[Core] fix variant-identification. (#9253) · 11542431

Sayak Paul authored Sep 28, 2024



* fix variant-idenitification.

* fix variant

* fix sharded variant checkpoint loading.

* Apply suggestions from code review

* fixes.

* more fixes.

* remove print.

* fixes

* fixes

* comments

* fixes

* apply suggestions.

* hub_utils.py

* fix test

* updates

* fixes

* fixes

* Apply suggestions from code review
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* updates.

* removep patch file.

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

11542431

24 Sep, 2024 1 commit
- a few fix for SingleFile tests (#9522) · bac8a241
  YiYi Xu authored Sep 24, 2024
```
* update sd15 repo

* update more
```
  bac8a241
21 Sep, 2024 1 commit

[CI] fix nightly model tests (#9483) · aa73072f

Sayak Paul authored Sep 21, 2024

* check if default attn procs fix it.

* print

* print

* replace

* style./

* replace revision with variant.

* replace with stable-diffusion-v1-5/stable-diffusion-inpainting.

* replace with stable-diffusion-v1-5/stable-diffusion-v1-5.

* fix

aa73072f

12 Sep, 2024 1 commit

[CI] Nightly Test Updates (#9380) · 1e8cf276

Dhruv Nair authored Sep 12, 2024



* update

* update

* update

* update

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

1e8cf276

04 Sep, 2024 1 commit
- [tests] make 2 tests device-agnostic (#9347) · 2ee32159
  Fanli Lin authored Sep 04, 2024
```
* enabel on xpu

* fix style
```
  2ee32159
03 Sep, 2024 2 commits

[tests] remove/speedup some low signal tests (#9285) · 24053832

Aryan authored Sep 03, 2024

* remove 2 shapes from SDFunctionTesterMixin::test_vae_tiling

* combine freeu enable/disable test to reduce many inference runs

* remove low signal unet test for signature

* remove low signal embeddings test

* remove low signal progress bar test from PipelineTesterMixin

* combine ip-adapter single and multi tests to save many inferences

* fix broken tests

* Update tests/pipelines/test_pipelines_common.py

* Update tests/pipelines/test_pipelines_common.py

* add progress bar tests

24053832

[CI] More Fast GPU Test Fixes (#9346) · f6f16a0c
Dhruv Nair authored Sep 03, 2024
```
* update

* update

* update

* update
```
f6f16a0c

02 Sep, 2024 1 commit
- [CI] More fixes for Fast GPU Tests on main (#9300) · 007ad0e2
  Dhruv Nair authored Sep 02, 2024
```
update
```
  007ad0e2
28 Aug, 2024 1 commit

AnimateDiff prompt travel (#9231) · cbc2ec8f

Aryan authored Aug 28, 2024

* update

* implement prompt interpolation

* make style

* resnet memory optimizations

* more memory optimizations; todo: refactor

* update

* update animatediff controlnet with latest changes

* refactor chunked inference changes

* remove print statements

* undo memory optimization changes

* update docstrings

* fix tests

* fix pia tests

* apply suggestions from review

* add tests

* update comment

cbc2ec8f

21 Aug, 2024 1 commit

Flux followup (#9074) · c2916175

YiYi Xu authored Aug 21, 2024

* refactor rotary embeds

* adding jsmidt as co-author of this PR for https://github.com/huggingface/diffusers/pull/9133



---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Joseph Smidt <josephsmidt@gmail.com>

c2916175

19 Aug, 2024 2 commits
- [CI] Multiple Slow Test fixes. (#9198) · 940b8e03
  Dhruv Nair authored Aug 19, 2024
```
* update

* update

* update

* update
```
  940b8e03
- [Tests] Improve transformers model test suite coverage - Lumina (#8987) · ba4348d9
  M Saqlain authored Aug 19, 2024
```
* Added test suite for lumina

* Fixed failing tests

* Improved code quality

* Added function docstrings

* Improved formatting
```
  ba4348d9
18 Aug, 2024 1 commit
- feat: allow sharding for auraflow. (#8853) · f848feba
  Sayak Paul authored Aug 18, 2024
  
  f848feba
16 Aug, 2024 1 commit
- feat: allow flux transformer to be sharded during inference (#9159) · 39b87b14
  Sayak Paul authored Aug 16, 2024
```
* feat: support sharding for flux.

* tests
```
  39b87b14
13 Aug, 2024 1 commit

[refactor] CogVideoX followups + tiled decoding support (#9150) · a85b34e7

Aryan authored Aug 14, 2024

* refactor context parallel cache; update torch compile time benchmark

* add tiling support

* make style

* remove num_frames % 8 == 0 requirement

* update default num_frames to original value

* add explanations + refactor

* update torch compile example

* update docs

* update

* clean up if-statements

* address review comments

* add test for vae tiling

* update docs

* update docs

* update docstrings

* add modeling test for cogvideox transformer

* make style

a85b34e7

06 Aug, 2024 2 commits

Fix loading sharded checkpoints when we have variants (#9061) · e4325606

Marc Sun authored Aug 07, 2024



* Fix loading sharded checkpoint when we have variant

* add test

* remote print

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

e4325606

[Tests] Improve transformers model test suite coverage - Hunyuan DiT (#8916) · 87e50a2f

Vinh H. Pham authored Aug 06, 2024



* add hunyuan model test

* apply suggestions

* reduce dims further

* reduce dims further

* run make style

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

87e50a2f

05 Aug, 2024 1 commit

[Tests] Improve transformers model test suite coverage - Latte (#8919) · e1d508ae

Vinh H. Pham authored Aug 05, 2024



* add LatteTransformer3DModel model test

* change patch_size to 1

* reduce req len

* reduce channel dims

* increase num_layers

* reduce dims further

* run make style

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>

e1d508ae

02 Aug, 2024 1 commit

[Flux] allow tests to run (#9050) · 0e460675

Sayak Paul authored Aug 02, 2024

* fix tests

* fix

* float64 skip

* remove sample_size.

* remove

* remove more

* default_sample_size.

* credit black forest for flux model.

* skip

* fix: tests

* remove OriginalModelMixin

* add transformer model test

* add: transformer model tests

0e460675

01 Aug, 2024 1 commit
- fix load sharded checkpoint from a subfolder (local path) (#8913) · 95a78328
  YiYi Xu authored Aug 01, 2024
```
fix
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  95a78328
30 Jul, 2024 2 commits

Fix Stable Audio repository id (#9016) · ea1b4ea7
Yoach Lacombe authored Jul 30, 2024
```
Fix Stable Audio repo id
```
ea1b4ea7

Stable Audio integration (#8716) · 69e72b1d

Yoach Lacombe authored Jul 30, 2024



* WIP modeling code and pipeline

* add custom attention processor + custom activation + add to init

* correct ProjectionModel forward

* add stable audio to __initèè

* add autoencoder and update pipeline and modeling code

* add half Rope

* add partial rotary v2

* add temporary modfis to scheduler

* add EDM DPM Solver

* remove TODOs

* clean GLU

* remove att.group_norm to attn processor

* revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py

* refactor GLU -> SwiGLU

* remove redundant args

* add channel multiples in autoencoder docstrings

* changes in docsrtings and copyright headers

* clean pipeline

* further cleaning

* remove peft and lora and fromoriginalmodel

* Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace

* make style

* dummy models

* fix copied from

* add fast oobleck tests

* add brownian tree

* oobleck autoencoder slow tests

* remove TODO

* fast stable audio pipeline tests

* add slow tests

* make style

* add first version of docs

* wrap is_torchsde_available to the scheduler

* fix slow test

* test with input waveform

* add input waveform

* remove some todos

* create stableaudio gaussian projection + make style

* add pipeline to toctree

* fix copied from

* make quality

* refactor timestep_features->time_proj

* refactor joint_attention_kwargs->cross_attention_kwargs

* remove forward_chunk

* move StableAudioDitModel to transformers folder

* correct convert + remove partial rotary embed

* apply suggestions from yiyixuxu -> removing attn.kv_heads

* remove temb

* remove cross_attention_kwargs

* further removal of cross_attention_kwargs

* remove text encoder autocast to fp16

* continue removing autocast

* make style

* refactor how text and audio are embedded

* add paper

* update example code

* make style

* unify projection model forward + fix device placement

* make style

* remove fuse qkv

* apply suggestions from review

* Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* make style

* smaller models in fast tests

* pass sequential offloading fast tests

* add docs for vae and autoencoder

* make style and update example

* remove useless import

* add cosine scheduler

* dummy classes

* cosine scheduler docs

* better description of scheduler

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

69e72b1d

24 Jul, 2024 1 commit
- [CI] Skip flaky download tests in PR CI (#8945) · 93983b67
  Dhruv Nair authored Jul 24, 2024
```
update
```
  93983b67
23 Jul, 2024 1 commit

[Tests] Improve transformers model test suite coverage - Temporal Transformer (#8932) · 7a95f8d9

Vinh H. Pham authored Jul 23, 2024



* add test for temporal transformer

* remove unused variable

* fix code quality

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

7a95f8d9

22 Jul, 2024 1 commit
- [Tests] proper skipping of request caching test (#8908) · af400040
  Sayak Paul authored Jul 23, 2024
```
proper skipping of request caching test
```
  af400040
17 Jul, 2024 1 commit
- [Core] fix: shard loading and saving when variant is provided. (#8869) · 0f09b01a
  Sayak Paul authored Jul 17, 2024
```
fix: shard loading and saving when variant is provided.
```
  0f09b01a
11 Jul, 2024 1 commit

[Core] Add AuraFlow (#8796) · 2261510b

Sayak Paul authored Jul 11, 2024



* add lavender flow transformer

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

2261510b

09 Jul, 2024 1 commit
- [Tests] fix more sharding tests (#8797) · a785992c
  Sayak Paul authored Jul 09, 2024
```
* fix

* fix

* ugly

* okay

* fix more

* fix oops
```
  a785992c
08 Jul, 2024 1 commit

Remove unnecessary lines (#8569) · 57084dac

Tolga Cangöz authored Jul 08, 2024



* Remove unused line


---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

57084dac

06 Jul, 2024 1 commit

fix loading sharded checkpoints from subfolder (#8798) · 9e9ed353

YiYi Xu authored Jul 06, 2024



* fix load sharded checkpoints from subfolder{

* style

* os.path.join

* add a small test

---------
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>

9e9ed353

04 Jul, 2024 1 commit
- [Tests] fix sharding tests (#8764) · 31adeb41
  Sayak Paul authored Jul 04, 2024
```
fix sharding tests
```
  31adeb41
27 Jun, 2024 1 commit

Motion Model / Adapter versatility (#8301) · 3e0d128d

Mathis Koroglu authored Jun 27, 2024

* Motion Model / Adapter versatility

- allow to use a different number of layers per block
- allow to use a different number of transformer per layers per block
- allow a different number of motion attention head per block
- use dropout argument in get_down/up_block in 3d blocks

* Motion Model added arguments renamed & refactoring

* Add test for asymmetric UNetMotionModel

3e0d128d