Commits · 14a1b86fc7de53ff1dbf803f616cbb16ad530e45 · renzhc / diffusers_dcu

19 Sep, 2024 2 commits

[training] CogVideoX Lora (#9302) · 2b443a5d

Aryan authored Sep 19, 2024



* cogvideox lora training draft

* update

* update

* update

* update

* update

* make fix-copies

* update

* update

* apply suggestions from review

* apply suggestions from review

* fix typo

* Update examples/cogvideo/train_cogvideox_lora.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* fix lora alpha

* use correct lora scaling for final test pipeline

* Update examples/cogvideo/train_cogvideox_lora.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* apply suggestions from review; prodigy optimizer
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* add tests

* make style

* add README

* update

* update

* make style

* fix

* update

* add test skeleton

* revert lora utils changes

* add cleaner modifications to lora testing utils

* update lora tests

* deepspeed stuff

* add requirements.txt

* deepspeed refactor

* add lora stuff to img2vid pipeline to fix tests

* fight tests

* add co-authors
Co-Authored-By: Fu-Yun Wang <1697256461@qq.com>
Co-Authored-By: zR <2448370773@qq.com>

* fight lora runner tests

* import Dummy optim and scheduler only when required

* update docs

* add coauthors
Co-Authored-By: Fu-Yun Wang <1697256461@qq.com>

* remove option to train text encoder
Co-Authored-By: bghira <bghira@users.github.com>

* update tests

* fight more tests

* update

* fix vid2vid

* fix typo

* remove lora tests; todo in follow-up PR

* undo img2vid changes

* remove text encoder related changes in lora loader mixin

* Revert "remove text encoder related changes in lora loader mixin"

This reverts commit f8a8444487db27859be812866db4e8cec7f25691.

* update

* round 1 of fighting tests

* round 2 of fighting tests

* fix copied from comment

* fix typo in lora test

* update styling
Co-Authored-By: YiYi Xu <yixu310@gmail.com>

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: zR <2448370773@qq.com>
Co-authored-by: Fu-Yun Wang <1697256461@qq.com>
Co-authored-by: bghira <bghira@users.github.com>


[Flux] add lora integration tests. (#9353) · d13b0d63
Sayak Paul authored Sep 19, 2024
```
* add lora integration tests.

* internal note

* add a skip marker.
```

17 Sep, 2024 2 commits
- Remove CogVideoX mentions from single file docs; Test updates (#9444) · ba06124e
  Aryan authored Sep 18, 2024
```
* remove mentions from single file

* update tests

* update
```
- Feature flux controlnet img2img and inpaint pipeline (#9408) · bb1b0fa1
  Subho Ghosh authored Sep 18, 2024
```
* Implemented FLUX controlnet support to Img2Img pipeline
```
16 Sep, 2024 1 commit

CogVideoX-5b-I2V support (#9418) · 8336405e

Yuxuan.Zhang authored Sep 16, 2024



* draft Init

* draft

* vae encode image

* make style

* image latents preparation

* remove image encoder from conversion script

* fix minor bugs

* make pipeline work

* make style

* remove debug prints

* fix imports

* update example

* make fix-copies

* add fast tests

* fix import

* update vae

* update docs

* update image link

* apply suggestions from review

* apply suggestions from review

* add slow test

* make use of learned positional embeddings

* apply suggestions from review

* doc change

* Update convert_cogvideox_to_diffusers.py

* make style

* final changes

* make style

* fix tests

---------
Co-authored-by: Aryan <aryan@huggingface.co>


12 Sep, 2024 1 commit

[CI] Nightly Test Updates (#9380) · 1e8cf276

Dhruv Nair authored Sep 12, 2024



* update

* update

* update

* update

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>


11 Sep, 2024 1 commit
- [Tests] fix some fast gpu tests. (#9379) · adf1f911
  Sayak Paul authored Sep 11, 2024
```
fix some fast gpu tests.
```
09 Sep, 2024 1 commit

[Pipeline] animatediff + vid2vid + controlnet (#9337) · a7361dcc

Igor Filippov authored Sep 09, 2024

* add animatediff + vid2vid + controlnet

* post tests fixes

* PR discussion fixes

* update docs

* change input video to links on HF + update an example

* make quality fix

* fix ip adapter test

* fix ip adapter test input

* update ip adapter test


06 Sep, 2024 3 commits

add flux inpaint + img2img + controlnet to auto pipeline (#9367) · 8cdcdd9e
YiYi Xu authored Sep 06, 2024

[CI] Quick fix for Cog Video Test (#9373) · d269cc8a
Dhruv Nair authored Sep 06, 2024
```
update
```

[core] Freenoise memory improvements (#9262) · 6dfa4996

Aryan authored Sep 06, 2024

* update

* implement prompt interpolation

* make style

* resnet memory optimizations

* more memory optimizations; todo: refactor

* update

* update animatediff controlnet with latest changes

* refactor chunked inference changes

* remove print statements

* update

* chunk -> split

* remove changes from incorrect conflict resolution

* remove changes from incorrect conflict resolution

* add explanation of SplitInferenceModule

* update docs

* Revert "update docs"

This reverts commit c55a50a271b2cefa8fe340a4f2a3ab9b9d374ec0.

* update docstring for freenoise split inference

* apply suggestions from review

* add tests

* apply suggestions from review


05 Sep, 2024 1 commit
- [CI] Update Single file Nightly Tests (#9357) · 53051cf2
  Dhruv Nair authored Sep 05, 2024
```
* update

* update
```
04 Sep, 2024 2 commits
- Add Flux inpainting and Flux Img2Img (#9135) · 249a9e48
  Vishnu V Jaddipal authored Sep 05, 2024
```
---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>
```
- [tests] make 2 tests device-agnostic (#9347) · 2ee32159
  Fanli Lin authored Sep 04, 2024
```
* enable on xpu

* fix style
```
03 Sep, 2024 2 commits

[tests] remove/speedup some low signal tests (#9285) · 24053832

Aryan authored Sep 03, 2024

* remove 2 shapes from SDFunctionTesterMixin::test_vae_tiling

* combine freeu enable/disable test to reduce many inference runs

* remove low signal unet test for signature

* remove low signal embeddings test

* remove low signal progress bar test from PipelineTesterMixin

* combine ip-adapter single and multi tests to save many inferences

* fix broken tests

* Update tests/pipelines/test_pipelines_common.py

* Update tests/pipelines/test_pipelines_common.py

* add progress bar tests


[CI] More Fast GPU Test Fixes (#9346) · f6f16a0c
Dhruv Nair authored Sep 03, 2024
```
* update

* update

* update

* update
```

02 Sep, 2024 2 commits

[CI] More fixes for Fast GPU Tests on main (#9300) · 007ad0e2
Dhruv Nair authored Sep 02, 2024
```
update
```

[core] Support VideoToVideo with CogVideoX (#9333) · 0e6a8403

Aryan authored Sep 02, 2024

* add vid2vid pipeline for cogvideox

* make fix-copies

* update docs

* fake context parallel cache, vae encode tiling

* add test for cog vid2vid

* use video link from HF docs repo

* add copied from comments; correctly rename test class


28 Aug, 2024 1 commit

AnimateDiff prompt travel (#9231) · cbc2ec8f

Aryan authored Aug 28, 2024

* update

* implement prompt interpolation

* make style

* resnet memory optimizations

* more memory optimizations; todo: refactor

* update

* update animatediff controlnet with latest changes

* refactor chunked inference changes

* remove print statements

* undo memory optimization changes

* update docstrings

* fix tests

* fix pia tests

* apply suggestions from review

* add tests

* update comment


23 Aug, 2024 2 commits
- [Core] fuse_qkv_projection() to Flux (#9185) · 2d9ccf39
  Sayak Paul authored Aug 23, 2024
```
* start fusing flux.

* test

* finish fusion

* fix-copies
```
- Cogvideox-5B Model adapter change (#9203) · 960c149c
  zR authored Aug 23, 2024
```
* draft of embedding

---------
Co-authored-by: Aryan <aryan@huggingface.co>
```
22 Aug, 2024 2 commits

[tests] fix broken xformers tests (#9206) · 0ec64fe9

Aryan authored Aug 22, 2024

* fix xformers tests

* remove unnecessary modifications to cogvideox tests

* update


[Flux LoRA] support parsing alpha from a flux lora state dict. (#9236) · 5090b09d

Sayak Paul authored Aug 22, 2024

* support parsing alpha from a flux lora state dict.

* conditional import.

* fix breaking changes.

* safeguard alpha.

* fix


21 Aug, 2024 4 commits

Flux followup (#9074) · c2916175

YiYi Xu authored Aug 21, 2024

* refactor rotary embeds

* adding jsmidt as co-author of this PR for https://github.com/huggingface/diffusers/pull/9133



---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Joseph Smidt <josephsmidt@gmail.com>


Add StableDiffusionXLControlNetPAGImg2ImgPipeline (#8990) · 9003d75f

satani99 authored Aug 21, 2024



* Added pag controlnet sdxl img2img pipeline

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>


fix a regression in `is_safetensors_compatible` (#9234) · 214372aa
YiYi Xu authored Aug 21, 2024
```
fix
```

StableDiffusionLatentUpscalePipeline - positive/negative prompt embeds support (#8947) · 867e0c91

Vinh H. Pham authored Aug 21, 2024



* make latent upscaler accept prompt embeds

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>


19 Aug, 2024 3 commits
- [CI] Multiple Slow Test fixes. (#9198) · 940b8e03
  Dhruv Nair authored Aug 19, 2024
```
* update

* update

* update

* update
```
- Update `is_safetensors_compatible` check (#8991) · b2add10d
  Dhruv Nair authored Aug 19, 2024
```
* update

* update

* update

* update

* update
```
- [Tests] Improve transformers model test suite coverage - Lumina (#8987) · ba4348d9
  M Saqlain authored Aug 19, 2024
```
* Added test suite for lumina

* Fixed failing tests

* Improved code quality

* Added function docstrings

* Improved formatting
```
18 Aug, 2024 1 commit
- feat: allow sharding for auraflow. (#8853) · f848feba
  Sayak Paul authored Aug 18, 2024
  
16 Aug, 2024 1 commit
- feat: allow flux transformer to be sharded during inference (#9159) · 39b87b14
  Sayak Paul authored Aug 16, 2024
```
* feat: support sharding for flux.

* tests
```
13 Aug, 2024 3 commits

[refactor] CogVideoX followups + tiled decoding support (#9150) · a85b34e7

Aryan authored Aug 14, 2024

* refactor context parallel cache; update torch compile time benchmark

* add tiling support

* make style

* remove num_frames % 8 == 0 requirement

* update default num_frames to original value

* add explanations + refactor

* update torch compile example

* update docs

* update

* clean up if-statements

* address review comments

* add test for vae tiling

* update docs

* update docs

* update docstrings

* add modeling test for cogvideox transformer

* make style


[FLUX] Support ControlNet (#9126) · 5ffbe14c

王奇勋 authored Aug 13, 2024



* cnt model

* cnt model

* cnt model

* fix Loader "Copied"

* format

* txt_ids for multiple images

* add test and format

* typo

* Update pipeline_flux_controlnet.py

* remove

* make quality

* fix copy

* Update src/diffusers/pipelines/flux/pipeline_flux_controlnet.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Update src/diffusers/pipelines/flux/pipeline_flux_controlnet.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Update src/diffusers/pipelines/flux/pipeline_flux_controlnet.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Update src/diffusers/pipelines/flux/pipeline_flux_controlnet.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Update src/diffusers/models/controlnet_flux.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* fix

* make copies

* test

* bs

---------
Co-authored-by: haofanwang <haofanwang.ai@gmail.com>
Co-authored-by: haofanwang <haofan@HaofandeMBP.lan>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>


Support SD3 controlnet inpainting (#9099) · cc051309

林金鹏 authored Aug 13, 2024



* add controlnet inpainting pipeline

* [SD3] add controlnet inpaint example

* update example and fix code style

* fix code style with ruff

* Update controlnet_sd3.md : add control inpaint pipeline

* Update docs/source/en/api/pipelines/controlnet_sd3.md
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Update docs/source/en/api/pipelines/controlnet_sd3.md
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Update docs/source/en/api/pipelines/controlnet_sd3.md
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Update src/diffusers/pipelines/controlnet_sd3/pipeline_stable_diffusion_3_controlnet_inpainting.py
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Update __init__.py : add sd3 control pipelines

* Update pipeline : add new param doc & check input reference.

* fix typo

* make style & make quality

* add unittest for sd3 controlnet inpaint

---------
Co-authored-by: 鹏徙 <linjinpeng.ljp@alibaba-inc.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>


07 Aug, 2024 3 commits

Add CogVideoX text-to-video generation model (#9082) · 2dad462d

zR authored Aug 07, 2024



* add CogVideoX

---------
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>


[core] FreeNoise (#8948) · 16a93f1a

Aryan authored Aug 07, 2024



* initial work draft for freenoise; needs massive cleanup

* fix freeinit bug

* add animatediff controlnet implementation

* revert attention changes

* add freenoise

* remove old helper functions

* add decode batch size param to all pipelines

* make style

* fix copied from comments

* make fix-copies

* make style

* copy animatediff controlnet implementation from #8972

* add experimental support for num_frames not perfectly fitting context length, context stride

* make unet motion model lora work again based on #8995

* copy load video utils from #8972

* copied from AnimateDiff::prepare_latents

* address the case where last batch of frames does not match length of indices in prepare latents

* decode_batch_size->vae_batch_size; batch vae encode support in animatediff vid2vid

* revert sparsectrl and sdxl freenoise changes

* revert pia

* add freenoise tests

* make fix-copies

* improve docstrings

* add freenoise tests to animatediff controlnet

* update tests

* Update src/diffusers/models/unets/unet_motion_model.py

* add freenoise to animatediff pag

* address review comments

* make style

* update tests

* make fix-copies

* fix error message

* remove copied from comment

* fix imports in tests

* update

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>


[Kolors] Add PAG (#8934) · 39e1f7ea

Álvaro Somoza authored Aug 06, 2024



* txt2img pag added

* autopipe added, fixed case

* style

* apply suggestions

* added fast tests, added todo tests

* revert dummy objects for kolors

* fix pag dummies

* fix test imports

* update pag tests

* add kolor pag to docs

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


06 Aug, 2024 2 commits

Fix loading sharded checkpoints when we have variants (#9061) · e4325606

Marc Sun authored Aug 07, 2024



* Fix loading sharded checkpoint when we have variant

* add test

* remove print

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>


add PAG support for Stable Diffusion 3 (#8861) · 926daa30

Ahn Donghoon (안동훈 / suno) authored Aug 07, 2024



add pag sd3


---------
Co-authored-by: HyoungwonCho <jhw9811@korea.ac.kr>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: crepejung00 <jaewoojung00@naver.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>
