Commits · 5704376d0309031a124fcb8a957fc70282ce13eb · renzhc / diffusers_dcu

17 Oct, 2024 2 commits

[refactor] DiffusionPipeline.download (#9557) · 5704376d

Aryan authored Oct 18, 2024



* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

5704376d

[Flux] Add advanced training script + support textual inversion inference (#9434) · 9a7f8246

Linoy Tsaban authored Oct 17, 2024

* add ostris trainer to README & add cache latents of vae

* add ostris trainer to README & add cache latents of vae

* style

* readme

* add test for latent caching

* add ostris noise scheduler
https://github.com/ostris/ai-toolkit/blob/9ee1ef2a0a2a9a02b92d114a95f21312e5906e54/toolkit/samplers/custom_flowmatch_sampler.py#L95

* style

* fix import

* style

* fix tests

* style

* --change upcasting of transformer?

* update readme according to main

* add pivotal tuning for CLIP

* fix imports, encode_prompt call,add TextualInversionLoaderMixin to FluxPipeline for inference

* TextualInversionLoaderMixin support for FluxPipeline for inference

* move changes to advanced flux script, revert canonical

* add latent caching to canonical script

* revert changes to canonical script to keep it separate from https://github.com/huggingface/diffusers/pull/9160

* revert changes to canonical script to keep it separate from https://github.com/huggingface/diffusers/pull/9160

* style

* remove redundant line and change code block placement to align with logic

* add initializer_token arg

* add transformer frac for range support from pure textual inversion to the orig pivotal tuning

* support pure textual inversion - wip

* adjustments to support pure textual inversion and transformer optimization in only part of the epochs

* fix logic when using initializer token

* fix pure_textual_inversion_condition

* fix ti/pivotal loading of last validation run

* remove embeddings loading for ti in final training run (to avoid adding huggingface hub dependency)

* support pivotal for t5

* adapt pivotal for T5 encoder

* adapt pivotal for T5 encoder and support in flux pipeline

* t5 pivotal support + support fo pivotal for clip only or both

* fix param chaining

* fix param chaining

* README first draft

* readme

* readme

* readme

* style

* fix import

* style

* add fix from https://github.com/huggingface/diffusers/pull/9419



* add to readme, change function names

* te lr changes

* readme

* change concept tokens logic

* fix indices

* change arg name

* style

* dummy test

* revert dummy test

* reorder pivoting

* add warning in case the token abstraction is not the instance prompt

* experimental - wip - specific block training

* fix documentation and token abstraction processing

* remove transformer block specification feature (for now)

* style

* fix copies

* fix indexing issue when --initializer_concept has different amounts

* add if TextualInversionLoaderMixin to all flux pipelines

* style

* fix import

* fix imports

* address review comments - remove necessary prints & comments, use pin_memory=True, use free_memory utils, unify warning and prints

* style

* logger info fix

* make lora target modules configurable and change the default

* make lora target modules configurable and change the default

* style

* make lora target modules configurable and change the default, add notes to readme

* style

* add tests

* style

* fix repo id

* add updated requirements for advanced flux

* fix indices of t5 pivotal tuning embeddings

* fix path in test

* remove `pin_memory`

* fix filename of embedding

* fix filename of embedding

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

9a7f8246

16 Oct, 2024 1 commit

[pipeline] CogVideoX-Fun Control (#9671) · 8cabd4a0

Aryan authored Oct 16, 2024



* cogvideox-fun control

* make style

* make fix-copies

* karras schedulers

* Update src/diffusers/pipelines/cogvideo/pipeline_cogvideox_fun_control.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/cogvideox.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* apply suggestions from review

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

8cabd4a0

15 Oct, 2024 4 commits

[authored by @Anghellia) Add support of Xlabs Controlnets #9638 (#9687) · 3e9a28a8
YiYi Xu authored Oct 15, 2024
```
* Add support of Xlabs Controlnets


---------
Co-authored-by: Anzhella Pankratova <son0shad@gmail.com>
```
3e9a28a8

[training] CogVideoX-I2V LoRA (#9482) · 2ffbb88f

Aryan authored Oct 16, 2024



* update

* update

* update

* update

* update

* add coauthor
Co-Authored-By: yuan-shenghai <963658029@qq.com>

* add coauthor
Co-Authored-By: Shenghai Yuan <140951558+SHYuanBest@users.noreply.github.com>

* update
Co-Authored-By: yuan-shenghai <963658029@qq.com>

* update

---------
Co-authored-by: yuan-shenghai <963658029@qq.com>
Co-authored-by: Shenghai Yuan <140951558+SHYuanBest@users.noreply.github.com>

2ffbb88f

Convert list/tuple of `HunyuanDiT2DControlNetModel` to `HunyuanDiT2DMultiControlNetModel` (#9651) · 957e5cab
hlky authored Oct 15, 2024
```
Convert list/tuple of HunyuanDiT2DControlNetModel to HunyuanDiT2DMultiControlNetModel
Co-authored-by: YiYi Xu <yixu310@gmail.com>
```
957e5cab
Convert list/tuple of `SD3ControlNetModel` to `SD3MultiControlNetModel` (#9652) · 3e4c5707
hlky authored Oct 15, 2024
```
Convert list/tuple of SD3ControlNetModel to SD3MultiControlNetModel
Co-authored-by: YiYi Xu <yixu310@gmail.com>
```
3e4c5707

14 Oct, 2024 2 commits

Added Lora Support to SD3 Img2Img Pipeline (#9659) · 22ed39f5
SahilCarterr authored Oct 15, 2024
```
* add lora
```
22ed39f5

CogView3Plus DiT (#9570) · 8d81564b

Yuxuan.Zhang authored Oct 14, 2024

* merge 9588

* max_shard_size="5GB" for colab running

* conversion script updates; modeling test; refactor transformer

* make fix-copies

* Update convert_cogview3_to_diffusers.py

* initial pipeline draft

* make style

* fight bugs 🐛

🪳

* add example

* add tests; refactor

* make style

* make fix-copies

* add co-author

YiYi Xu <yixu310@gmail.com>

* remove files

* add docs

* add co-author
Co-Authored-By: YiYi Xu <yixu310@gmail.com>

* fight docs

* address reviews

* make style

* make model work

* remove qkv fusion

* remove qkv fusion tets

* address review comments

* fix make fix-copies error

* remove None and TODO

* for FP16(draft)

* make style

* remove dynamic cfg

* remove pooled_projection_dim as a parameter

* fix tests

---------
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

8d81564b

11 Oct, 2024 1 commit
- FluxMultiControlNetModel (#9647) · 0f8fb75c
  hlky authored Oct 11, 2024
  
  0f8fb75c
10 Oct, 2024 1 commit

flux controlnet control_guidance_start and control_guidance_end implement (#9571) · 38a3e4df

Subho Ghosh authored Oct 10, 2024

* flux controlnet control_guidance_start and control_guidance_end implement

* minor fix - added docstrings, consistent controlnet scale flux and SD3

38a3e4df

09 Oct, 2024 3 commits
- make controlnet support interrupt (#9620) · 07bd2fab
  Pakkapon Phongthawee authored Oct 10, 2024
```
* make controlnet support interrupt

* remove white space in controlnet interrupt
```
  07bd2fab
- add PAG support for SD Img2Img (#9463) · af28ae2d
  SahilCarterr authored Oct 10, 2024
```
* added pag to sd img2img pipeline


---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
```
  af28ae2d
- refac/pipeline_output (#9582) · ec9e5264
  Yijun Lee authored Oct 09, 2024
  
  ec9e5264
08 Oct, 2024 1 commit

Fixed noise_pred_text referenced before assignment. (#9537) · 86bd991e

v2ray authored Oct 09, 2024

* Fixed local variable noise_pred_text referenced before assignment when using PAG with guidance scale and guidance rescale at the same time.

* Fixed style.

* Made returning text pred noise an argument.

86bd991e

07 Oct, 2024 1 commit
- Fix for use_safetensors parameters, allow use of parameter on loading submodels (#9576) (#9587) · 12878229
  Eliseu Silva authored Oct 07, 2024
```
* Fix for use_safetensors parameters, allow use of parameter on loading submodels (#9576)
```
  12878229
03 Oct, 2024 1 commit
- [sd3] make sure height and size are divisible by `16` (#9573) · 99f60821
  YiYi Xu authored Oct 03, 2024
```
* check size

* up
```
  99f60821
01 Oct, 2024 1 commit
- Add PAG support to StableDiffusionControlNetPAGInpaintPipeline (#8875) · 33fafe3d
  JuanCarlosPi authored Oct 01, 2024
```
* Add pag to controlnet inpainting pipeline


---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
```
  33fafe3d
28 Sep, 2024 1 commit

[Core] fix variant-identification. (#9253) · 11542431

Sayak Paul authored Sep 28, 2024



* fix variant-idenitification.

* fix variant

* fix sharded variant checkpoint loading.

* Apply suggestions from code review

* fixes.

* more fixes.

* remove print.

* fixes

* fixes

* comments

* fixes

* apply suggestions.

* hub_utils.py

* fix test

* updates

* fixes

* fixes

* Apply suggestions from code review
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* updates.

* removep patch file.

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

11542431

26 Sep, 2024 2 commits

[Tests] Fix ChatGLMTokenizer (#9536) · 066ea374
Álvaro Somoza authored Sep 26, 2024
```
fix
```
066ea374

flux controlnet fix (control_modes batch & others) (#9507) · 9cd37557

YiYi Xu authored Sep 25, 2024



* flux controlnet mode to take into account batch size

* incorporate yiyixuxu's suggestions (cleaner logic) as well as clean up control mode handling for multi case

* fix

* fix use_guidance when controlnet is a multi and does not have config

---------
Co-authored-by: Christopher Beckham <christopher.j.beckham@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

9cd37557

23 Sep, 2024 2 commits

Allow DDPMPipeline half precision (#9222) · 3e69e241
Seongbin Lim authored Sep 24, 2024
```
Co-authored-by: YiYi Xu <yixu310@gmail.com>
```
3e69e241

[Cog] some minor fixes and nits (#9466) · ba5af5ae

Sayak Paul authored Sep 23, 2024

* fix positional arguments in check_inputs().

* add video and latetns to check_inputs().

* prep latents_in_channels.

* quality

* multiple fixes.

* fix

ba5af5ae

20 Sep, 2024 1 commit
- Several fixes to Flux ControlNet pipelines (#9472) · 14a1b86f
  Vladimir Mandic authored Sep 19, 2024
```
* fix flux controlnet pipelines

---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>
```
  14a1b86f
19 Sep, 2024 1 commit

[training] CogVideoX Lora (#9302) · 2b443a5d

Aryan authored Sep 19, 2024



* cogvideox lora training draft

* update

* update

* update

* update

* update

* make fix-copies

* update

* update

* apply suggestions from review

* apply suggestions from reveiw

* fix typo

* Update examples/cogvideo/train_cogvideox_lora.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* fix lora alpha

* use correct lora scaling for final test pipeline

* Update examples/cogvideo/train_cogvideox_lora.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* apply suggestions from review; prodigy optimizer

YiYi Xu <yixu310@gmail.com>

* add tests

* make style

* add README

* update

* update

* make style

* fix

* update

* add test skeleton

* revert lora utils changes

* add cleaner modifications to lora testing utils

* update lora tests

* deepspeed stuff

* add requirements.txt

* deepspeed refactor

* add lora stuff to img2vid pipeline to fix tests

* fight tests

* add co-authors
Co-Authored-By: Fu-Yun Wang <1697256461@qq.com>
Co-Authored-By: zR <2448370773@qq.com>

* fight lora runner tests

* import Dummy optim and scheduler only wheh required

* update docs

* add coauthors
Co-Authored-By: Fu-Yun Wang <1697256461@qq.com>

* remove option to train text encoder
Co-Authored-By: bghira <bghira@users.github.com>

* update tests

* fight more tests

* update

* fix vid2vid

* fix typo

* remove lora tests; todo in follow-up PR

* undo img2vid changes

* remove text encoder related changes in lora loader mixin

* Revert "remove text encoder related changes in lora loader mixin"

This reverts commit f8a8444487db27859be812866db4e8cec7f25691.

* update

* round 1 of fighting tests

* round 2 of fighting tests

* fix copied from comment

* fix typo in lora test

* update styling
Co-Authored-By: YiYi Xu <yixu310@gmail.com>

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: zR <2448370773@qq.com>
Co-authored-by: Fu-Yun Wang <1697256461@qq.com>
Co-authored-by: bghira <bghira@users.github.com>

2b443a5d

17 Sep, 2024 2 commits
- set max_shard_size to None for pipeline save_pretrained (#9447) · da18fbd5
  Aryan authored Sep 18, 2024
```
* update default max_shard_size

* add None check to fix tests

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
```
  da18fbd5
- Feature flux controlnet img2img and inpaint pipeline (#9408) · bb1b0fa1
  Subho Ghosh authored Sep 18, 2024
```
* Implemented FLUX controlnet support to Img2Img pipeline
```
  bb1b0fa1
16 Sep, 2024 3 commits

[docs] Replace runwayml/stable-diffusion-v1-5 with Lykon/dreamshaper-8 (#9428) · b52119ae

suzukimain authored Sep 17, 2024



* [docs] Replace runwayml/stable-diffusion-v1-5 with Lykon/dreamshaper-8

Updated documentation as runwayml/stable-diffusion-v1-5 has been removed from Huggingface.

* Update docs/source/en/using-diffusers/inpaint.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Replace with stable-diffusion-v1-5/stable-diffusion-v1-5

* Update inpaint.md

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

b52119ae

CogVideoX-5b-I2V support (#9418) · 8336405e

Yuxuan.Zhang authored Sep 16, 2024



* draft Init

* draft

* vae encode image

* make style

* image latents preparation

* remove image encoder from conversion script

* fix minor bugs

* make pipeline work

* make style

* remove debug prints

* fix imports

* update example

* make fix-copies

* add fast tests

* fix import

* update vae

* update docs

* update image link

* apply suggestions from review

* apply suggestions from review

* add slow test

* make use of learned positional embeddings

* apply suggestions from review

* doc change

* Update convert_cogvideox_to_diffusers.py

* make style

* final changes

* make style

* fix tests

---------
Co-authored-by: Aryan <aryan@huggingface.co>

8336405e

Allow max shard size to be specified when saving pipeline (#9440) · 2454b98a
Aryan authored Sep 16, 2024
```
allow max shard size to be specified when saving pipeline
```
2454b98a

12 Sep, 2024 1 commit

Ptxla sd training (#9381) · 45aa8bb1

Juan Acevedo authored Sep 11, 2024



* enable pxla training of stable diffusion 2.x models.

* run linter/style and run pipeline test for stable diffusion and fix issues.

* update xla libraries

* fix read me newline.

* move files to research folder.

* update per comments.

* rename readme.

---------
Co-authored-by: Juan Acevedo <jfacevedo@google.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

45aa8bb1

09 Sep, 2024 2 commits

[Pipeline] animatediff + vid2vid + controlnet (#9337) · a7361dcc

Igor Filippov authored Sep 09, 2024

* add animatediff + vid2vide + controlnet

* post tests fixes

* PR discussion fixes

* update docs

* change input video to links on HF + update an example

* make quality fix

* fix ip adapter test

* fix ip adapter test input

* update ip adapter test

a7361dcc

refactor `get_timesteps` for SDXL img2img + add set_begin_index (#9375) · 485b8bb0
YiYi Xu authored Sep 09, 2024
```
* refator + add begin_index

* add kolors img2img to doc
```
485b8bb0

06 Sep, 2024 2 commits

add flux inpaint + img2img + controlnet to auto pipeline (#9367) · 8cdcdd9e
YiYi Xu authored Sep 06, 2024

8cdcdd9e

[core] Freenoise memory improvements (#9262) · 6dfa4996

Aryan authored Sep 06, 2024

* update

* implement prompt interpolation

* make style

* resnet memory optimizations

* more memory optimizations; todo: refactor

* update

* update animatediff controlnet with latest changes

* refactor chunked inference changes

* remove print statements

* update

* chunk -> split

* remove changes from incorrect conflict resolution

* remove changes from incorrect conflict resolution

* add explanation of SplitInferenceModule

* update docs

* Revert "update docs"

This reverts commit c55a50a271b2cefa8fe340a4f2a3ab9b9d374ec0.

* update docstring for freenoise split inference

* apply suggestions from review

* add tests

* apply suggestions from review

6dfa4996

04 Sep, 2024 3 commits
- Update `UNet2DConditionModel`'s error messages (#9230) · 30005517
  Tolga Cangöz authored Sep 04, 2024
```
* refactor
```
  30005517
- Add Flux inpainting and Flux Img2Img (#9135) · 249a9e48
  Vishnu V Jaddipal authored Sep 05, 2024
```
---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>
```
  249a9e48
- Enable `load_lora_weights` for `StableDiffusion3InpaintPipeline` (#9330) · 8ecf499d
  Eduardo Escobar authored Sep 03, 2024
```
Enable load_lora_weights for StableDiffusion3InpaintPipeline
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  8ecf499d
02 Sep, 2024 1 commit

[core] Support VideoToVideo with CogVideoX (#9333) · 0e6a8403

Aryan authored Sep 02, 2024

* add vid2vid pipeline for cogvideox

* make fix-copies

* update docs

* fake context parallel cache, vae encode tiling

* add test for cog vid2vid

* use video link from HF docs repo

* add copied from comments; correctly rename test class

0e6a8403

28 Aug, 2024 1 commit
- Change default for `guidance_scale`in FLUX (#9305) · 089cf798
  apolinário authored Aug 28, 2024
```
To match the original code, 7.0 is too high
```
  089cf798