Commits · 8336405e50e204fad3601e8350e04e6daa838eb4 · renzhc / diffusers_dcu

16 Sep, 2024 1 commit

CogVideoX-5b-I2V support (#9418) · 8336405e

Yuxuan.Zhang authored Sep 16, 2024



* draft Init

* draft

* vae encode image

* make style

* image latents preparation

* remove image encoder from conversion script

* fix minor bugs

* make pipeline work

* make style

* remove debug prints

* fix imports

* update example

* make fix-copies

* add fast tests

* fix import

* update vae

* update docs

* update image link

* apply suggestions from review

* apply suggestions from review

* add slow test

* make use of learned positional embeddings

* apply suggestions from review

* doc change

* Update convert_cogvideox_to_diffusers.py

* make style

* final changes

* make style

* fix tests

---------
Co-authored-by: Aryan <aryan@huggingface.co>

8336405e

11 Sep, 2024 1 commit

[docs] AnimateDiff FreeNoise (#9414) · 5e1427a7

Aryan authored Sep 12, 2024



* update docs

* apply suggestions from review

* Update docs/source/en/api/pipelines/animatediff.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/animatediff.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/animatediff.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* apply suggestions from review

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

5e1427a7

09 Sep, 2024 3 commits

[docs] Add xDiT in section optimization (#9365) · 2c6a6c97

Jinzhe Pan authored Sep 10, 2024



* docs: add xDiT to optimization methods

* fix: picture layout problem

* docs: add more introduction about xdit & apply suggestions

* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

2c6a6c97

[Pipeline] animatediff + vid2vid + controlnet (#9337) · a7361dcc

Igor Filippov authored Sep 09, 2024

* add animatediff + vid2vid + controlnet

* post tests fixes

* PR discussion fixes

* update docs

* change input video to links on HF + update an example

* make quality fix

* fix ip adapter test

* fix ip adapter test input

* update ip adapter test

a7361dcc

refactor `get_timesteps` for SDXL img2img + add set_begin_index (#9375) · 485b8bb0

YiYi Xu authored Sep 09, 2024

* refactor + add begin_index

* add kolors img2img to doc

485b8bb0

04 Sep, 2024 1 commit

Add Flux inpainting and Flux Img2Img (#9135) · 249a9e48

Vishnu V Jaddipal authored Sep 05, 2024

---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>

249a9e48

02 Sep, 2024 1 commit

[core] Support VideoToVideo with CogVideoX (#9333) · 0e6a8403

Aryan authored Sep 02, 2024

* add vid2vid pipeline for cogvideox

* make fix-copies

* update docs

* fake context parallel cache, vae encode tiling

* add test for cog vid2vid

* use video link from HF docs repo

* add copied from comments; correctly rename test class

0e6a8403

30 Aug, 2024 1 commit

[docs] Add a note on torchao/quanto benchmarks for CogVideoX and memory-efficient inference (#9296) · e417d028

Aryan authored Aug 30, 2024



* add a note on torchao/quanto benchmarks and memory-efficient inference

* apply suggestions from review

* update

* Update docs/source/en/api/pipelines/cogvideox.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/api/pipelines/cogvideox.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* add note on enable sequential cpu offload

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

e417d028

27 Aug, 2024 1 commit

[docs] Add pipelines to table (#9282) · bbcf2a85

Steven Liu authored Aug 26, 2024

update pipelines

bbcf2a85

25 Aug, 2024 1 commit

[Flux] Support Union ControlNet (#9175) · c1e6a32a

王奇勋 authored Aug 25, 2024

* refactor

---------
Co-authored-by: haofanwang <haofanwang.ai@gmail.com>

c1e6a32a

23 Aug, 2024 1 commit

Cogvideox-5B Model adapter change (#9203) · 960c149c

zR authored Aug 23, 2024



* draft of embedding

---------
Co-authored-by: Aryan <aryan@huggingface.co>

960c149c

22 Aug, 2024 1 commit

Docs fix spelling issues (#9219) · 805bf33f

Elias Rad authored Aug 22, 2024

* fix PHILOSOPHY.md

* fix CONTRIBUTING.md

* fix tutorial_overview.md

* fix stable_diffusion.md

* Update tutorial_overview.md

805bf33f

21 Aug, 2024 1 commit

Add StableDiffusionXLControlNetPAGImg2ImgPipeline (#8990) · 9003d75f

satani99 authored Aug 21, 2024



* Added pag controlnet sdxl img2img pipeline

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

9003d75f

19 Aug, 2024 1 commit

Reflect few contributions on `contribution.md` that were not reflected on #8294 (#8938) · d72bbc68

Jiwook Han authored Aug 20, 2024



* incorrect_number_fix

* add_TOC

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* fix: manual edits

* fix: manual edits

* fix: manual edits

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* Update docs/source/ko/conceptual/contribution.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

* fix: manual edits

---------
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com>

d72bbc68

18 Aug, 2024 1 commit

[`Docs`] Fix CPU offloading usage (#9207) · 7ef8a465

Tolga Cangöz authored Aug 19, 2024

* chore: Fix cpu offloading usage

* Trim trailing white space

* docs: update Kolors model link in kolors.md

7ef8a465

13 Aug, 2024 2 commits

[refactor] CogVideoX followups + tiled decoding support (#9150) · a85b34e7

Aryan authored Aug 14, 2024

* refactor context parallel cache; update torch compile time benchmark

* add tiling support

* make style

* remove num_frames % 8 == 0 requirement

* update default num_frames to original value

* add explanations + refactor

* update torch compile example

* update docs

* update

* clean up if-statements

* address review comments

* add test for vae tiling

* update docs

* update docs

* update docstrings

* add modeling test for cogvideox transformer

* make style

a85b34e7

Support SD3 controlnet inpainting (#9099) · cc051309

林金鹏 authored Aug 13, 2024



* add controlnet inpainting pipeline

* [SD3] add controlnet inpaint example

* update example and fix code style

* fix code style with ruff

* Update controlnet_sd3.md : add control inpaint pipeline

* Update docs/source/en/api/pipelines/controlnet_sd3.md
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Update docs/source/en/api/pipelines/controlnet_sd3.md
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Update docs/source/en/api/pipelines/controlnet_sd3.md
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Update src/diffusers/pipelines/controlnet_sd3/pipeline_stable_diffusion_3_controlnet_inpainting.py
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Update __init__.py : add sd3 control pipelines

* Update pipeline : add new param doc & check input reference.

* fix typo

* make style & make quality

* add unittest for sd3 controlnet inpaint

---------
Co-authored-by: 鹏徙 <linjinpeng.ljp@alibaba-inc.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

cc051309

12 Aug, 2024 1 commit

Update distributed_inference.md to include a fuller example on distributed inference (#9152) · 15eb77bc

Sayak Paul authored Aug 12, 2024



* Update distributed_inference.md

* Update docs/source/en/training/distributed_inference.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

15eb77bc

10 Aug, 2024 1 commit

[docs] Resolve internal links to PEFT (#9144) · 98930ee1

Steven Liu authored Aug 09, 2024

* resolve peft links

* fuse_lora

98930ee1

08 Aug, 2024 2 commits

Fix a dead link (#9116) · ae026db7

David Steinberg authored Aug 08, 2024

Co-authored-by: Aryan <aryan@huggingface.co>

ae026db7

[docs] Organize model toctree (#9118) · ba7e4845

Steven Liu authored Aug 07, 2024

* toctree

* fix

ba7e4845

07 Aug, 2024 4 commits

Add CogVideoX text-to-video generation model (#9082) · 2dad462d

zR authored Aug 07, 2024



* add CogVideoX

---------
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

2dad462d

Flux fp16 inference fix (#9097) · 9b5180cb

latentCall145 authored Aug 07, 2024



* clipping for fp16

* fix typo

* added fp16 inference to docs

* fix docs typo

* include link for fp16 investigation

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

9b5180cb

[Kolors] Add PAG (#8934) · 39e1f7ea

Álvaro Somoza authored Aug 06, 2024



* txt2img pag added

* autopipe added, fixed case

* style

* apply suggestions

* added fast tests, added todo tests

* revert dummy objects for kolors

* fix pag dummies

* fix test imports

* update pag tests

* add kolor pag to docs

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

39e1f7ea

[Single File] Add single file support for Flux Transformer (#9083) · e1b603dc

Dhruv Nair authored Aug 07, 2024

* update

* update

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

e1b603dc

06 Aug, 2024 3 commits

add PAG support for Stable Diffusion 3 (#8861) · 926daa30

Ahn Donghoon (안동훈 / suno) authored Aug 07, 2024



add pag sd3


---------
Co-authored-by: HyoungwonCho <jhw9811@korea.ac.kr>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: crepejung00 <jaewoojung00@naver.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>

926daa30

[Docs] Add community projects section to docs (#9013) · 325a5de3

Dhruv Nair authored Aug 06, 2024

* update

* update

* update

325a5de3

update · 4c6152c2

Dhruv Nair authored Aug 06, 2024

4c6152c2

05 Aug, 2024 4 commits

Update `CLIPFeatureExtractor` to `CLIPImageProcessor` and `DPTFeatureExtractor` to `DPTImageProcessor` (#9002) · 3dc97bd1

Tolga Cangöz authored Aug 05, 2024

* fix: update `CLIPFeatureExtractor` to `CLIPImageProcessor` in codebase

* `make style && make quality`

* Update `DPTFeatureExtractor` to `DPTImageProcessor` in codebase

* `make style`

---------
Co-authored-by: Aryan <aryan@huggingface.co>

3dc97bd1

Fix typos (#9077) · 6d32b292

omahs authored Aug 05, 2024

* fix typo

6d32b292

[Docs] add stable cascade unet doc. (#9066) · 5934873b

Sayak Paul authored Aug 05, 2024

* add stable cascade unet doc.

* fix path

5934873b

PAG variant for HunyuanDiT, PAG refactor (#8936) · b7058d14

Aryan authored Aug 05, 2024



* copy hunyuandit pipeline

* pag variant of hunyuan dit

* add tests

* update docs

* make style

* make fix-copies

* Update src/diffusers/pipelines/pag/pag_utils.py

* remove incorrect copied from

* remove pag hunyuan attn procs to resolve conflicts

* add pag attn procs again

* new implementation for pag_utils

* revert pag changes

* add pag refactor back; update pixart sigma

* update pixart pag tests

* apply suggestions from review

Co-Authored-By: yixu310@gmail.com

* make style

* update docs, fix tests

* fix tests

* fix test_components_function since list not accepted as valid __init__ param

* apply patch to fix broken tests
Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com>

* make style

* fix hunyuan tests

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

b7058d14

04 Aug, 2024 1 commit

[Flux] minor documentation fixes for flux. (#9048) · c370b90f

Sayak Paul authored Aug 04, 2024

* minor documentation fixes for flux.

* clipskip

* add gist

c370b90f

03 Aug, 2024 2 commits

Fix grammar mistake. (#9072) · ebf3ab14

Philip Rideout authored Aug 03, 2024

ebf3ab14

Errata: Fix typos & `\s+$` (#9008) · 7071b746

Tolga Cangöz authored Aug 03, 2024



* Fix typos

* chore: Fix typos

* chore: Update README.md for promptdiffusion example

* Trim trailing white spaces

* Fix a typo

* update number

* chore: update number

* Trim trailing white space

* Update README.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update README.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

7071b746

02 Aug, 2024 1 commit

[Core] Add PAG support for PixArtSigma (#8921) · 7b98c4cc

Sayak Paul authored Aug 02, 2024

* feat: add pixart sigma pag.

* inits.

* fixes

* fix

* remove print.

* copy paste methods to the pixart pag mixin

* fix-copies

* add documentation.

* add tests.

* remove correction file.

* remove pag_applied_layers

* empty

7b98c4cc

01 Aug, 2024 2 commits

Flux pipeline (#9043) · 27637a54

Sayak Paul authored Aug 02, 2024



add flux!
Signed-off-by: Adrien <adrien@huggingface.co>
Co-authored-by: Adrien <adrien.69740@gmail.com>
Co-authored-by: Anatoly Belikov <abelikov@singularitynet.io>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>

27637a54

PAG variant for AnimateDiff (#8789) · 05b706c0

Aryan authored Aug 01, 2024

* add animatediff pag pipeline

* remove unnecessary print

* make fix-copies

* fix ip-adapter bug

* update docs

* add fast tests and fix bugs

* update

* update

* address review comments

* update ip adapter single test expected slice

* implement test_from_pipe_consistent_config; fix expected slice values

* LoraLoaderMixin->StableDiffusionLoraLoaderMixin; add latest freeinit test

05b706c0

30 Jul, 2024 2 commits

[core] Move community AnimateDiff ControlNet to core (#8972) · e5b94b4c

Aryan authored Jul 30, 2024



* add animatediff controlnet to core

* make style; remove unused method

* fix copied from comment

* add tests

* changes to make tests work

* add utility function to load videos

* update docs

* update pipeline example

* make style

* update docs with example

* address review comments

* add latest freeinit test from #8969

* LoraLoaderMixin -> StableDiffusionLoraLoaderMixin

* fix docs

* Update src/diffusers/utils/loading_utils.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* fix: variable out of scope

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

e5b94b4c

Stable Audio integration (#8716) · 69e72b1d

Yoach Lacombe authored Jul 30, 2024



* WIP modeling code and pipeline

* add custom attention processor + custom activation + add to init

* correct ProjectionModel forward

* add stable audio to __init__

* add autoencoder and update pipeline and modeling code

* add half Rope

* add partial rotary v2

* add temporary modifications to scheduler

* add EDM DPM Solver

* remove TODOs

* clean GLU

* remove att.group_norm to attn processor

* revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py

* refactor GLU -> SwiGLU

* remove redundant args

* add channel multiples in autoencoder docstrings

* changes in docstrings and copyright headers

* clean pipeline

* further cleaning

* remove peft and lora and fromoriginalmodel

* Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace

* make style

* dummy models

* fix copied from

* add fast oobleck tests

* add brownian tree

* oobleck autoencoder slow tests

* remove TODO

* fast stable audio pipeline tests

* add slow tests

* make style

* add first version of docs

* wrap is_torchsde_available to the scheduler

* fix slow test

* test with input waveform

* add input waveform

* remove some todos

* create stableaudio gaussian projection + make style

* add pipeline to toctree

* fix copied from

* make quality

* refactor timestep_features->time_proj

* refactor joint_attention_kwargs->cross_attention_kwargs

* remove forward_chunk

* move StableAudioDitModel to transformers folder

* correct convert + remove partial rotary embed

* apply suggestions from yiyixuxu -> removing attn.kv_heads

* remove temb

* remove cross_attention_kwargs

* further removal of cross_attention_kwargs

* remove text encoder autocast to fp16

* continue removing autocast

* make style

* refactor how text and audio are embedded

* add paper

* update example code

* make style

* unify projection model forward + fix device placement

* make style

* remove fuse qkv

* apply suggestions from review

* Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* make style

* smaller models in fast tests

* pass sequential offloading fast tests

* add docs for vae and autoencoder

* make style and update example

* remove useless import

* add cosine scheduler

* dummy classes

* cosine scheduler docs

* better description of scheduler

---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

69e72b1d