"vscode:/vscode.git/clone" did not exist on "06a042cd0ed090be8dc5a425003193ecb70e82b4"
  1. 16 Sep, 2024 1 commit
    • Yuxuan.Zhang's avatar
      CogVideoX-5b-I2V support (#9418) · 8336405e
      Yuxuan.Zhang authored
      
      
      * draft Init
      
      * draft
      
      * vae encode image
      
      * make style
      
      * image latents preparation
      
      * remove image encoder from conversion script
      
      * fix minor bugs
      
      * make pipeline work
      
      * make style
      
      * remove debug prints
      
      * fix imports
      
      * update example
      
      * make fix-copies
      
      * add fast tests
      
      * fix import
      
      * update vae
      
      * update docs
      
      * update image link
      
      * apply suggestions from review
      
      * apply suggestions from review
      
      * add slow test
      
      * make use of learned positional embeddings
      
      * apply suggestions from review
      
      * doc change
      
      * Update convert_cogvideox_to_diffusers.py
      
      * make style
      
      * final changes
      
      * make style
      
      * fix tests
      
      ---------
      Co-authored-by: default avatarAryan <aryan@huggingface.co>
      8336405e
  2. 11 Sep, 2024 1 commit
  3. 09 Sep, 2024 3 commits
  4. 04 Sep, 2024 1 commit
  5. 02 Sep, 2024 1 commit
    • Aryan's avatar
      [core] Support VideoToVideo with CogVideoX (#9333) · 0e6a8403
      Aryan authored
      * add vid2vid pipeline for cogvideox
      
      * make fix-copies
      
      * update docs
      
      * fake context parallel cache, vae encode tiling
      
      * add test for cog vid2vid
      
      * use video link from HF docs repo
      
      * add copied from comments; correctly rename test class
      0e6a8403
  6. 30 Aug, 2024 1 commit
  7. 27 Aug, 2024 1 commit
  8. 25 Aug, 2024 1 commit
  9. 23 Aug, 2024 1 commit
  10. 22 Aug, 2024 1 commit
    • Elias Rad's avatar
      Docs fix spelling issues (#9219) · 805bf33f
      Elias Rad authored
      * fix PHILOSOPHY.md
      
      * fix CONTRIBUTING.md
      
      * fix tutorial_overview.md
      
      * fix stable_diffusion.md
      
      * Update tutorial_overview.md
      805bf33f
  11. 21 Aug, 2024 1 commit
  12. 19 Aug, 2024 1 commit
  13. 18 Aug, 2024 1 commit
  14. 13 Aug, 2024 2 commits
    • Aryan's avatar
      [refactor] CogVideoX followups + tiled decoding support (#9150) · a85b34e7
      Aryan authored
      * refactor context parallel cache; update torch compile time benchmark
      
      * add tiling support
      
      * make style
      
      * remove num_frames % 8 == 0 requirement
      
      * update default num_frames to original value
      
      * add explanations + refactor
      
      * update torch compile example
      
      * update docs
      
      * update
      
      * clean up if-statements
      
      * address review comments
      
      * add test for vae tiling
      
      * update docs
      
      * update docs
      
      * update docstrings
      
      * add modeling test for cogvideox transformer
      
      * make style
      a85b34e7
    • 林金鹏's avatar
      Support SD3 controlnet inpainting (#9099) · cc051309
      林金鹏 authored
      
      
      * add controlnet inpainting pipeline
      
      * [SD3] add controlnet inpaint example
      
      * update example and fix code style
      
      * fix code style with ruff
      
      * Update controlnet_sd3.md : add control inpaint pipeline
      
      * Update docs/source/en/api/pipelines/controlnet_sd3.md
      Co-authored-by: default avatarAryan <contact.aryanvs@gmail.com>
      
      * Update docs/source/en/api/pipelines/controlnet_sd3.md
      Co-authored-by: default avatarAryan <contact.aryanvs@gmail.com>
      
      * Update docs/source/en/api/pipelines/controlnet_sd3.md
      Co-authored-by: default avatarAryan <contact.aryanvs@gmail.com>
      
      * Update src/diffusers/pipelines/controlnet_sd3/pipeline_stable_diffusion_3_controlnet_inpainting.py
      Co-authored-by: default avatarAryan <contact.aryanvs@gmail.com>
      
      * Update __init__.py : add sd3 control pipelines
      
      * Update pipeline : add new param doc & check input reference.
      
      * fix typo
      
      * make style & make quality
      
      * add unittest for sd3 controlnet inpaint
      
      ---------
      Co-authored-by: default avatar鹏徙 <linjinpeng.ljp@alibaba-inc.com>
      Co-authored-by: default avatarAryan <contact.aryanvs@gmail.com>
      cc051309
  15. 12 Aug, 2024 1 commit
  16. 10 Aug, 2024 1 commit
  17. 08 Aug, 2024 2 commits
  18. 07 Aug, 2024 4 commits
  19. 06 Aug, 2024 3 commits
  20. 05 Aug, 2024 4 commits
    • Tolga Cangöz's avatar
      Update `CLIPFeatureExtractor` to `CLIPImageProcessor` and... · 3dc97bd1
      Tolga Cangöz authored
      
      Update `CLIPFeatureExtractor` to `CLIPImageProcessor` and `DPTFeatureExtractor` to `DPTImageProcessor` (#9002)
      
      * fix: update `CLIPFeatureExtractor` to `CLIPImageProcessor` in codebase
      
      * `make style && make quality`
      
      * Update `DPTFeatureExtractor` to `DPTImageProcessor` in codebase
      
      * `make style`
      
      ---------
      Co-authored-by: default avatarAryan <aryan@huggingface.co>
      3dc97bd1
    • omahs's avatar
      Fix typos (#9077) · 6d32b292
      omahs authored
      * fix typo
      6d32b292
    • Sayak Paul's avatar
      [Docs] add stable cascade unet doc. (#9066) · 5934873b
      Sayak Paul authored
      * add stable cascade unet doc.
      
      * fix path
      5934873b
    • Aryan's avatar
      PAG variant for HunyuanDiT, PAG refactor (#8936) · b7058d14
      Aryan authored
      
      
      * copy hunyuandit pipeline
      
      * pag variant of hunyuan dit
      
      * add tests
      
      * update docs
      
      * make style
      
      * make fix-copies
      
      * Update src/diffusers/pipelines/pag/pag_utils.py
      
      * remove incorrect copied from
      
      * remove pag hunyuan attn procs to resolve conflicts
      
      * add pag attn procs again
      
      * new implementation for pag_utils
      
      * revert pag changes
      
      * add pag refactor back; update pixart sigma
      
      * update pixart pag tests
      
      * apply suggestions from review
      
      Co-Authored-By: yixu310@gmail.com
      
      * make style
      
      * update docs, fix tests
      
      * fix tests
      
      * fix test_components_function since list not accepted as valid __init__ param
      
      * apply patch to fix broken tests
      Co-Authored-By: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * make style
      
      * fix hunyuan tests
      
      ---------
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      b7058d14
  21. 04 Aug, 2024 1 commit
  22. 03 Aug, 2024 2 commits
  23. 02 Aug, 2024 1 commit
    • Sayak Paul's avatar
      [Core] Add PAG support for PixArtSigma (#8921) · 7b98c4cc
      Sayak Paul authored
      * feat: add pixart sigma pag.
      
      * inits.
      
      * fixes
      
      * fix
      
      * remove print.
      
      * copy paste methods to the pixart pag mixin
      
      * fix-copies
      
      * add documentation.
      
      * add tests.
      
      * remove correction file.
      
      * remove pag_applied_layers
      
      * empty
      7b98c4cc
  24. 01 Aug, 2024 2 commits
  25. 30 Jul, 2024 2 commits
    • Aryan's avatar
      [core] Move community AnimateDiff ControlNet to core (#8972) · e5b94b4c
      Aryan authored
      
      
      * add animatediff controlnet to core
      
      * make style; remove unused method
      
      * fix copied from comment
      
      * add tests
      
      * changes to make tests work
      
      * add utility function to load videos
      
      * update docs
      
      * update pipeline example
      
      * make style
      
      * update docs with example
      
      * address review comments
      
      * add latest freeinit test from #8969
      
      * LoraLoaderMixin -> StableDiffusionLoraLoaderMixin
      
      * fix docs
      
      * Update src/diffusers/utils/loading_utils.py
      Co-authored-by: default avatarDhruv Nair <dhruv.nair@gmail.com>
      
      * fix: variable out of scope
      
      ---------
      Co-authored-by: default avatarDhruv Nair <dhruv.nair@gmail.com>
      e5b94b4c
    • Yoach Lacombe's avatar
      Stable Audio integration (#8716) · 69e72b1d
      Yoach Lacombe authored
      
      
      * WIP modeling code and pipeline
      
      * add custom attention processor + custom activation + add to init
      
      * correct ProjectionModel forward
      
      * add stable audio to __initèè
      
      * add autoencoder and update pipeline and modeling code
      
      * add half Rope
      
      * add partial rotary v2
      
      * add temporary modfis to scheduler
      
      * add EDM DPM Solver
      
      * remove TODOs
      
      * clean GLU
      
      * remove att.group_norm to attn processor
      
      * revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py
      
      * refactor GLU -> SwiGLU
      
      * remove redundant args
      
      * add channel multiples in autoencoder docstrings
      
      * changes in docsrtings and copyright headers
      
      * clean pipeline
      
      * further cleaning
      
      * remove peft and lora and fromoriginalmodel
      
      * Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace
      
      * make style
      
      * dummy models
      
      * fix copied from
      
      * add fast oobleck tests
      
      * add brownian tree
      
      * oobleck autoencoder slow tests
      
      * remove TODO
      
      * fast stable audio pipeline tests
      
      * add slow tests
      
      * make style
      
      * add first version of docs
      
      * wrap is_torchsde_available to the scheduler
      
      * fix slow test
      
      * test with input waveform
      
      * add input waveform
      
      * remove some todos
      
      * create stableaudio gaussian projection + make style
      
      * add pipeline to toctree
      
      * fix copied from
      
      * make quality
      
      * refactor timestep_features->time_proj
      
      * refactor joint_attention_kwargs->cross_attention_kwargs
      
      * remove forward_chunk
      
      * move StableAudioDitModel to transformers folder
      
      * correct convert + remove partial rotary embed
      
      * apply suggestions from yiyixuxu -> removing attn.kv_heads
      
      * remove temb
      
      * remove cross_attention_kwargs
      
      * further removal of cross_attention_kwargs
      
      * remove text encoder autocast to fp16
      
      * continue removing autocast
      
      * make style
      
      * refactor how text and audio are embedded
      
      * add paper
      
      * update example code
      
      * make style
      
      * unify projection model forward + fix device placement
      
      * make style
      
      * remove fuse qkv
      
      * apply suggestions from review
      
      * Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py
      Co-authored-by: default avatarYiYi Xu <yixu310@gmail.com>
      
      * make style
      
      * smaller models in fast tests
      
      * pass sequential offloading fast tests
      
      * add docs for vae and autoencoder
      
      * make style and update example
      
      * remove useless import
      
      * add cosine scheduler
      
      * dummy classes
      
      * cosine scheduler docs
      
      * better description of scheduler
      
      ---------
      Co-authored-by: default avatarYiYi Xu <yixu310@gmail.com>
      69e72b1d