1. 07 Nov, 2024 1 commit
    • [Core] introduce `controlnet` module (#8768) · ded3db16
      Sayak Paul authored
      
      
      * move vae flax module.
      
      * controlnet module.
      
      * prepare for PR.
      
      * revert a commit
      
      * gracefully deprecate controlnet deps.
      
      * fix
      
      * fix doc path
      
      * fix-copies
      
      * fix path
      
      * style
      
      * style
      
      * conflicts
      
      * fix
      
      * fix-copies
      
      * sparsectrl.
      
      * updates
      
      * fix
      
      * updates
      
      * updates
      
      * updates
      
      * fix
      
      ---------
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
  2. 05 Nov, 2024 1 commit
    • [core] Mochi T2V (#9769) · 3f329a42
      Aryan authored
      
      
      * update
      
      * update
      
      * update transformer
      
      * make style
      
      * fix
      
      * add conversion script
      
      * update
      
      * fix
      
      * update
      
      * fix
      
      * update
      
      * fixes
      
      * make style
      
      * update
      
      * update
      
      * update
      
      * init
      
      * update
      
      * update
      
      * add
      
      * up
      
      * up
      
      * up
      
      * update
      
      * mochi transformer
      
      * remove original implementation
      
      * make style
      
      * update inits
      
      * update conversion script
      
      * docs
      
      * Update src/diffusers/pipelines/mochi/pipeline_mochi.py
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      
      * Update src/diffusers/pipelines/mochi/pipeline_mochi.py
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      
      * fix docs
      
      * pipeline fixes
      
      * make style
      
      * invert sigmas in scheduler; fix pipeline
      
      * fix pipeline num_frames
      
      * flip proj and gate in swiglu
      
      * make style
      
      * fix
      
      * make style
      
      * fix tests
      
      * latent mean and std fix
      
      * update
      
      * cherry-pick 1069d210e1b9e84a366cdc7a13965626ea258178
      
      * remove additional sigma already handled by flow match scheduler
      
      * fix
      
      * remove hardcoded value
      
      * replace conv1x1 with linear
      
      * Update src/diffusers/pipelines/mochi/pipeline_mochi.py
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      
      * framewise decoding and conv_cache
      
      * make style
      
      * Apply suggestions from code review
      
      * mochi vae encoder changes
      
      * rebase correctly
      
      * Update scripts/convert_mochi_to_diffusers.py
      
      * fix tests
      
      * fixes
      
      * make style
      
      * update
      
      * make style
      
      * update
      
      * add framewise and tiled encoding
      
      * make style
      
      * make original vae implementation behaviour the default; note: framewise encoding does not work
      
      * remove framewise encoding implementation due to presence of attn layers
      
      * fight test 1
      
      * fight test 2
      
      ---------
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      Co-authored-by: yiyixuxu <yixu310@gmail.com>
  3. 29 Oct, 2024 1 commit
  4. 22 Oct, 2024 1 commit
  5. 21 Oct, 2024 3 commits
    • minor doc/test update (#9734) · e2d037bb
      YiYi Xu authored
      
      
      * update some docs and tests!
      
      ---------
      Co-authored-by: Aryan <contact.aryanvs@gmail.com>
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: Aryan <aryan@huggingface.co>
      Co-authored-by: apolinário <joaopaulo.passos@gmail.com>
    • [Docs] docs to xlabs controlnets. (#9688) · d27ecc59
      Sayak Paul authored
      
      
      * docs to xlabs controlnets.
      Co-authored-by: Anzhella Pankratova <son0shad@gmail.com>
      
      * Apply suggestions from code review
      Co-authored-by: Anzhella Pankratova <54744846+Anghellia@users.noreply.github.com>
      
      ---------
      Co-authored-by: Anzhella Pankratova <son0shad@gmail.com>
      Co-authored-by: Anzhella Pankratova <54744846+Anghellia@users.noreply.github.com>
    • [Quantization] Add quantization support for `bitsandbytes` (#9213) · b821f006
      Sayak Paul authored
      * quantization config.
      
      * fix-copies
      
      * fix
      
      * modules_to_not_convert
      
      * add bitsandbytes utilities.
      
      * make progress.
      
      * fixes
      
      * quality
      
      * up
      
      * up
      
      rotary embedding refactor 2: update comments, fix dtype for use_real=False (#9312)
      
      fix notes and dtype
      
      up
      
      up
      
      * minor
      
      * up
      
      * up
      
      * fix
      
      * provide credits where due.
      
      * make configurations work.
      
      * fixes
      
      * fix
      
      * update_missing_keys
      
      * fix
      
      * fix
      
      * make it work.
      
      * fix
      
      * provide credits to transformers.
      
      * empty commit
      
      * handle to() better.
      
      * tests
      
      * change to bnb from bitsandbytes
      
      * fix tests
      
      fix slow quality tests
      
      SD3 remark
      
      fix
      
      complete int4 tests
      
      add a readme to the test files.
      
      add model cpu offload tests
      
      warning test
      
      * better safeguard.
      
      * change merging status
      
      * courtesy to transformers.
      
      * move upper.
      
      * better
      
      * make the unused kwargs warning friendlier.
      
      * harmonize changes with https://github.com/huggingface/transformers/pull/33122
      
      
      
      * style
      
      * training tests
      
      * feedback part i.
      
      * Add Flux inpainting and Flux Img2Img (#9135)
      
      ---------
      Co-authored-by: yiyixuxu <yixu310@gmail.com>
      
      Update `UNet2DConditionModel`'s error messages (#9230)
      
      * refactor
      
      [CI] Update Single file Nightly Tests (#9357)
      
      * update
      
      * update
      
      feedback.
      
      improve README for flux dreambooth lora (#9290)
      
      * improve readme
      
      * improve readme
      
      * improve readme
      
      * improve readme
      
      fix one uncaught deprecation warning for accessing vae_latent_channels in VaeImagePreprocessor (#9372)
      
      deprecation warning vae_latent_channels
      
      add mixed int8 tests and more tests to nf4.
      
      [core] Freenoise memory improvements (#9262)
      
      * update
      
      * implement prompt interpolation
      
      * make style
      
      * resnet memory optimizations
      
      * more memory optimizations; todo: refactor
      
      * update
      
      * update animatediff controlnet with latest changes
      
      * refactor chunked inference changes
      
      * remove print statements
      
      * update
      
      * chunk -> split
      
      * remove changes from incorrect conflict resolution
      
      * remove changes from incorrect conflict resolution
      
      * add explanation of SplitInferenceModule
      
      * update docs
      
      * Revert "update docs"
      
      This reverts commit c55a50a271b2cefa8fe340a4f2a3ab9b9d374ec0.
      
      * update docstring for freenoise split inference
      
      * apply suggestions from review
      
      * add tests
      
      * apply suggestions from review
      
      quantization docs.
      
      docs.
      
      * Revert "Add Flux inpainting and Flux Img2Img (#9135)"
      
      This reverts commit 5799954dd4b3d753c7c1b8d722941350fe4f62ca.
      
      * tests
      
      * don
      
      * Apply suggestions from code review
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * contribution guide.
      
      * changes
      
      * empty
      
      * fix tests
      
      * harmonize with https://github.com/huggingface/transformers/pull/33546
      
      .
      
      * numpy_cosine_distance
      
      * config_dict modification.
      
      * remove if config comment.
      
      * note for load_state_dict changes.
      
      * float8 check.
      
      * quantizer.
      
      * raise an error for non-True low_cpu_mem_usage values when using quant.
      
      * low_cpu_mem_usage shenanigans when using fp32 modules.
      
      * don't re-assign _pre_quantization_type.
      
      * make comments clear.
      
      * remove comments.
      
      * handle mixed types better when moving to cpu.
      
      * add tests to check if we're throwing warning rightly.
      
      * better check.
      
      * fix 8bit test_quality.
      
      * handle dtype more robustly.
      
      * better message when keep_in_fp32_modules.
      
      * handle dtype casting.
      
      * fix dtype checks in pipeline.
      
      * fix warning message.
      
      * Update src/diffusers/models/modeling_utils.py
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      
      * mitigate the confusing cpu warning
      
      ---------
      Co-authored-by: Vishnu V Jaddipal <95531133+Gothos@users.noreply.github.com>
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
  6. 16 Oct, 2024 1 commit
  7. 14 Oct, 2024 1 commit
    • CogView3Plus DiT (#9570) · 8d81564b
      Yuxuan.Zhang authored
      * merge 9588
      
      * max_shard_size="5GB" for colab running
      
      * conversion script updates; modeling test; refactor transformer
      
      * make fix-copies
      
      * Update convert_cogview3_to_diffusers.py
      
      * initial pipeline draft
      
      * make style
      
      * fight bugs 🐛
      
      🪳
      
      * add example
      
      * add tests; refactor
      
      * make style
      
      * make fix-copies
      
      * add co-author
      
      YiYi Xu <yixu310@gmail.com>
      
      * remove files
      
      * add docs
      
      * add co-author
      Co-Authored-By: YiYi Xu <yixu310@gmail.com>
      
      * fight docs
      
      * address reviews
      
      * make style
      
      * make model work
      
      * remove qkv fusion
      
      * remove qkv fusion tests
      
      * address review comments
      
      * fix make fix-copies error
      
      * remove None and TODO
      
      * for FP16(draft)
      
      * make style
      
      * remove dynamic cfg
      
      * remove pooled_projection_dim as a parameter
      
      * fix tests
      
      ---------
      Co-authored-by: Aryan <aryan@huggingface.co>
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
  8. 09 Oct, 2024 1 commit
  9. 01 Oct, 2024 1 commit
  10. 30 Sep, 2024 1 commit
  11. 25 Sep, 2024 1 commit
  12. 23 Sep, 2024 2 commits
  13. 17 Sep, 2024 2 commits
  14. 16 Sep, 2024 2 commits
    • [docs] Replace runwayml/stable-diffusion-v1-5 with Lykon/dreamshaper-8 (#9428) · b52119ae
      suzukimain authored
      
      
      * [docs] Replace runwayml/stable-diffusion-v1-5 with Lykon/dreamshaper-8
      
      Updated documentation as runwayml/stable-diffusion-v1-5 has been removed from Huggingface.
      
      * Update docs/source/en/using-diffusers/inpaint.md
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Replace with stable-diffusion-v1-5/stable-diffusion-v1-5
      
      * Update inpaint.md
      
      ---------
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
    • CogVideoX-5b-I2V support (#9418) · 8336405e
      Yuxuan.Zhang authored
      
      
      * draft Init
      
      * draft
      
      * vae encode image
      
      * make style
      
      * image latents preparation
      
      * remove image encoder from conversion script
      
      * fix minor bugs
      
      * make pipeline work
      
      * make style
      
      * remove debug prints
      
      * fix imports
      
      * update example
      
      * make fix-copies
      
      * add fast tests
      
      * fix import
      
      * update vae
      
      * update docs
      
      * update image link
      
      * apply suggestions from review
      
      * apply suggestions from review
      
      * add slow test
      
      * make use of learned positional embeddings
      
      * apply suggestions from review
      
      * doc change
      
      * Update convert_cogvideox_to_diffusers.py
      
      * make style
      
      * final changes
      
      * make style
      
      * fix tests
      
      ---------
      Co-authored-by: Aryan <aryan@huggingface.co>
  15. 11 Sep, 2024 1 commit
  16. 09 Sep, 2024 2 commits
  17. 04 Sep, 2024 1 commit
  18. 02 Sep, 2024 1 commit
    • [core] Support VideoToVideo with CogVideoX (#9333) · 0e6a8403
      Aryan authored
      * add vid2vid pipeline for cogvideox
      
      * make fix-copies
      
      * update docs
      
      * fake context parallel cache, vae encode tiling
      
      * add test for cog vid2vid
      
      * use video link from HF docs repo
      
      * add copied from comments; correctly rename test class
  19. 30 Aug, 2024 1 commit
  20. 27 Aug, 2024 1 commit
  21. 25 Aug, 2024 1 commit
  22. 23 Aug, 2024 1 commit
  23. 21 Aug, 2024 1 commit
  24. 18 Aug, 2024 1 commit
  25. 13 Aug, 2024 2 commits
    • [refactor] CogVideoX followups + tiled decoding support (#9150) · a85b34e7
      Aryan authored
      * refactor context parallel cache; update torch compile time benchmark
      
      * add tiling support
      
      * make style
      
      * remove num_frames % 8 == 0 requirement
      
      * update default num_frames to original value
      
      * add explanations + refactor
      
      * update torch compile example
      
      * update docs
      
      * update
      
      * clean up if-statements
      
      * address review comments
      
      * add test for vae tiling
      
      * update docs
      
      * update docs
      
      * update docstrings
      
      * add modeling test for cogvideox transformer
      
      * make style
    • Support SD3 controlnet inpainting (#9099) · cc051309
      林金鹏 authored
      
      
      * add controlnet inpainting pipeline
      
      * [SD3] add controlnet inpaint example
      
      * update example and fix code style
      
      * fix code style with ruff
      
      * Update controlnet_sd3.md : add control inpaint pipeline
      
      * Update docs/source/en/api/pipelines/controlnet_sd3.md
      Co-authored-by: Aryan <contact.aryanvs@gmail.com>
      
      * Update docs/source/en/api/pipelines/controlnet_sd3.md
      Co-authored-by: Aryan <contact.aryanvs@gmail.com>
      
      * Update docs/source/en/api/pipelines/controlnet_sd3.md
      Co-authored-by: Aryan <contact.aryanvs@gmail.com>
      
      * Update src/diffusers/pipelines/controlnet_sd3/pipeline_stable_diffusion_3_controlnet_inpainting.py
      Co-authored-by: Aryan <contact.aryanvs@gmail.com>
      
      * Update __init__.py : add sd3 control pipelines
      
      * Update pipeline : add new param doc & check input reference.
      
      * fix typo
      
      * make style & make quality
      
      * add unittest for sd3 controlnet inpaint
      
      ---------
      Co-authored-by: 鹏徙 <linjinpeng.ljp@alibaba-inc.com>
      Co-authored-by: Aryan <contact.aryanvs@gmail.com>
  26. 08 Aug, 2024 1 commit
  27. 07 Aug, 2024 4 commits
  28. 06 Aug, 2024 2 commits
  29. 05 Aug, 2024 1 commit