1. 25 Aug, 2023 1 commit
    • [AudioLDM 2] Pipeline fixes (#4738) · 29a11c2a
      Sanchit Gandhi authored
      * fix docs
      
      * fix unet docs
      
      * use image output for latents
      
      * fix hub checkpoints
      
      * fix pipeline example
      
      * update example
      
      * return_dict = False
      
      * revert image pipeline output
      
      * revert doc changes
      
      * remove dtype test
      
      * make style
      
      * remove docstring updates
      
      * remove unet docstring update
      
      * Empty commit to re-trigger CI
      
      * fix cpu offload
      
      * fix dtype test
      
      * add offload test
  2. 24 Aug, 2023 2 commits
  3. 22 Aug, 2023 2 commits
  4. 21 Aug, 2023 1 commit
    • Add AudioLDM 2 (#4549) · 7a24977c
      Sanchit Gandhi authored
      
      
      * from audioldm
      
      * unet down + mid
      
      * vae, clap, flan-t5
      
      * start sequence audio mae
      
      * iterate on audioldm encoder
      
      * finish encoder
      
      * finish weight conversion
      
      * text pre-processing
      
      * gpt2 pre-processing
      
      * fix projection model
      
      * working
      
      * unet equivalence
      
      * finish in base
      
      * add unet cond
      
      * finish unet
      
      * finish custom unet
      
      * start clean-up
      
      * revert base unet changes
      
      * refactor pre-processing
      
      * tests: from audioldm
      
      * fix some tests
      
      * more fixes
      
      * iterate on tests
      
      * make fix copies
      
      * harden fast tests
      
      * slow integration tests
      
      * finish tests
      
      * update checkpoint
      
      * update copyright
      
      * docs
      
      * remove outdated method
      
      * add docstring
      
      * make style
      
      * remove decode latents
      
      * enable cpu offload
      
      * (text_encoder_1, tokenizer_1) -> (text_encoder, tokenizer)
      
      * more clean up
      
      * more refactor
      
      * build pr docs
      
      * Update docs/source/en/api/pipelines/audioldm2.md
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * small clean
      
      * tidy conversion
      
      * update for large checkpoint
      
      * generate -> generate_language_model
      
      * full clap model
      
      * shrink clap-audio in tests
      
      * fix large integration test
      
      * fix fast tests
      
      * use generation config
      
      * make style
      
      * update docs
      
      * finish docs
      
      * finish doc
      
      * update tests
      
      * fix last test
      
      * syntax
      
      * finalise tests
      
      * refactor projection model in prep for TTS
      
      * fix fast tests
      
      * style
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
  5. 17 Aug, 2023 2 commits
  6. 16 Aug, 2023 2 commits
    • [Core] feat: MultiControlNet support for SDXL ControlNet pipeline (#4597) · 50495991
      Sayak Paul authored
      * core: add multicontrolnet support to sdxl controlnet
      
      * modify checks.
      
      * fix: original_size determination
      
      * add: tests for multi controlnet sdxl.
      
      * remove unnecessary prints.
    • Add GLIGEN implementation (#4441) · da5ab51d
      nikhil-masterful authored
      * Add GLIGEN implementation
      
      * GLIGEN: Fix code quality check failures
      
      * GLIGEN: Fix Import block un-sorted or un-formatted failures
      
      * GLIGEN: Fix check_repository_consistency failures
      
      * GLIGEN: Add 'PositionNet' to versatile_diffusion/modeling_text_unet.py
      
      * GLIGEN: check_repository_consistency: fix 'copy does not match' error
      
      * GLIGEN: Fix review comments (1)
      
      * GLIGEN: Fix E721 Do not compare types, use `isinstance()` failures
      
      * GLIGEN : Ensure _encode_prompt() copy matches to StableDiffusionPipeline
      
      * GLIGEN: Fix ruff E721 failure in unidiffuser/test_unidiffuser.py
      
      * GLIGEN: doc_builder: restyle pipeline_stable_diffusion_gligen.py
      
      * GLIGEN: reset files unrelated to gligen
      
      * GLIGEN: Fix documentation comments (1)
      
      * GLIGEN: Fix review comments (2)
      
      * GLIGEN: Added FastTest
      
      * GLIGEN: Fix review comments (3)
  7. 15 Aug, 2023 1 commit
  8. 11 Aug, 2023 1 commit
  9. 09 Aug, 2023 2 commits
  10. 07 Aug, 2023 1 commit
  11. 04 Aug, 2023 2 commits
  12. 03 Aug, 2023 2 commits
  13. 01 Aug, 2023 2 commits
  14. 28 Jul, 2023 2 commits
    • [ONNX] Don't download ONNX model by default (#4338) · 306a7bd0
      Patrick von Platen authored
      * [Download] Don't download ONNX weights by default
      
      * fix more
      
      * finish
    • [SDXL Refiner] Fix refiner forward pass for batched input (#4327) · 18b018c8
      Patrick von Platen authored
      * fix_batch_xl
      
      * Fix other pipelines as well
      
      * up
      
      * Update tests/pipelines/stable_diffusion_xl/test_stable_diffusion_xl_inpaint.py
      
      * sort
      
      * up
      
      * Finish it all up
      
      ---------
      Co-authored-by: Bagheera <bghira@users.github.com>
  15. 27 Jul, 2023 2 commits
  16. 26 Jul, 2023 1 commit
  17. 25 Jul, 2023 2 commits
    • [SDXL and IP2P]: instruction pix2pix XL training and pipeline (#4079) · 428dbfec
      Harutatsu Akiyama authored
      
      
      * Support instruction pix2pix sdxl
      
      * [Community] Implementation of the IADB community pipeline (#3996)
      
      * community pipeline: implementation of iadb
      
      * iadb.py: reformat using black
      
      * iadb.py: linting update
      
      * add kandinsky to readme table (#4081)
      Co-authored-by: yiyixuxu <yixu310@gmail,com>
      
      * [From Single File] Force accelerate to be installed (#4078)
      
      force accelerate to be installed
      
      * Support instruction pix2pix sdxl
      
      * Clean up IP2P SDXL code
      
      * [IP2P and SDXL] clean up code
      
      * [IP2P SDXL] Address code reviews
      
      * [IP2P SDXL] Address code reviews, add docs, tests
      
      * [IP2P SDXL] Address code reviews
      
      * [IP2P SDXL] Add README_SDXL
      
      * [IP2P SDXL] Address code reviews
      
      * [IP2P SDXL] Fix the copy problems
      
      * [IP2P SDXL] Add license
      
      * [IP2P SDXL] Address code review for selecting VAE and others
      
      * [IP2P SDXL] Update README_sdxl
      
      * [IP2P SDXL] Update __init__
      
      * [IP2P SDXL] Update dummy_torch_and_transformers_and_invisible_watermark_objects
      
      * address patrick's comments and some additions to readmes.
      
      ---------
      Co-authored-by: Harutatsu Akiyama <kf.zy.qin@gmail.com>
      Co-authored-by: Thomas Chambon <36728882+tchambon@users.noreply.github.com>
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      Co-authored-by: yiyixuxu <yixu310@gmail,com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
    • [draft v2] AutoPipeline (#4138) · c11d11d6
      YiYi Xu authored
      
      
      * initial
      
      * style
      
      * from ...pipelines -> from ..pipeline_util
      
      * make style
      
      * fix-copies
      
      * fix value_guided_sampling oops
      
      * style
      
      * add test
      
      * Show failing test
      
      * update from_pipe
      
      * fix
      
      * add controlnet, additional test and register unused original config
      
      * update for controlnet
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * store unused config as private attribute and pass if can
      
      * add doc
      
      * kandinsky inpaint pipeline does not work with decoder checkpoint
      
      * update doc
      
      * Apply suggestions from code review
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * style
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * fix
      
      * Apply suggestions from code review
      
      ---------
      Co-authored-by: yiyixuxu <yixu310@gmail,com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
  18. 24 Jul, 2023 5 commits
  19. 21 Jul, 2023 1 commit
  20. 20 Jul, 2023 3 commits
  21. 19 Jul, 2023 1 commit
  22. 18 Jul, 2023 2 commits
    • [Core] add: controlnet support for SDXL (#4038) · 3eb498e7
      Sayak Paul authored
      * add: controlnet sdxl.
      
      * modifications to controlnet.
      
      * run styling.
      
      * add: __init__.pys
      
      * incorporate https://github.com/huggingface/diffusers/pull/4019 changes.
      
      * run make fix-copies.
      
      * resize the conditioning images.
      
      * remove autocast.
      
      * run styling.
      
      * disable autocast.
      
      * debugging
      
      * device placement.
      
      * back to autocast.
      
      * remove comment.
      
      * save some memory by reusing the vae and unet in the pipeline.
      
      * apply styling.
      
      * Allow low precision sd xl
      
      * finish
      
      * changes to accommodate the improved VAE.
      
      * modifications to how we handle vae encoding in the training.
      
      * make style
      
      * make existing controlnet fast tests pass.
      
      * change vae checkpoint cli arg.
      
      * fix: vae pretrained paths.
      
      * fix: steps in get_scheduler().
      
      * debugging.
      
      * fix: weight conversion.
      
      * add: docs.
      
      * add: limited tests.
      
      * add: datasets to the requirements.
      
      * update docstrings and incorporate the usage of watermarking.
      
      * incorporate fix from #4083
      
      * fix watermarking dependency handling.
      
      * run make-fix-copies.
      
      * Empty-Commit
      
      * Update requirements_sdxl.txt
      
      * remove vae upcasting part.
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * run make style
      
      * run make fix-copies.
      
      * disable support for multicontrolnet.
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * run make fix-copies.
      
      * style.
      
      * fix-copies.
      
      ---------
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
    • Add Recent Timestep Scheduling Improvements to DDIM Inverse Scheduler (#3865) · c6e56e92
      clarencechen authored
      * Add Recent Timestep Scheduling Improvements to DDIM Inverse Scheduler
      
      Roll timesteps by one to reflect origin-destination semantic discrepancy
      
      Restore `set_alpha_to_one` option to handle negative initial timesteps
      
      Remove `set_alpha_to_zero` option not used due to previous truncation
      
      * Bugfix
      
      * Remove unnecessary calls to `detach()`
      
      Use `self.image_processor.preprocess` in DiffEdit pipeline functions
      
      * Preprocess list input for inverted image latents in diffedit pipeline
      
      * Add `timestep_spacing` and `steps_offset` to `DPMSolverMultistepInverseScheduler`
      
      * Update expected test results to account for inverting last forward diffusion step
      
      * Fix inversion progress bar bug
      
      * Add first draft for proper fast tests for DDIMInverseScheduler
      
      * Add deprecated DDIMInverseScheduler kwarg to ConfigMixin registry
      
      * Fix test failure in DPMSolverMultistepInverseScheduler
      
      Invert step specification leads to negative noise variance in SDE-based algs
      
      Add first draft for proper fast tests for DPMSolverMultistepInverseScheduler
      
      * Update expected test results to account for inverting last forward diffusion step
      
      Clean up diffedit fast test