1. 05 Sep, 2023 1 commit
    • [Core] better support offloading when side loading is enabled. (#4855) · e4b8e792
      Sayak Paul authored
      * better support offloading when side loading is enabled.
      
      * load_textual_inversion
      
      * better messaging for textual inversion.
      
      * fixes
      
      * address PR feedback.
      
      * sdxl support.
      
      * improve messaging
      
      * recursive removal when cpu sequential offloading is enabled.
      
      * add: lora tests
      
      * recurse.
      
      * add: offload tests for textual inversion.
  2. 04 Sep, 2023 3 commits
    • [Core] LoRA improvements pt. 3 (#4842) · c81a88b2
      Sayak Paul authored
      
      
      * throw warning when more than one lora is attempted to be fused.
      
      * introduce support of lora scale during fusion.
      
      * change test name
      
      * changes
      
      * change to _lora_scale
      
      * lora_scale to call whenever applicable.
      
      * debugging
      
      * lora_scale additional.
      
      * cross_attention_kwargs
      
      * lora_scale -> scale.
      
      * lora_scale fix
      
      * lora_scale in patched projection.
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * styling.
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * remove unneeded prints.
      
      * remove unneeded prints.
      
      * assign cross_attention_kwargs.
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * clean up.
      
      * refactor scale retrieval logic a bit.
      
      * fix nonetype
      
      * fix: tests
      
      * add more tests
      
      * more fixes.
      
      * figure out a way to pass lora_scale.
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * unify the retrieval logic of lora_scale.
      
      * move adjust_lora_scale_text_encoder to lora.py.
      
      * introduce dynamic adjustment lora scale support to sd
      
      * fix up copies
      
      * Empty-Commit
      
      * add: test to check fusion equivalence on different scales.
      
      * handle lora fusion warning.
      
      * make lora smaller
      
      * make lora smaller
      
      * make lora smaller
      
      ---------
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
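The "lora scale during fusion" and "fusion equivalence on different scales" items above can be illustrated with a minimal sketch. It assumes the usual LoRA formulation, where fusing adds `scale * (up @ down)` to the base weight. The helper names (`matmul`, `fuse_lora`, `unfuse_lora`, `linear`) are hypothetical and written in plain Python; they are not the diffusers API.

```python
def matmul(a, b):
    """Multiply two matrices given as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def fuse_lora(weight, lora_up, lora_down, scale=1.0):
    """Return weight + scale * (lora_up @ lora_down) as a new matrix."""
    delta = matmul(lora_up, lora_down)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


def unfuse_lora(fused, lora_up, lora_down, scale=1.0):
    """Undo fuse_lora by subtracting the same scaled low-rank update."""
    return fuse_lora(fused, lora_up, lora_down, scale=-scale)


def linear(weight, x):
    """Apply a weight matrix to a single input vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weight]


# Toy values, chosen only for illustration.
W = [[1.0, 0.0], [0.0, 1.0]]   # base weight, out_features x in_features
UP = [[1.0], [2.0]]            # LoRA up projection, out_features x rank
DOWN = [[0.5, -0.5]]           # LoRA down projection, rank x in_features
x = [2.0, 4.0]
scale = 0.5

# Path 1: fuse once, then run a plain linear layer.
fused = fuse_lora(W, UP, DOWN, scale)
fused_out = linear(fused, x)

# Path 2: keep LoRA separate and apply the scale at call time.
base_out = linear(W, x)
lora_out = linear(UP, linear(DOWN, x))
dynamic_out = [b + scale * l for b, l in zip(base_out, lora_out)]
```

Under these assumptions both paths give the same output for any `scale`, which is the kind of property a fusion-equivalence test would check. Unfusing with the same `scale` recovers the base weight.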
    • allow passing components to connected pipelines when using the combined pipeline (#4883) · 2c1677ee
      YiYi Xu authored
      
      
      * fix
      
      * add test
      
      ---------
      Co-authored-by: yiyixuxu <yixu310@gmail,com>
    • Fix get_dummy_inputs for Stable Diffusion Inpaint Tests (#4845) · c73e609a
      dg845 authored
      
      
      * Change StableDiffusionInpaintPipelineFastTests.get_dummy_inputs to produce a random image and a white mask_image.
      
      * Add dummy expected slices for the test_stable_diffusion_inpaint tests.
      
      * Remove print statement
      
      ---------
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
  3. 02 Sep, 2023 2 commits
  4. 01 Sep, 2023 5 commits
  5. 30 Aug, 2023 2 commits
    • Fix Unfuse Lora (#4833) · 9f1936d2
      Patrick von Platen authored
      * Fix Unfuse Lora
      
      * add tests
      
      * Fix more
      
      * Fix more
      
      * Fix all
      
      * make style
      
      * make style
    • [Core] refactor encode_prompt (#4617) · 3768d4d7
      Sayak Paul authored
      
      
      * refactoring of encode_prompt()
      
      * better handling of device.
      
      * fix: device determination
      
      * fix: device determination 2
      
      * handle num_images_per_prompt
      
      * revert changes in loaders.py and give birth to encode_prompt().
      
      * minor refactoring for encode_prompt().
      
      * make backward compatible.
      
      * Apply suggestions from code review
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      
      * fix: concatenation of the neg and pos embeddings.
      
      * incorporate encode_prompt() in test_stable_diffusion.py
      
      * turn it into big PR.
      
      * make it bigger
      
      * gligen fixes.
      
      * more fixes to gligen
      
      * _encode_prompt -> encode_prompt in tests
      
      * first batch
      
      * second batch
      
      * fix blasphemous mistake
      
      * fix
      
      * fix: hopefully for the final time.
      
      ---------
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
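The "make backward compatible" and "concatenation of the neg and pos embeddings" items above suggest a pattern that can be sketched as follows: a new function returns the positive and negative embeddings separately, and a legacy wrapper joins them in the order negative first, positive second, as classifier-free guidance code commonly expects. This is a toy sketch with a hypothetical stand-in embedding and hypothetical function names, not the pipeline code itself.

```python
def toy_embed(text):
    """Hypothetical stand-in for a text encoder: two numbers per prompt."""
    return [float(len(text)), float(text.count(" ") + 1)]


def encode_prompt(prompt, negative_prompt=""):
    """New-style API (sketch): return the two embedding batches separately."""
    prompt_embeds = [toy_embed(prompt)]
    negative_prompt_embeds = [toy_embed(negative_prompt)]
    return prompt_embeds, negative_prompt_embeds


def legacy_encode_prompt(prompt, negative_prompt=""):
    """Old-style API (sketch): one batch, negative embeddings first."""
    prompt_embeds, negative_prompt_embeds = encode_prompt(prompt, negative_prompt)
    # Concatenate along the batch axis: [negative, positive].
    return negative_prompt_embeds + prompt_embeds


combined = legacy_encode_prompt("a cat", "blurry")
```

Keeping the concatenation inside the wrapper lets older callers receive a single batch while newer callers handle the two halves independently.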
  6. 29 Aug, 2023 4 commits
  7. 28 Aug, 2023 5 commits
  8. 26 Aug, 2023 2 commits
  9. 25 Aug, 2023 5 commits
  10. 24 Aug, 2023 2 commits
  11. 23 Aug, 2023 2 commits
    • add a step_index counter (#4347) · cd21b965
      YiYi Xu authored
      
      
      add self.step_index
      
      ---------
      Co-authored-by: yiyixuxu <yixu310@gmail,com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
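As a rough illustration of what a `step_index` counter on a scheduler can look like, here is a toy sketch. The class `ToyScheduler`, its update rule, and its values are all hypothetical. The only idea taken from the commit title is that the scheduler tracks its position with a counter instead of searching for the timestep on every call.

```python
class ToyScheduler:
    """Toy scheduler that tracks its position with a step_index counter."""

    def __init__(self, timesteps, sigmas):
        # One more sigma than timesteps, so every step has a "next" sigma.
        assert len(sigmas) == len(timesteps) + 1
        self.timesteps = list(timesteps)
        self.sigmas = list(sigmas)
        self.step_index = None  # unset until the first call to step()

    def step(self, sample, timestep):
        if self.step_index is None:
            # Look the timestep up once; later calls only increment.
            self.step_index = self.timesteps.index(timestep)
        sigma = self.sigmas[self.step_index]
        sigma_next = self.sigmas[self.step_index + 1]
        self.step_index += 1
        # Placeholder update rule: rescale the sample to the next noise level.
        return sample * sigma_next / sigma


scheduler = ToyScheduler(timesteps=[30, 20, 10], sigmas=[8.0, 4.0, 2.0, 1.0])
sample = 8.0
for t in scheduler.timesteps:
    sample = scheduler.step(sample, t)
```

One plausible benefit of a counter, assumed here rather than stated in the commit, is that it stays unambiguous even if the same timestep value appears more than once in the schedule.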
    • Fix AutoencoderTiny encoder scaling convention (#4682) · 052bf328
      Ollin Boer Bohan authored
      * Fix AutoencoderTiny encoder scaling convention
      
        * Add [-1, 1] -> [0, 1] rescaling to EncoderTiny
      
        * Move [0, 1] -> [-1, 1] rescaling from AutoencoderTiny.decode to DecoderTiny
          (i.e. immediately after the final conv, as early as possible)
      
        * Fix missing [0, 255] -> [0, 1] rescaling in AutoencoderTiny.forward
      
        * Update AutoencoderTinyIntegrationTests to protect against scaling issues.
          The new test constructs a simple image, round-trips it through AutoencoderTiny,
          and confirms the decoded result is approximately equal to the source image.
          This test checks behavior with and without tiling enabled.
          This test will fail if new AutoencoderTiny scaling issues are introduced.
      
        * Context: Raw TAESD weights expect images in [0, 1], but diffusers'
          convention represents images with zero-centered values in [-1, 1],
          so AutoencoderTiny needs to scale / unscale images at the start of
          encoding and at the end of decoding in order to work with diffusers.
      
      * Re-add existing AutoencoderTiny test, update golden values
      
      * Add comments to AutoencoderTiny.forward
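The scaling convention described in this commit message can be sketched in a few lines. The raw encoder and decoder below are hypothetical identity stand-ins for the TAESD weights, and all function names are illustrative. Only the two rescaling steps, [-1, 1] -> [0, 1] before encoding and [0, 1] -> [-1, 1] after decoding, come from the message.

```python
def to_unit_range(values):
    """Map zero-centered values in [-1, 1] to [0, 1]."""
    return [(v + 1.0) / 2.0 for v in values]


def to_centered_range(values):
    """Map values in [0, 1] back to zero-centered [-1, 1]."""
    return [v * 2.0 - 1.0 for v in values]


def raw_encode(values):
    """Hypothetical stand-in for the raw encoder, which expects [0, 1] input."""
    return list(values)


def raw_decode(latents):
    """Hypothetical stand-in for the raw decoder, which emits [0, 1] output."""
    return list(latents)


def encode(image):
    # Rescale at the very start of encoding.
    return raw_encode(to_unit_range(image))


def decode(latents):
    # Rescale immediately after the decoder's final layer.
    return to_centered_range(raw_decode(latents))


image = [-1.0, 0.0, 0.5, 1.0]          # pixels in the [-1, 1] convention
round_trip = decode(encode(image))     # should approximately equal image
```

A round-trip check like this mirrors the idea of the integration test described above: if either rescaling step is dropped or applied twice, the decoded values no longer match the source image.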
  12. 22 Aug, 2023 2 commits
  13. 21 Aug, 2023 1 commit
    • Add AudioLDM 2 (#4549) · 7a24977c
      Sanchit Gandhi authored
      
      
      * from audioldm
      
      * unet down + mid
      
      * vae, clap, flan-t5
      
      * start sequence audio mae
      
      * iterate on audioldm encoder
      
      * finish encoder
      
      * finish weight conversion
      
      * text pre-processing
      
      * gpt2 pre-processing
      
      * fix projection model
      
      * working
      
      * unet equivalence
      
      * finish in base
      
      * add unet cond
      
      * finish unet
      
      * finish custom unet
      
      * start clean-up
      
      * revert base unet changes
      
      * refactor pre-processing
      
      * tests: from audioldm
      
      * fix some tests
      
      * more fixes
      
      * iterate on tests
      
      * make fix copies
      
      * harden fast tests
      
      * slow integration tests
      
      * finish tests
      
      * update checkpoint
      
      * update copyright
      
      * docs
      
      * remove outdated method
      
      * add docstring
      
      * make style
      
      * remove decode latents
      
      * enable cpu offload
      
      * (text_encoder_1, tokenizer_1) -> (text_encoder, tokenizer)
      
      * more clean up
      
      * more refactor
      
      * build pr docs
      
      * Update docs/source/en/api/pipelines/audioldm2.md
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * small clean
      
      * tidy conversion
      
      * update for large checkpoint
      
      * generate -> generate_language_model
      
      * full clap model
      
      * shrink clap-audio in tests
      
      * fix large integration test
      
      * fix fast tests
      
      * use generation config
      
      * make style
      
      * update docs
      
      * finish docs
      
      * finish doc
      
      * update tests
      
      * fix last test
      
      * syntax
      
      * finalise tests
      
      * refactor projection model in prep for TTS
      
      * fix fast tests
      
      * style
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
  14. 18 Aug, 2023 1 commit
  15. 17 Aug, 2023 3 commits