1. 23 Aug, 2023 1 commit
    • Fix AutoencoderTiny encoder scaling convention (#4682) · 052bf328
      Ollin Boer Bohan authored
      * Fix AutoencoderTiny encoder scaling convention
      
        * Add [-1, 1] -> [0, 1] rescaling to EncoderTiny
      
        * Move [0, 1] -> [-1, 1] rescaling from AutoencoderTiny.decode to DecoderTiny
          (i.e. immediately after the final conv, as early as possible)
      
        * Fix missing [0, 255] -> [0, 1] rescaling in AutoencoderTiny.forward
      
        * Update AutoencoderTinyIntegrationTests to protect against scaling issues.
          The new test constructs a simple image, round-trips it through AutoencoderTiny,
          and confirms the decoded result is approximately equal to the source image.
          This test checks behavior with and without tiling enabled.
          This test will fail if new AutoencoderTiny scaling issues are introduced.
      
        * Context: Raw TAESD weights expect images in [0, 1], but diffusers'
          convention represents images with zero-centered values in [-1, 1],
          so AutoencoderTiny needs to scale / unscale images at the start of
          encoding and at the end of decoding in order to work with diffusers.
      
      * Re-add existing AutoencoderTiny test, update golden values
      
      * Add comments to AutoencoderTiny.forward
      052bf328
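The range conventions in the context note above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the diffusers implementation: the helper names `from_uint8`, `to_zero_centered`, and `to_unit_range` are hypothetical, and plain Python lists stand in for image tensors.

```python
# Hypothetical helpers illustrating the value ranges named in the commit
# message. They are NOT the actual diffusers functions.

def from_uint8(pixels):
    """Map 8-bit pixel values in [0, 255] to floats in [0, 1]."""
    return [p / 255.0 for p in pixels]


def to_zero_centered(unit_values):
    """Map [0, 1] values to the zero-centered [-1, 1] convention."""
    return [2.0 * v - 1.0 for v in unit_values]


def to_unit_range(centered_values):
    """Map [-1, 1] values back to the [0, 1] range raw TAESD weights expect."""
    return [(v + 1.0) / 2.0 for v in centered_values]


pixels = [0, 51, 255]
unit = from_uint8(pixels)          # values in [0, 1]
centered = to_zero_centered(unit)  # values in [-1, 1]
restored = to_unit_range(centered) # back to [0, 1], as done before encoding
```

The round trip returns the original `[0, 1]` values, which is the same property the new integration test checks at the image level.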
  2. 22 Aug, 2023 5 commits
  3. 21 Aug, 2023 1 commit
    • Add AudioLDM 2 (#4549) · 7a24977c
      Sanchit Gandhi authored
      
      
      * from audioldm
      
      * unet down + mid
      
      * vae, clap, flan-t5
      
      * start sequence audio mae
      
      * iterate on audioldm encoder
      
      * finish encoder
      
      * finish weight conversion
      
      * text pre-processing
      
      * gpt2 pre-processing
      
      * fix projection model
      
      * working
      
      * unet equivalence
      
      * finish in base
      
      * add unet cond
      
      * finish unet
      
      * finish custom unet
      
      * start clean-up
      
      * revert base unet changes
      
      * refactor pre-processing
      
      * tests: from audioldm
      
      * fix some tests
      
      * more fixes
      
      * iterate on tests
      
      * make fix copies
      
      * harden fast tests
      
      * slow integration tests
      
      * finish tests
      
      * update checkpoint
      
      * update copyright
      
      * docs
      
      * remove outdated method
      
      * add docstring
      
      * make style
      
      * remove decode latents
      
      * enable cpu offload
      
      * (text_encoder_1, tokenizer_1) -> (text_encoder, tokenizer)
      
      * more clean up
      
      * more refactor
      
      * build pr docs
      
      * Update docs/source/en/api/pipelines/audioldm2.md
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * small clean
      
      * tidy conversion
      
      * update for large checkpoint
      
      * generate -> generate_language_model
      
      * full clap model
      
      * shrink clap-audio in tests
      
      * fix large integration test
      
      * fix fast tests
      
      * use generation config
      
      * make style
      
      * update docs
      
      * finish docs
      
      * finish doc
      
      * update tests
      
      * fix last test
      
      * syntax
      
      * finalise tests
      
      * refactor projection model in prep for TTS
      
      * fix fast tests
      
      * style
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      7a24977c
  4. 18 Aug, 2023 2 commits
  5. 17 Aug, 2023 6 commits
  6. 16 Aug, 2023 4 commits
    • [docs] PushToHubMixin (#4622) · 4ff7264d
      Steven Liu authored
      * push to hub docs
      
      * fix typo
      
      * feedback
      
      * make style
      4ff7264d
    • [Core] feat: MultiControlNet support for SDXL ControlNet pipeline (#4597) · 50495991
      Sayak Paul authored
      * core: add multicontrolnet support to sdxl controlnet
      
      * modify checks.
      
      * fix: original_size determination
      
      * add: tests for multi controlnet sdxl.
      
      * remove unnecessary prints.
      50495991
    • Fix unipc use_karras_sigmas exception - fixes huggingface/diffusers#4580 (#4581) · a7de9650
      Dirk Morris authored
      * Fix unipc karras sigmas exception - fixes huggingface/diffusers#4580
      
      * Add unipc scheduler tests for karras sigmas
      a7de9650
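For context on the option named in this commit, the noise schedule from Karras et al. (2022) can be sketched as below. The commit message does not describe the schedule itself, so this is an illustration only: the function name `karras_sigmas`, the default `rho=7.0`, and the example sigma bounds are assumptions, and the code is not taken from the scheduler.

```python
# A minimal sketch of the Karras et al. (2022) sigma schedule.
# The function name and the example values are illustrative assumptions.

def karras_sigmas(sigma_min, sigma_max, num_steps, rho=7.0):
    """Return num_steps noise levels decreasing from sigma_max to sigma_min."""
    if num_steps < 2:
        raise ValueError("num_steps must be at least 2")
    min_inv_rho = sigma_min ** (1.0 / rho)
    max_inv_rho = sigma_max ** (1.0 / rho)
    # Interpolate linearly in sigma^(1/rho) space, then raise back to rho.
    ramp = [i / (num_steps - 1) for i in range(num_steps)]
    return [(max_inv_rho + t * (min_inv_rho - max_inv_rho)) ** rho for t in ramp]


sigmas = karras_sigmas(sigma_min=0.1, sigma_max=10.0, num_steps=5)
```

Interpolating in `sigma ** (1 / rho)` space concentrates more steps near the low-noise end than a linear spacing would.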
    • Add GLIGEN implementation (#4441) · da5ab51d
      nikhil-masterful authored
      * Add GLIGEN implementation
      
      * GLIGEN: Fix code quality check failures
      
      * GLIGEN: Fix Import block un-sorted or un-formatted failures
      
      * GLIGEN: Fix check_repository_consistency failures
      
      * GLIGEN: Add 'PositionNet' to versatile_diffusion/modeling_text_unet.py
      
      * GLIGEN: check_repository_consistency: fix 'copy does not match' error
      
      * GLIGEN: Fix review comments (1)
      
      * GLIGEN: Fix E721 Do not compare types, use `isinstance()` failures
      
      * GLIGEN: Ensure _encode_prompt() copy matches StableDiffusionPipeline
      
      * GLIGEN: Fix ruff E721 failure in unidiffuser/test_unidiffuser.py
      
      * GLIGEN: doc_builder: restyle pipeline_stable_diffusion_gligen.py
      
      * GLIGEN: reset files unrelated to gligen
      
      * GLIGEN: Fix documentation comments (1)
      
      * GLIGEN: Fix review comments (2)
      
      * GLIGEN: Added FastTest
      
      * GLIGEN: Fix review comments (3)
      da5ab51d
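The E721 failures mentioned in this commit concern direct type comparisons. A short sketch of the difference follows; the class names are hypothetical and do not come from the GLIGEN code.

```python
# Hypothetical classes used only to illustrate the E721 lint rule.

class PositionEmbedder:
    pass


class FourierPositionEmbedder(PositionEmbedder):
    pass


layer = FourierPositionEmbedder()

# Flagged by E721: an exact type comparison ignores subclasses.
is_embedder_by_type = type(layer) == PositionEmbedder

# Preferred form: isinstance() respects inheritance.
is_embedder_by_isinstance = isinstance(layer, PositionEmbedder)
```

The two checks disagree for subclass instances, so switching to `isinstance()` can change behavior as well as satisfy the linter.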
  7. 15 Aug, 2023 2 commits
  8. 12 Aug, 2023 1 commit
  9. 11 Aug, 2023 3 commits
  10. 10 Aug, 2023 3 commits
  11. 09 Aug, 2023 2 commits
  12. 08 Aug, 2023 2 commits
    • Copy lora functions to XLPipelines (#4512) · c7c0b575
      Wooyeol Baek authored
      * add load_lora_weights and save_lora_weights to StableDiffusionXLImg2ImgPipeline
      
      * add load_lora_weights and save_lora_weights to StableDiffusionXLInpaintPipeline
      
      * apply black format
      
      * apply black format
      
      * add copy statement
      
      * fix statements
      
      * fix statements
      
      * fix statements
      
      * run `make fix-copies`
      c7c0b575
    • Fix misc typos (#4479) · f0725c58
      George He authored
      Fix typos
      f0725c58
  13. 07 Aug, 2023 1 commit
  14. 04 Aug, 2023 3 commits
  15. 03 Aug, 2023 4 commits