1. 23 Aug, 2023 1 commit
  2. 22 Aug, 2023 3 commits
  3. 21 Aug, 2023 1 commit
    • Sanchit Gandhi's avatar
      Add AudioLDM 2 (#4549) · 7a24977c
      Sanchit Gandhi authored
      
      
      * from audioldm
      
      * unet down + mid
      
      * vae, clap, flan-t5
      
      * start sequence audio mae
      
      * iterate on audioldm encoder
      
      * finish encoder
      
      * finish weight conversion
      
      * text pre-processing
      
      * gpt2 pre-processing
      
      * fix projection model
      
      * working
      
      * unet equivalence
      
      * finish in base
      
      * add unet cond
      
      * finish unet
      
      * finish custom unet
      
      * start clean-up
      
      * revert base unet changes
      
      * refactor pre-processing
      
      * tests: from audioldm
      
      * fix some tests
      
      * more fixes
      
      * iterate on tests
      
      * make fix copies
      
      * harden fast tests
      
      * slow integration tests
      
      * finish tests
      
      * update checkpoint
      
      * update copyright
      
      * docs
      
      * remove outdated method
      
      * add docstring
      
      * make style
      
      * remove decode latents
      
      * enable cpu offload
      
      * (text_encoder_1, tokenizer_1) -> (text_encoder, tokenizer)
      
      * more clean up
      
      * more refactor
      
      * build pr docs
      
      * Update docs/source/en/api/pipelines/audioldm2.md
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      
      * small clean
      
      * tidy conversion
      
      * update for large checkpoint
      
      * generate -> generate_language_model
      
      * full clap model
      
      * shrink clap-audio in tests
      
      * fix large integration test
      
      * fix fast tests
      
      * use generation config
      
      * make style
      
      * update docs
      
      * finish docs
      
      * finish doc
      
      * update tests
      
      * fix last test
      
      * syntax
      
      * finalise tests
      
      * refactor projection model in prep for TTS
      
      * fix fast tests
      
      * style
      
      ---------
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      7a24977c
  4. 17 Aug, 2023 2 commits
  5. 16 Aug, 2023 4 commits
    • Steven Liu's avatar
      [docs] PushToHubMixin (#4622) · 4ff7264d
      Steven Liu authored
      * push to hub docs
      
      * fix typo
      
      * feedback
      
      * make style
      4ff7264d
    • Sayak Paul's avatar
      Update text2image.md to fix the links (#4626) · 351aab60
      Sayak Paul authored
      351aab60
    • nikhil-masterful's avatar
      Add GLIGEN implementation (#4441) · da5ab51d
      nikhil-masterful authored
      * Add GLIGEN implementation
      
      * GLIGEN: Fix code quality check failures
      
      * GLIGEN: Fix Import block un-sorted or un-formatted failures
      
      * GLIGEN: Fix check_repository_consistency failures
      
      * GLIGEN: Add 'PositionNet' to versatile_diffusion/modeling_text_unet.py
      
      * GLIGEN: check_repository_consistency: fix 'copy does not match' error
      
      * GLIGEN: Fix review comments (1)
      
      * GLIGEN: Fix E721 Do not compare types, use `isinstance()` failures
      
      * GLIGEN : Ensure _encode_prompt() copy matches to StableDiffusionPipeline
      
      * GLIGEN: Fix ruff E721 failure in unidiffuser/test_unidiffuser.py
      
      * GLIGEN: doc_builder: restyle pipeline_stable_diffusion_gligen.py
      
      * GIGLEN: reset files unrelated to gligen
      
      * GLIGEN: Fix documentation comments (1)
      
      * GLIGEN: Fix review comments (2)
      
      * GLIGEN: Added FastTest
      
      * GLIGEN: Fix review comments (3)
      da5ab51d
    • Sayak Paul's avatar
      add: train to text image with sdxl script. (#4505) · 5175d3d7
      Sayak Paul authored
      
      
      * add: train to text image with sdxl script.
      Co-authored-by: default avatarCaptnSeraph <s3raph1m@gmail.com>
      
      * fix: partial func.
      
      * fix: default value of output_dir.
      
      * make style
      
      * set num inference steps to 25.
      
      * remove mentions of LoRA.
      
      * up min version
      
      * add: ema cli arg
      
      * run device placement while running step.
      
      * precompute vae encodings too.
      
      * fix
      
      * debug
      
      * should work now.
      
      * debug
      
      * debug
      
      * goes alright?
      
      * style
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * fix
      
      * reinit scheduler if prediction_type was passed.
      
      * akways cast vae in float32
      
      * better handling of snr.
      Co-authored-by: default avatarbghira <bghira@users.github.com>
      
      * the vae should be also passed
      
      * add: docs.
      
      * add: sdlx t2i tests
      
      * save the pipeline
      
      * autocast.
      
      * fix: save_model_card
      
      * fix: save_model_card.
      
      ---------
      Co-authored-by: default avatarCaptnSeraph <s3raph1m@gmail.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarbghira <bghira@users.github.com>
      5175d3d7
  6. 15 Aug, 2023 3 commits
  7. 14 Aug, 2023 1 commit
  8. 12 Aug, 2023 1 commit
  9. 10 Aug, 2023 6 commits
  10. 09 Aug, 2023 3 commits
  11. 07 Aug, 2023 1 commit
  12. 06 Aug, 2023 1 commit
  13. 03 Aug, 2023 2 commits
  14. 02 Aug, 2023 4 commits
  15. 01 Aug, 2023 2 commits
  16. 31 Jul, 2023 1 commit
  17. 28 Jul, 2023 3 commits
    • Will Berman's avatar
      fix fp type in t2i adapter docs (#4350) · 2b178673
      Will Berman authored
      2b178673
    • Sayak Paul's avatar
      [Feat] Support SDXL Kohya-style LoRA (#4287) · 4a4cdd6b
      Sayak Paul authored
      
      
      * sdxl lora changes.
      
      * better name replacement.
      
      * better replacement.
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * remove print.
      
      * print state dict keys.
      
      * print
      
      * distingisuih better
      
      * debuggable.
      
      * fxi: tyests
      
      * fix: arg from training script.
      
      * access from class.
      
      * run style
      
      * debug
      
      * save intermediate
      
      * some simplifications for SDXL LoRA
      
      * styling
      
      * unet config is not needed in diffusers format.
      
      * fix: dynamic SGM block mapping for SDXL kohya loras (#4322)
      
      * Use lora compatible layers for linear proj_in/proj_out (#4323)
      
      * improve condition for using the sgm_diffusers mapping
      
      * informative comment.
      
      * load compatible keys and embedding layer maaping.
      
      * Get SDXL 1.0 example lora to load
      
      * simplify
      
      * specif ranks and hidden sizes.
      
      * better handling of k rank and hidden
      
      * debug
      
      * debug
      
      * debug
      
      * debug
      
      * debug
      
      * fix: alpha keys
      
      * add check for handling LoRAAttnAddedKVProcessor
      
      * sanity comment
      
      * modifications for text encoder SDXL
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * denugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * up
      
      * up
      
      * up
      
      * up
      
      * up
      
      * up
      
      * unneeded comments.
      
      * unneeded comments.
      
      * kwargs for the other attention processors.
      
      * kwargs for the other attention processors.
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * improve
      
      * debugging
      
      * debugging
      
      * more print
      
      * Fix alphas
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * clean up
      
      * clean up.
      
      * debugging
      
      * fix: text
      
      ---------
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarBatuhan Taskaya <batuhan@python.org>
      4a4cdd6b
    • Patrick von Platen's avatar
      [SDXL] Make watermarker optional under certain circumstances to improve... · b7b6d613
      Patrick von Platen authored
      [SDXL] Make watermarker optional under certain circumstances to improve usability of SDXL 1.0 (#4346)
      
      * improve sdxl
      
      * more fixes
      
      * improve sdxl
      
      * improve sdxl
      
      * improve sdxl
      
      * finish
      b7b6d613
  18. 27 Jul, 2023 1 commit