1. 31 Aug, 2023 1 commit
    • [docs] ControlNet guide (#4640) · aedd7876
      Steven Liu authored
      * first draft
      
      * finish first draft
      
      * feedback and remove sections from API pages
      
      * clean docstrings
      
      * add full code example
  2. 30 Aug, 2023 2 commits
  3. 29 Aug, 2023 1 commit
    • add models for T2I-Adapter-XL (#4696) · 12358b98
      Chong Mou authored
      
      
      * T2I-Adapter-XL
      
      * update
      
      * update
      
      * add pipeline
      
      * modify pipeline
      
      * modify pipeline
      
      * modify pipeline
      
      * modify pipeline
      
      * modify pipeline
      
      * modify modeling_text_unet
      
      * fix styling.
      
      * fix: copies.
      
      * adapter settings
      
      * new test case
      
      * new test case
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * revert prints.
      
      * new test case
      
      * remove print
      
      * org test case
      
      * add test_pipeline
      
      * styling.
      
      * fix copies.
      
      * modify test parameter
      
      * style.
      
      * add adapter-xl doc
      
      * double quotes in docs
      
      * Fix potential type mismatch
      
      * style.
      
      ---------
      Co-authored-by: sayakpaul <spsayakpaul@gmail.com>
  4. 28 Aug, 2023 1 commit
    • [WIP] Add Fabric (#4201) · fd35689f
      Shauray Singh authored
      
      
      * empty PR
      
      * init
      
      * changes
      
      * starting with the pipeline
      
      * stable diff
      
      * prev
      
      * more things, getting started
      
      * more functions
      
      * making it more readable
      
      * almost done testing
      
      * var changes
      
      * testing
      
      * device
      
      * device support
      
      * maybe
      
      * device malfunctions
      
      * new new
      
      * register
      
      * testing
      
      * exec does not work
      
      * float
      
      * change info
      
      * change of architecture
      
      * might work
      
      * testing with colab
      
      * more attn stuff
      
      * stupid additions
      
      * documenting and testing
      
      * writing tests
      
      * more docs
      
      * tests and docs
      
      * remove test
      
      * empty PR
      
      * init
      
      * changes
      
      * starting with the pipeline
      
      * stable diff
      
      * prev
      
      * more things, getting started
      
      * more functions
      
      * making it more readable
      
      * almost done testing
      
      * var changes
      
      * testing
      
      * device
      
      * device support
      
      * maybe
      
      * device malfunctions
      
      * new new
      
      * register
      
      * testing
      
      * exec does not work
      
      * float
      
      * change info
      
      * change of architecture
      
      * might work
      
      * testing with colab
      
      * more attn stuff
      
      * stupid additions
      
      * documenting and testing
      
      * writing tests
      
      * more docs
      
      * tests and docs
      
      * remove test
      
      * change cross attention
      
      * revert back
      
      * tests
      
      * reverting back to orig
      
      * changes
      
      * test passing
      
      * pipeline changes
      
      * before quality
      
      * quality checks pass
      
      * remove print statements
      
      * doc fixes
      
      * __init__ error something
      
      * update docs, working on dim
      
      * working on encoding
      
      * doc fix
      
      * more fixes
      
      * no more dependent on 512*512
      
      * update docs
      
      * fixes
      
      * test passing
      
      * remove comment
      
      * fixes and migration
      
      * simpler tests
      
      * doc changes
      
      * green CI
      
      * changes
      
      * more docs
      
      * changes
      
      * new images
      
      * to community examples
      
      * selete
      
      * more fixes
      
      * changes
      
      * fix
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
  5. 26 Aug, 2023 2 commits
  6. 25 Aug, 2023 2 commits
  7. 24 Aug, 2023 1 commit
  8. 23 Aug, 2023 2 commits
  9. 22 Aug, 2023 2 commits
  10. 21 Aug, 2023 1 commit
    • Add AudioLDM 2 (#4549) · 7a24977c
      Sanchit Gandhi authored
      
      
      * from audioldm
      
      * unet down + mid
      
      * vae, clap, flan-t5
      
      * start sequence audio mae
      
      * iterate on audioldm encoder
      
      * finish encoder
      
      * finish weight conversion
      
      * text pre-processing
      
      * gpt2 pre-processing
      
      * fix projection model
      
      * working
      
      * unet equivalence
      
      * finish in base
      
      * add unet cond
      
      * finish unet
      
      * finish custom unet
      
      * start clean-up
      
      * revert base unet changes
      
      * refactor pre-processing
      
      * tests: from audioldm
      
      * fix some tests
      
      * more fixes
      
      * iterate on tests
      
      * make fix copies
      
      * harden fast tests
      
      * slow integration tests
      
      * finish tests
      
      * update checkpoint
      
      * update copyright
      
      * docs
      
      * remove outdated method
      
      * add docstring
      
      * make style
      
      * remove decode latents
      
      * enable cpu offload
      
      * (text_encoder_1, tokenizer_1) -> (text_encoder, tokenizer)
      
      * more clean up
      
      * more refactor
      
      * build pr docs
      
      * Update docs/source/en/api/pipelines/audioldm2.md
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * small clean
      
      * tidy conversion
      
      * update for large checkpoint
      
      * generate -> generate_language_model
      
      * full clap model
      
      * shrink clap-audio in tests
      
      * fix large integration test
      
      * fix fast tests
      
      * use generation config
      
      * make style
      
      * update docs
      
      * finish docs
      
      * finish doc
      
      * update tests
      
      * fix last test
      
      * syntax
      
      * finalise tests
      
      * refactor projection model in prep for TTS
      
      * fix fast tests
      
      * style
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
  11. 17 Aug, 2023 2 commits
  12. 16 Aug, 2023 4 commits
    • [docs] PushToHubMixin (#4622) · 4ff7264d
      Steven Liu authored
      * push to hub docs
      
      * fix typo
      
      * feedback
      
      * make style
    • Update text2image.md to fix the links (#4626) · 351aab60
      Sayak Paul authored
    • Add GLIGEN implementation (#4441) · da5ab51d
      nikhil-masterful authored
      * Add GLIGEN implementation
      
      * GLIGEN: Fix code quality check failures
      
      * GLIGEN: Fix Import block un-sorted or un-formatted failures
      
      * GLIGEN: Fix check_repository_consistency failures
      
      * GLIGEN: Add 'PositionNet' to versatile_diffusion/modeling_text_unet.py
      
      * GLIGEN: check_repository_consistency: fix 'copy does not match' error
      
      * GLIGEN: Fix review comments (1)
      
      * GLIGEN: Fix E721 Do not compare types, use `isinstance()` failures
      
      * GLIGEN : Ensure _encode_prompt() copy matches to StableDiffusionPipeline
      
      * GLIGEN: Fix ruff E721 failure in unidiffuser/test_unidiffuser.py
      
      * GLIGEN: doc_builder: restyle pipeline_stable_diffusion_gligen.py
      
      * GLIGEN: reset files unrelated to gligen
      
      * GLIGEN: Fix documentation comments (1)
      
      * GLIGEN: Fix review comments (2)
      
      * GLIGEN: Added FastTest
      
      * GLIGEN: Fix review comments (3)
    • add: train to text image with sdxl script. (#4505) · 5175d3d7
      Sayak Paul authored
      
      
      * add: train to text image with sdxl script.
      Co-authored-by: CaptnSeraph <s3raph1m@gmail.com>
      
      * fix: partial func.
      
      * fix: default value of output_dir.
      
      * make style
      
      * set num inference steps to 25.
      
      * remove mentions of LoRA.
      
      * up min version
      
      * add: ema cli arg
      
      * run device placement while running step.
      
      * precompute vae encodings too.
      
      * fix
      
      * debug
      
      * should work now.
      
      * debug
      
      * debug
      
      * goes alright?
      
      * style
      
      * debugging
      
      * debugging
      
      * debugging
      
      * debugging
      
      * fix
      
      * reinit scheduler if prediction_type was passed.
      
      * always cast vae in float32
      
      * better handling of snr.
      Co-authored-by: bghira <bghira@users.github.com>
      
      * the vae should be also passed
      
      * add: docs.
      
      * add: sdxl t2i tests
      
      * save the pipeline
      
      * autocast.
      
      * fix: save_model_card
      
      * fix: save_model_card.
      
      ---------
      Co-authored-by: CaptnSeraph <s3raph1m@gmail.com>
      Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: bghira <bghira@users.github.com>
  13. 15 Aug, 2023 3 commits
  14. 14 Aug, 2023 1 commit
  15. 12 Aug, 2023 1 commit
  16. 10 Aug, 2023 6 commits
  17. 09 Aug, 2023 3 commits
  18. 07 Aug, 2023 1 commit
  19. 06 Aug, 2023 1 commit
  20. 03 Aug, 2023 2 commits
  21. 02 Aug, 2023 1 commit
    • [Feat] add tiny Autoencoder for (almost) instant decoding (#4384) · 18fc40c1
      Sayak Paul authored
      
      
      * add: model implementation of tiny autoencoder.
      
      * add: inits.
      
      * push the latest devs.
      
      * add: conversion script and finish.
      
      * add: scaling factor args.
      
      * debugging
      
      * fix denormalization.
      
      * fix: positional argument.
      
      * handle use_torch_2_0_or_xformers.
      
      * handle post_quant_conv
      
      * handle dtype
      
      * fix: sdxl image processor for tiny ae.
      
      * fix: sdxl image processor for tiny ae.
      
      * unify upcasting logic.
      
      * copied from madness.
      
      * remove trailing whitespace.
      
      * set is_tiny_vae = False
      
      * address PR comments.
      
      * change to AutoencoderTiny
      
      * make act_fn an str throughout
      
      * fix: apply_forward_hook decorator call
      
      * get rid of the special is_tiny_vae flag.
      
      * directly scale the output.
      
      * fix dummies?
      
      * fix: act_fn.
      
      * get rid of the Clamp() layer.
      
      * bring back copied from.
      
      * movement of the blocks to appropriate modules.
      
      * add: docstrings to AutoencoderTiny
      
      * add: documentation.
      
      * changes to the conversion script.
      
      * add doc entry.
      
      * settle tests.
      
      * style
      
      * add one slow test.
      
      * fix
      
      * fix 2
      
      * fix 2
      
      * fix: 4
      
      * fix: 5
      
      * finish integration tests
      
      * Apply suggestions from code review
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * style
      
      ---------
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>