1. 05 Dec, 2023 1 commit
  2. 04 Dec, 2023 4 commits
    • Steven Liu's avatar
      [docs] Add Kandinsky 3 (#5988) · b64f835e
      Steven Liu authored
      * add
      
      * fix api docs
      
      * edits
      b64f835e
    • takuoko's avatar
      [Feature] Support IP-Adapter Plus (#5915) · 0a08d419
      takuoko authored
      
      
      * Support IP-Adapter Plus
      
      * fix format
      
      * restore before black format
      
      * restore before black format
      
      * generic
      
      * Refactor PerceiverAttention
      
      * format
      
      * fix test and refactor PerceiverAttention
      
      * generic encode_image
      
      * keep attention implementation
      
      * merge tests
      
      * encode_image backward compatible
      
      * code quality
      
      * fix controlnet inpaint pipeline
      
      * refactor FFN
      
      * refactor FFN
      
      ---------
      Co-authored-by: default avatarYiYi Xu <yixu310@gmail.com>
      0a08d419
    • Dhruv Nair's avatar
      Update Tests Fetcher (#5950) · b2172922
      Dhruv Nair authored
      
      
      * update setup and deps table
      
      * update
      
      * update
      
      * update
      
      * up
      
      * up
      
      * update
      
      * up
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * update
      
      * quality fix
      
      * fix failure reporting
      
      ---------
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      b2172922
    • Parth38's avatar
      Update value_guided_sampling.py (#6027) · 8a812e4e
      Parth38 authored
      
      
      * Update value_guided_sampling.py
      
      Changed the scheduler step function as predict_epsilon parameter is not there in latest  DDPM Scheduler
      
      * Update value_guided_sampling.md
      
      Updated a link to a working notebook
      
      ---------
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      8a812e4e
  3. 02 Dec, 2023 2 commits
  4. 01 Dec, 2023 7 commits
  5. 29 Nov, 2023 6 commits
    • Suraj Patil's avatar
      Add SVD (#5895) · 63f767ef
      Suraj Patil authored
      
      
      * begin model
      
      * finish blocks
      
      * add_embedding
      
      * addition_time_embed_dim
      
      * use TimestepEmbedding
      
      * fix temporal res block
      
      * fix time_pos_embed
      
      * fix add_embedding
      
      * add conversion script
      
      * fix model
      
      * up
      
      * add new resnet blocks
      
      * make forward work
      
      * return sample in original shape
      
      * fix temb shape in TemporalResnetBlock
      
      * add spatio temporal transformers
      
      * add vae blocks
      
      * fix blocks
      
      * update
      
      * update
      
      * fix shapes in Alphablender and add time activation in res blcok
      
      * use new blocks
      
      * style
      
      * fix temb shape
      
      * fix SpatioTemporalResBlock
      
      * reuse TemporalBasicTransformerBlock
      
      * fix TemporalBasicTransformerBlock
      
      * use TransformerSpatioTemporalModel
      
      * fix TransformerSpatioTemporalModel
      
      * fix time_context dim
      
      * clean up
      
      * make temb optional
      
      * add blocks
      
      * rename model
      
      * update conversion script
      
      * remove UNetMidBlockSpatioTemporal
      
      * add in init
      
      * remove unused arg
      
      * remove unused arg
      
      * remove more unsed args
      
      * up
      
      * up
      
      * check for None
      
      * update vae
      
      * update up/mid blocks for decoder
      
      * begin pipeline
      
      * adapt scheduler
      
      * add guidance scalings
      
      * fix norm eps in temporal transformers
      
      * add temporal autoencoder
      
      * make pipeline run
      
      * fix frame decodig
      
      * decode in float32
      
      * decode n frames at a time
      
      * pass decoding_t to decode_latents
      
      * fix decode_latents
      
      * vae encode/decode in fp32
      
      * fix dtype in TransformerSpatioTemporalModel
      
      * type image_latents same as image_embeddings
      
      * allow using differnt eps in temporal block for video decoder
      
      * fix default values in vae
      
      * pass num frames in decode
      
      * switch spatial to temporal for mixing in VAE
      
      * fix num frames during split decoding
      
      * cast alpha to sample dtype
      
      * fix attention in MidBlockTemporalDecoder
      
      * fix typo
      
      * fix guidance_scales dtype
      
      * fix missing activation in TemporalDecoder
      
      * skip_post_quant_conv
      
      * add vae conversion
      
      * style
      
      * take guidance scale as input
      
      * up
      
      * allow passing PIL to export_video
      
      * accept fps as arg
      
      * add pipeline and vae in init
      
      * remove hack
      
      * use AutoencoderKLTemporalDecoder
      
      * don't scale image latents
      
      * add unet tests
      
      * clean up unet
      
      * clean TransformerSpatioTemporalModel
      
      * add slow svd test
      
      * clean up
      
      * make temb optional in Decoder mid block
      
      * fix norm eps in TransformerSpatioTemporalModel
      
      * clean up temp decoder
      
      * clean up
      
      * clean up
      
      * use c_noise values for timesteps
      
      * use math for log
      
      * update
      
      * fix copies
      
      * doc
      
      * upcast vae
      
      * update forward pass for gradient checkpointing
      
      * make added_time_ids is tensor
      
      * up
      
      * fix upcasting
      
      * remove post quant conv
      
      * add _resize_with_antialiasing
      
      * fix _compute_padding
      
      * cleanup model
      
      * more cleanup
      
      * more cleanup
      
      * more cleanup
      
      * remove freeu
      
      * remove attn slice
      
      * small clean
      
      * up
      
      * up
      
      * remove extra step kwargs
      
      * remove eta
      
      * remove dropout
      
      * remove callback
      
      * remove merge factor args
      
      * clean
      
      * clean up
      
      * move to dedicated folder
      
      * remove attention_head_dim
      
      * docstr and small fix
      
      * update unet doc strings
      
      * rename decoding_t
      
      * correct linting
      
      * store c_skip and c_out
      
      * cleanup
      
      * clean TemporalResnetBlock
      
      * more cleanup
      
      * clean up vae
      
      * clean up
      
      * begin doc
      
      * more cleanup
      
      * up
      
      * up
      
      * doc
      
      * Improve
      
      * better naming
      
      * better naming
      
      * better naming
      
      * better naming
      
      * better naming
      
      * better naming
      
      * better naming
      
      * better naming
      
      * Apply suggestions from code review
      
      * Default chunk size to None
      
      * add example
      
      * Better
      
      * Apply suggestions from code review
      
      * update doc
      
      * Update src/diffusers/pipelines/stable_diffusion_video/pipeline_stable_diffusion_video.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * style
      
      * Get torch compile working
      
      * up
      
      * rename
      
      * fix doc
      
      * add chunking
      
      * torch compile
      
      * torch compile
      
      * add modelling outputs
      
      * torch compile
      
      * Improve chunking
      
      * Apply suggestions from code review
      
      * Update docs/source/en/using-diffusers/svd.md
      
      * Close diff tag
      
      * remove slicing
      
      * resnet docstr
      
      * add docstr in resnet
      
      * rename
      
      * Apply suggestions from code review
      
      * update tests
      
      * Fix output type latents
      
      * fix more
      
      * fix more
      
      * Update docs/source/en/using-diffusers/svd.md
      
      * fix more
      
      * add pipeline tests
      
      * remove unused arg
      
      * clean  up
      
      * make sure get_scaling receives tensors
      
      * fix euler scheduler
      
      * fix get_scalings
      
      * simply euler for now
      
      * remove old test file
      
      * use randn_tensor to create noise
      
      * fix device for rand tensor
      
      * increase expected_max_difference
      
      * fix test_inference_batch_single_identical
      
      * actually fix test_inference_batch_single_identical
      
      * disable test_save_load_float16
      
      * skip test_float16_inference
      
      * skip test_inference_batch_single_identical
      
      * fix test_xformers_attention_forwardGenerator_pass
      
      * Apply suggestions from code review
      
      * update StableVideoDiffusionPipelineSlowTests
      
      * update image
      
      * add diffusers example
      
      * fix more
      
      ---------
      Co-authored-by: default avatarDhruv Nair <dhruv.nair@gmail.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarapolinário <joaopaulo.passos@gmail.com>
      63f767ef
    • PENGUINLIONG's avatar
      Fixed custom module importing on Windows (#5891) · d1b2a1a9
      PENGUINLIONG authored
      
      
      * Fixed custom module importing on Windows
      
      Windows use back slash and `os.path.join()` follows that convention.
      
      * Apply suggestions from code review
      Co-authored-by: default avatarLucain <lucainp@gmail.com>
      
      * Update pipeline_utils.py
      
      ---------
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: default avatarLucain <lucainp@gmail.com>
      d1b2a1a9
    • vahramtadevosyan's avatar
      [Pipeline] Add TextToVideoZeroSDXLPipeline (#4695) · d63a498c
      vahramtadevosyan authored
      
      
      * integrated sdxl for the text2video-zero pipeline
      
      * make fix-copies
      
      * fixed CI issues
      
      * make fix-copies
      
      * added docs and `copied from` statements
      
      * added fast tests
      
      * made a small change in docs
      
      * quality+style check fix
      
      * updated docs. added controlnet inference with sdxl
      
      * added device compatibility for fast tests
      
      * fixed docstrings
      
      * changing vae upcasting
      
      * remove torch.empty_cache to speed up inference
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * made fast tests to run on dummy models only, fixed copied from statements
      
      * fixed testing utils imports
      
      * Added bullet points for SDXL support
      
      * fixed formatting & quality
      
      * Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * fixed minor error for merging
      
      * fixed updates of sdxl
      
      * made fast tests inherit from `PipelineTesterMixin` and run in 3-4secs on CPU
      
      * make style && make quality
      
      * reimplemented fast tests w/o default attn processor
      
      * make style & make quality
      
      * make fix-copies
      
      * make fix-copies
      
      * fixed docs
      
      * make style & make quality & make fix-copies
      
      * bug fix in cross attention
      
      * make style && make quality
      
      * make fix-copies
      
      * fix gpu issues
      
      * make fix-copies
      
      * updated pipeline signature
      
      ---------
      Co-authored-by: default avatarVahram <vahram.tadevosyan@lambda-loginnode02.cm.cluster>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarDhruv Nair <dhruv.nair@gmail.com>
      d63a498c
    • Marko Kostiv's avatar
      Controlnet ssd 1b support (#5779) · 6a4aad43
      Marko Kostiv authored
      
      
      * Add SSD-1B support for controlnet model
      
      * Add conditioning_channels into ControlNet init from unet
      
      * Fix black formatting
      
      * Isort fixes
      
      * Adds SSD-1B controlnet pipeline test with UNetMidBlock2D as mid block
      
      * Overrides failing ssd-1b tests
      
      * Fixes tests after main branch update
      
      * Fixes code quality checks
      
      ---------
      Co-authored-by: default avatarMarko Kostiv <marko@linearity.io>
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      6a4aad43
    • JuanCarlosPi's avatar
      Support of ip-adapter to the StableDiffusionControlNetInpaintPipeline (#5887) · 9f7b2cf2
      JuanCarlosPi authored
      
      
      * Change pipeline_controlnet_inpaint.py to add ip-adapter support. Changes are similar to those in pipeline_controlnet
      
      * Change tests for the StableDiffusionControlNetInpaintPipeline by adding image_encoder: None
      
      * Update src/diffusers/pipelines/controlnet/pipeline_controlnet_inpaint.py
      Co-authored-by: default avatarYiYi Xu <yixu310@gmail.com>
      
      ---------
      Co-authored-by: default avatarYiYi Xu <yixu310@gmail.com>
      9f7b2cf2
    • Sayak Paul's avatar
      [LoRA refactor] move several state dict conversion utils out of lora.py (#5955) · 895c4b70
      Sayak Paul authored
      
      
      * move several state dict conversion utils out of lora.py
      
      * check
      
      * check
      
      * check
      
      * check
      
      * check
      
      * check
      
      * check
      
      * revert back
      
      * check
      
      * check
      
      * again check
      
      * maybe fix?
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      ---------
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      895c4b70
  6. 28 Nov, 2023 2 commits
  7. 27 Nov, 2023 9 commits
  8. 24 Nov, 2023 2 commits
    • Patrick von Platen's avatar
      correct num inference steps · 2a7f43a7
      Patrick von Platen authored
      2a7f43a7
    • Patrick von Platen's avatar
      [@cene555][Kandinsky 3.0] Add Kandinsky 3.0 (#5913) · b978334d
      Patrick von Platen authored
      * finalize
      
      * finalize
      
      * finalize
      
      * add slow test
      
      * add slow test
      
      * add slow test
      
      * Fix more
      
      * add slow test
      
      * fix more
      
      * fix more
      
      * fix more
      
      * fix more
      
      * fix more
      
      * fix more
      
      * fix more
      
      * fix more
      
      * fix more
      
      * Better
      
      * Fix more
      
      * Fix more
      
      * add slow test
      
      * Add auto pipelines
      
      * add slow test
      
      * Add all
      
      * add slow test
      
      * add slow test
      
      * add slow test
      
      * add slow test
      
      * add slow test
      
      * Apply suggestions from code review
      
      * add slow test
      
      * add slow test
      b978334d
  9. 21 Nov, 2023 4 commits
  10. 20 Nov, 2023 3 commits
    • Steven Liu's avatar
      [docs] Loader APIs (#5813) · 7457aa67
      Steven Liu authored
      * first draft
      
      * remove old loader doc
      
      * start adding lora code examples
      
      * finish
      
      * add link to loralinearlayer
      
      * feedback
      
      * fix
      7457aa67
    • dg845's avatar
      Update LCMScheduler Inference Timesteps to be More Evenly Spaced (#5836) · dc21498b
      dg845 authored
      
      
      * Change LCMScheduler.set_timesteps to pick more evenly spaced inference timesteps.
      
      * Change inference_indices implementation to better match previous behavior.
      
      * Add num_inference_steps=26 test case to test_inference_steps.
      
      * run CI
      
      ---------
      Co-authored-by: default avatarpatil-suraj <surajp815@gmail.com>
      dc21498b
    • Roy Hvaara's avatar
      [JAX] Replace uses of jax.devices("cpu") with jax.local_devices(backend="cpu") (#5864) · 2695ba8e
      Roy Hvaara authored
      
      
      An upcoming change to JAX will include non-local (addressable) CPU devices in jax.devices() when JAX is used multicontroller-style, where there are multiple Python processes.
      
      This change preserves the current behavior by replacing uses of jax.devices("cpu"), which previously only returned local devices, with jax.local_devices("cpu"), which will return local devices both now and in the future.
      
      This change is always safe (i.e., it should always preserve the previous behavior), but it may sometimes be unnecessary if code is never used in a multicontroller setting.
      Co-authored-by: default avatarPeter Hawkins <phawkins@google.com>
      2695ba8e