1. 29 Nov, 2023 13 commits
    • Suraj Patil's avatar
      Add SVD (#5895) · 63f767ef
      Suraj Patil authored
      
      
      * begin model
      
      * finish blocks
      
      * add_embedding
      
      * addition_time_embed_dim
      
      * use TimestepEmbedding
      
      * fix temporal res block
      
      * fix time_pos_embed
      
      * fix add_embedding
      
      * add conversion script
      
      * fix model
      
      * up
      
      * add new resnet blocks
      
      * make forward work
      
      * return sample in original shape
      
      * fix temb shape in TemporalResnetBlock
      
      * add spatio temporal transformers
      
      * add vae blocks
      
      * fix blocks
      
      * update
      
      * update
      
      * fix shapes in Alphablender and add time activation in res blcok
      
      * use new blocks
      
      * style
      
      * fix temb shape
      
      * fix SpatioTemporalResBlock
      
      * reuse TemporalBasicTransformerBlock
      
      * fix TemporalBasicTransformerBlock
      
      * use TransformerSpatioTemporalModel
      
      * fix TransformerSpatioTemporalModel
      
      * fix time_context dim
      
      * clean up
      
      * make temb optional
      
      * add blocks
      
      * rename model
      
      * update conversion script
      
      * remove UNetMidBlockSpatioTemporal
      
      * add in init
      
      * remove unused arg
      
      * remove unused arg
      
      * remove more unsed args
      
      * up
      
      * up
      
      * check for None
      
      * update vae
      
      * update up/mid blocks for decoder
      
      * begin pipeline
      
      * adapt scheduler
      
      * add guidance scalings
      
      * fix norm eps in temporal transformers
      
      * add temporal autoencoder
      
      * make pipeline run
      
      * fix frame decodig
      
      * decode in float32
      
      * decode n frames at a time
      
      * pass decoding_t to decode_latents
      
      * fix decode_latents
      
      * vae encode/decode in fp32
      
      * fix dtype in TransformerSpatioTemporalModel
      
      * type image_latents same as image_embeddings
      
      * allow using differnt eps in temporal block for video decoder
      
      * fix default values in vae
      
      * pass num frames in decode
      
      * switch spatial to temporal for mixing in VAE
      
      * fix num frames during split decoding
      
      * cast alpha to sample dtype
      
      * fix attention in MidBlockTemporalDecoder
      
      * fix typo
      
      * fix guidance_scales dtype
      
      * fix missing activation in TemporalDecoder
      
      * skip_post_quant_conv
      
      * add vae conversion
      
      * style
      
      * take guidance scale as input
      
      * up
      
      * allow passing PIL to export_video
      
      * accept fps as arg
      
      * add pipeline and vae in init
      
      * remove hack
      
      * use AutoencoderKLTemporalDecoder
      
      * don't scale image latents
      
      * add unet tests
      
      * clean up unet
      
      * clean TransformerSpatioTemporalModel
      
      * add slow svd test
      
      * clean up
      
      * make temb optional in Decoder mid block
      
      * fix norm eps in TransformerSpatioTemporalModel
      
      * clean up temp decoder
      
      * clean up
      
      * clean up
      
      * use c_noise values for timesteps
      
      * use math for log
      
      * update
      
      * fix copies
      
      * doc
      
      * upcast vae
      
      * update forward pass for gradient checkpointing
      
      * make added_time_ids is tensor
      
      * up
      
      * fix upcasting
      
      * remove post quant conv
      
      * add _resize_with_antialiasing
      
      * fix _compute_padding
      
      * cleanup model
      
      * more cleanup
      
      * more cleanup
      
      * more cleanup
      
      * remove freeu
      
      * remove attn slice
      
      * small clean
      
      * up
      
      * up
      
      * remove extra step kwargs
      
      * remove eta
      
      * remove dropout
      
      * remove callback
      
      * remove merge factor args
      
      * clean
      
      * clean up
      
      * move to dedicated folder
      
      * remove attention_head_dim
      
      * docstr and small fix
      
      * update unet doc strings
      
      * rename decoding_t
      
      * correct linting
      
      * store c_skip and c_out
      
      * cleanup
      
      * clean TemporalResnetBlock
      
      * more cleanup
      
      * clean up vae
      
      * clean up
      
      * begin doc
      
      * more cleanup
      
      * up
      
      * up
      
      * doc
      
      * Improve
      
      * better naming
      
      * better naming
      
      * better naming
      
      * better naming
      
      * better naming
      
      * better naming
      
      * better naming
      
      * better naming
      
      * Apply suggestions from code review
      
      * Default chunk size to None
      
      * add example
      
      * Better
      
      * Apply suggestions from code review
      
      * update doc
      
      * Update src/diffusers/pipelines/stable_diffusion_video/pipeline_stable_diffusion_video.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * style
      
      * Get torch compile working
      
      * up
      
      * rename
      
      * fix doc
      
      * add chunking
      
      * torch compile
      
      * torch compile
      
      * add modelling outputs
      
      * torch compile
      
      * Improve chunking
      
      * Apply suggestions from code review
      
      * Update docs/source/en/using-diffusers/svd.md
      
      * Close diff tag
      
      * remove slicing
      
      * resnet docstr
      
      * add docstr in resnet
      
      * rename
      
      * Apply suggestions from code review
      
      * update tests
      
      * Fix output type latents
      
      * fix more
      
      * fix more
      
      * Update docs/source/en/using-diffusers/svd.md
      
      * fix more
      
      * add pipeline tests
      
      * remove unused arg
      
      * clean  up
      
      * make sure get_scaling receives tensors
      
      * fix euler scheduler
      
      * fix get_scalings
      
      * simply euler for now
      
      * remove old test file
      
      * use randn_tensor to create noise
      
      * fix device for rand tensor
      
      * increase expected_max_difference
      
      * fix test_inference_batch_single_identical
      
      * actually fix test_inference_batch_single_identical
      
      * disable test_save_load_float16
      
      * skip test_float16_inference
      
      * skip test_inference_batch_single_identical
      
      * fix test_xformers_attention_forwardGenerator_pass
      
      * Apply suggestions from code review
      
      * update StableVideoDiffusionPipelineSlowTests
      
      * update image
      
      * add diffusers example
      
      * fix more
      
      ---------
      Co-authored-by: default avatarDhruv Nair <dhruv.nair@gmail.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarapolinário <joaopaulo.passos@gmail.com>
      63f767ef
    • PENGUINLIONG's avatar
      Fixed custom module importing on Windows (#5891) · d1b2a1a9
      PENGUINLIONG authored
      
      
      * Fixed custom module importing on Windows
      
      Windows use back slash and `os.path.join()` follows that convention.
      
      * Apply suggestions from code review
      Co-authored-by: default avatarLucain <lucainp@gmail.com>
      
      * Update pipeline_utils.py
      
      ---------
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: default avatarLucain <lucainp@gmail.com>
      d1b2a1a9
    • Kashif Rasul's avatar
      [Wuerstchen] Adapt lora training example scripts to use PEFT (#5959) · 01782c22
      Kashif Rasul authored
      * Adapt lora example scripts to use PEFT
      
      * add to_out.0
      01782c22
    • vahramtadevosyan's avatar
      [Pipeline] Add TextToVideoZeroSDXLPipeline (#4695) · d63a498c
      vahramtadevosyan authored
      
      
      * integrated sdxl for the text2video-zero pipeline
      
      * make fix-copies
      
      * fixed CI issues
      
      * make fix-copies
      
      * added docs and `copied from` statements
      
      * added fast tests
      
      * made a small change in docs
      
      * quality+style check fix
      
      * updated docs. added controlnet inference with sdxl
      
      * added device compatibility for fast tests
      
      * fixed docstrings
      
      * changing vae upcasting
      
      * remove torch.empty_cache to speed up inference
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * made fast tests to run on dummy models only, fixed copied from statements
      
      * fixed testing utils imports
      
      * Added bullet points for SDXL support
      
      * fixed formatting & quality
      
      * Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      * fixed minor error for merging
      
      * fixed updates of sdxl
      
      * made fast tests inherit from `PipelineTesterMixin` and run in 3-4secs on CPU
      
      * make style && make quality
      
      * reimplemented fast tests w/o default attn processor
      
      * make style & make quality
      
      * make fix-copies
      
      * make fix-copies
      
      * fixed docs
      
      * make style & make quality & make fix-copies
      
      * bug fix in cross attention
      
      * make style && make quality
      
      * make fix-copies
      
      * fix gpu issues
      
      * make fix-copies
      
      * updated pipeline signature
      
      ---------
      Co-authored-by: default avatarVahram <vahram.tadevosyan@lambda-loginnode02.cm.cluster>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      Co-authored-by: default avatarDhruv Nair <dhruv.nair@gmail.com>
      d63a498c
    • Marko Kostiv's avatar
      Controlnet ssd 1b support (#5779) · 6a4aad43
      Marko Kostiv authored
      
      
      * Add SSD-1B support for controlnet model
      
      * Add conditioning_channels into ControlNet init from unet
      
      * Fix black formatting
      
      * Isort fixes
      
      * Adds SSD-1B controlnet pipeline test with UNetMidBlock2D as mid block
      
      * Overrides failing ssd-1b tests
      
      * Fixes tests after main branch update
      
      * Fixes code quality checks
      
      ---------
      Co-authored-by: default avatarMarko Kostiv <marko@linearity.io>
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      6a4aad43
    • Steven Liu's avatar
      [docs] LCM training (#5796) · ddd8bd53
      Steven Liu authored
      * first draft
      
      * feedback
      ddd8bd53
    • JuanCarlosPi's avatar
      Support of ip-adapter to the StableDiffusionControlNetInpaintPipeline (#5887) · 9f7b2cf2
      JuanCarlosPi authored
      
      
      * Change pipeline_controlnet_inpaint.py to add ip-adapter support. Changes are similar to those in pipeline_controlnet
      
      * Change tests for the StableDiffusionControlNetInpaintPipeline by adding image_encoder: None
      
      * Update src/diffusers/pipelines/controlnet/pipeline_controlnet_inpaint.py
      Co-authored-by: default avatarYiYi Xu <yixu310@gmail.com>
      
      ---------
      Co-authored-by: default avatarYiYi Xu <yixu310@gmail.com>
      9f7b2cf2
    • Sayak Paul's avatar
      [LoRA refactor] move several state dict conversion utils out of lora.py (#5955) · 895c4b70
      Sayak Paul authored
      
      
      * move several state dict conversion utils out of lora.py
      
      * check
      
      * check
      
      * check
      
      * check
      
      * check
      
      * check
      
      * check
      
      * revert back
      
      * check
      
      * check
      
      * again check
      
      * maybe fix?
      
      * Apply suggestions from code review
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      
      ---------
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      895c4b70
    • Linh Nguyen's avatar
      Rename output_dir argument (#5916) · 636feba5
      Linh Nguyen authored
      Fix typo in output_dir argument: "text-inversion-model" → "dreambooth-model"
      636feba5
    • Andrés Romero's avatar
      [bug fix] Inpainting for MultiAdapter (#5922) · 79dc7df0
      Andrés Romero authored
      
      
      * bug in MultiAdapter for Inpainting
      
      * adapter_input is a list for MultiAdapter
      
      ---------
      Co-authored-by: default avatarandres <andres@hax.ai>
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      79dc7df0
    • Charchit Sharma's avatar
      added doc for Kandinsky3.0 (#5937) · 6031ecbd
      Charchit Sharma authored
      
      
      * added en doc for Kandinsky3.0
      
      * required changes
      
      * Update docs/source/en/api/pipelines/kandinsky3.md
      
      * Update docs/source/en/api/pipelines/kandinsky3.md
      
      * Update docs/source/en/api/pipelines/kandinsky3.md
      
      ---------
      Co-authored-by: default avatarYiYi Xu <yixu310@gmail.com>
      Co-authored-by: default avatarPatrick von Platen <patrick.v.platen@gmail.com>
      6031ecbd
    • Sayak Paul's avatar
      [Tests] Refactor `test_examples.py` for better readability (#5946) · fdd003d8
      Sayak Paul authored
      * control and custom diffusion
      
      * dreambooth
      
      * instructpix2pix and dreambooth ckpting
      
      * t2i adapters.
      
      * text to image ft
      
      * textual inversion
      
      * unconditional
      
      * workflows
      
      * import fix
      
      * fix import
      fdd003d8
    • Steven Liu's avatar
      [docs] Update pipeline list (#5952) · 172acc98
      Steven Liu authored
      add to list
      172acc98
  2. 28 Nov, 2023 2 commits
  3. 27 Nov, 2023 17 commits
  4. 24 Nov, 2023 3 commits
  5. 23 Nov, 2023 2 commits
  6. 22 Nov, 2023 1 commit
  7. 21 Nov, 2023 2 commits