1. 01 Aug, 2024 4 commits
  2. 30 Jul, 2024 5 commits
    • Fix Stable Audio repository id (#9016) · ea1b4ea7
      Yoach Lacombe authored
      Fix Stable Audio repo id
    • [core] Move community AnimateDiff ControlNet to core (#8972) · e5b94b4c
      Aryan authored
      
      
      * add animatediff controlnet to core
      
      * make style; remove unused method
      
      * fix copied from comment
      
      * add tests
      
      * changes to make tests work
      
      * add utility function to load videos
      
      * update docs
      
      * update pipeline example
      
      * make style
      
      * update docs with example
      
      * address review comments
      
      * add latest freeinit test from #8969
      
      * LoraLoaderMixin -> StableDiffusionLoraLoaderMixin
      
      * fix docs
      
      * Update src/diffusers/utils/loading_utils.py
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      
      * fix: variable out of scope
      
      ---------
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
    • Stable Audio integration (#8716) · 69e72b1d
      Yoach Lacombe authored
      
      
      * WIP modeling code and pipeline
      
      * add custom attention processor + custom activation + add to init
      
      * correct ProjectionModel forward
      
      * add stable audio to __init__
      
      * add autoencoder and update pipeline and modeling code
      
      * add half Rope
      
      * add partial rotary v2
      
      * add temporary modifications to scheduler
      
      * add EDM DPM Solver
      
      * remove TODOs
      
      * clean GLU
      
      * remove att.group_norm to attn processor
      
      * revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py
      
      * refactor GLU -> SwiGLU
      
      * remove redundant args
      
      * add channel multiples in autoencoder docstrings
      
      * changes in docstrings and copyright headers
      
      * clean pipeline
      
      * further cleaning
      
      * remove peft and lora and fromoriginalmodel
      
      * Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace
      
      * make style
      
      * dummy models
      
      * fix copied from
      
      * add fast oobleck tests
      
      * add brownian tree
      
      * oobleck autoencoder slow tests
      
      * remove TODO
      
      * fast stable audio pipeline tests
      
      * add slow tests
      
      * make style
      
      * add first version of docs
      
      * wrap is_torchsde_available to the scheduler
      
      * fix slow test
      
      * test with input waveform
      
      * add input waveform
      
      * remove some todos
      
      * create stableaudio gaussian projection + make style
      
      * add pipeline to toctree
      
      * fix copied from
      
      * make quality
      
      * refactor timestep_features->time_proj
      
      * refactor joint_attention_kwargs->cross_attention_kwargs
      
      * remove forward_chunk
      
      * move StableAudioDitModel to transformers folder
      
      * correct convert + remove partial rotary embed
      
      * apply suggestions from yiyixuxu -> removing attn.kv_heads
      
      * remove temb
      
      * remove cross_attention_kwargs
      
      * further removal of cross_attention_kwargs
      
      * remove text encoder autocast to fp16
      
      * continue removing autocast
      
      * make style
      
      * refactor how text and audio are embedded
      
      * add paper
      
      * update example code
      
      * make style
      
      * unify projection model forward + fix device placement
      
      * make style
      
      * remove fuse qkv
      
      * apply suggestions from review
      
      * Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      
      * make style
      
      * smaller models in fast tests
      
      * pass sequential offloading fast tests
      
      * add docs for vae and autoencoder
      
      * make style and update example
      
      * remove useless import
      
      * add cosine scheduler
      
      * dummy classes
      
      * cosine scheduler docs
      
      * better description of scheduler
      
      ---------
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
    • [LoRA] fix: animate diff lora stuff. (#8995) · 8c4856cd
      Sayak Paul authored
      * fix: animate diff lora stuff.
      
      * fix scaling function for UNetMotionModel
      
      * empty
    • handle lora scale and clip skip in lpw sd and sdxl community pipelines (#8988) · f240a936
      Anatoly Belikov authored
      
      
      * handle lora scale and clip skip in lpw sd and sdxl
      
      * use StableDiffusionLoraLoaderMixin
      
      * use StableDiffusionXLLoraLoaderMixin
      
      * style
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
  3. 29 Jul, 2024 2 commits
  4. 26 Jul, 2024 9 commits
  5. 25 Jul, 2024 10 commits
  6. 24 Jul, 2024 4 commits
  7. 23 Jul, 2024 6 commits