1. 07 Aug, 2024 5 commits
    • latentCall145's avatar
      Flux fp16 inference fix (#9097) · 9b5180cb
      latentCall145 authored
      
      
      * clipping for fp16
      
      * fix typo
      
      * added fp16 inference to docs
      
      * fix docs typo
      
      * include link for fp16 investigation
      
      ---------
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      9b5180cb
    • Aryan's avatar
      [core] FreeNoise (#8948) · 16a93f1a
      Aryan authored
      
      
      * initial work draft for freenoise; needs massive cleanup
      
      * fix freeinit bug
      
      * add animatediff controlnet implementation
      
      * revert attention changes
      
      * add freenoise
      
      * remove old helper functions
      
      * add decode batch size param to all pipelines
      
      * make style
      
      * fix copied from comments
      
      * make fix-copies
      
      * make style
      
      * copy animatediff controlnet implementation from #8972
      
      * add experimental support for num_frames not perfectly fitting context length, ocntext stride
      
      * make unet motion model lora work again based on #8995
      
      * copy load video utils from #8972
      
      * copied from AnimateDiff::prepare_latents
      
      * address the case where last batch of frames does not match length of indices in prepare latents
      
      * decode_batch_size->vae_batch_size; batch vae encode support in animatediff vid2vid
      
      * revert sparsectrl and sdxl freenoise changes
      
      * revert pia
      
      * add freenoise tests
      
      * make fix-copies
      
      * improve docstrings
      
      * add freenoise tests to animatediff controlnet
      
      * update tests
      
      * Update src/diffusers/models/unets/unet_motion_model.py
      
      * add freenoise to animatediff pag
      
      * address review comments
      
      * make style
      
      * update tests
      
      * make fix-copies
      
      * fix error message
      
      * remove copied from comment
      
      * fix imports in tests
      
      * update
      
      ---------
      Co-authored-by: default avatarDhruv Nair <dhruv.nair@gmail.com>
      16a93f1a
    • Sayak Paul's avatar
      2d753b6f
    • Álvaro Somoza's avatar
      [Kolors] Add PAG (#8934) · 39e1f7ea
      Álvaro Somoza authored
      
      
      * txt2img pag added
      
      * autopipe added, fixed case
      
      * style
      
      * apply suggestions
      
      * added fast tests, added todo tests
      
      * revert dummy objects for kolors
      
      * fix pag dummies
      
      * fix test imports
      
      * update pag tests
      
      * add kolor pag to docs
      
      ---------
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      39e1f7ea
    • Dhruv Nair's avatar
      [Single File] Add single file support for Flux Transformer (#9083) · e1b603dc
      Dhruv Nair authored
      
      
      * update
      
      * update
      
      * update
      
      ---------
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      e1b603dc
  2. 06 Aug, 2024 7 commits
  3. 05 Aug, 2024 7 commits
  4. 04 Aug, 2024 4 commits
  5. 03 Aug, 2024 4 commits
  6. 02 Aug, 2024 3 commits
    • Dhruv Nair's avatar
      Fix Nightly Deps (#9036) · b1f43d71
      Dhruv Nair authored
      update
      b1f43d71
    • Sayak Paul's avatar
      [Flux] allow tests to run (#9050) · 0e460675
      Sayak Paul authored
      * fix tests
      
      * fix
      
      * float64 skip
      
      * remove sample_size.
      
      * remove
      
      * remove more
      
      * default_sample_size.
      
      * credit black forest for flux model.
      
      * skip
      
      * fix: tests
      
      * remove OriginalModelMixin
      
      * add transformer model test
      
      * add: transformer model tests
      0e460675
    • Sayak Paul's avatar
      [Core] Add PAG support for PixArtSigma (#8921) · 7b98c4cc
      Sayak Paul authored
      * feat: add pixart sigma pag.
      
      * inits.
      
      * fixes
      
      * fix
      
      * remove print.
      
      * copy paste methods to the pixart pag mixin
      
      * fix-copies
      
      * add documentation.
      
      * add tests.
      
      * remove correction file.
      
      * remove pag_applied_layers
      
      * empty
      7b98c4cc
  7. 01 Aug, 2024 5 commits
  8. 30 Jul, 2024 5 commits
    • Yoach Lacombe's avatar
      Fix Stable Audio repository id (#9016) · ea1b4ea7
      Yoach Lacombe authored
      Fix Stable Audio repo id
      ea1b4ea7
    • Aryan's avatar
      [core] Move community AnimateDiff ControlNet to core (#8972) · e5b94b4c
      Aryan authored
      
      
      * add animatediff controlnet to core
      
      * make style; remove unused method
      
      * fix copied from comment
      
      * add tests
      
      * changes to make tests work
      
      * add utility function to load videos
      
      * update docs
      
      * update pipeline example
      
      * make style
      
      * update docs with example
      
      * address review comments
      
      * add latest freeinit test from #8969
      
      * LoraLoaderMixin -> StableDiffusionLoraLoaderMixin
      
      * fix docs
      
      * Update src/diffusers/utils/loading_utils.py
      Co-authored-by: default avatarDhruv Nair <dhruv.nair@gmail.com>
      
      * fix: variable out of scope
      
      ---------
      Co-authored-by: default avatarDhruv Nair <dhruv.nair@gmail.com>
      e5b94b4c
    • Yoach Lacombe's avatar
      Stable Audio integration (#8716) · 69e72b1d
      Yoach Lacombe authored
      
      
      * WIP modeling code and pipeline
      
      * add custom attention processor + custom activation + add to init
      
      * correct ProjectionModel forward
      
      * add stable audio to __initèè
      
      * add autoencoder and update pipeline and modeling code
      
      * add half Rope
      
      * add partial rotary v2
      
      * add temporary modfis to scheduler
      
      * add EDM DPM Solver
      
      * remove TODOs
      
      * clean GLU
      
      * remove att.group_norm to attn processor
      
      * revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py
      
      * refactor GLU -> SwiGLU
      
      * remove redundant args
      
      * add channel multiples in autoencoder docstrings
      
      * changes in docsrtings and copyright headers
      
      * clean pipeline
      
      * further cleaning
      
      * remove peft and lora and fromoriginalmodel
      
      * Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace
      
      * make style
      
      * dummy models
      
      * fix copied from
      
      * add fast oobleck tests
      
      * add brownian tree
      
      * oobleck autoencoder slow tests
      
      * remove TODO
      
      * fast stable audio pipeline tests
      
      * add slow tests
      
      * make style
      
      * add first version of docs
      
      * wrap is_torchsde_available to the scheduler
      
      * fix slow test
      
      * test with input waveform
      
      * add input waveform
      
      * remove some todos
      
      * create stableaudio gaussian projection + make style
      
      * add pipeline to toctree
      
      * fix copied from
      
      * make quality
      
      * refactor timestep_features->time_proj
      
      * refactor joint_attention_kwargs->cross_attention_kwargs
      
      * remove forward_chunk
      
      * move StableAudioDitModel to transformers folder
      
      * correct convert + remove partial rotary embed
      
      * apply suggestions from yiyixuxu -> removing attn.kv_heads
      
      * remove temb
      
      * remove cross_attention_kwargs
      
      * further removal of cross_attention_kwargs
      
      * remove text encoder autocast to fp16
      
      * continue removing autocast
      
      * make style
      
      * refactor how text and audio are embedded
      
      * add paper
      
      * update example code
      
      * make style
      
      * unify projection model forward + fix device placement
      
      * make style
      
      * remove fuse qkv
      
      * apply suggestions from review
      
      * Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py
      Co-authored-by: default avatarYiYi Xu <yixu310@gmail.com>
      
      * make style
      
      * smaller models in fast tests
      
      * pass sequential offloading fast tests
      
      * add docs for vae and autoencoder
      
      * make style and update example
      
      * remove useless import
      
      * add cosine scheduler
      
      * dummy classes
      
      * cosine scheduler docs
      
      * better description of scheduler
      
      ---------
      Co-authored-by: default avatarYiYi Xu <yixu310@gmail.com>
      69e72b1d
    • Sayak Paul's avatar
      [LoRA] fix: animate diff lora stuff. (#8995) · 8c4856cd
      Sayak Paul authored
      * fix: animate diff lora stuff.
      
      * fix scaling function for UNetMotionModel
      
      * emoty
      8c4856cd
    • Anatoly Belikov's avatar
      handle lora scale and clip skip in lpw sd and sdxl community pipelines (#8988) · f240a936
      Anatoly Belikov authored
      
      
      * handle lora scale and clip skip in lpw sd and sdxl
      
      * use StableDiffusionLoraLoaderMixin
      
      * use StableDiffusionXLLoraLoaderMixin
      
      * style
      
      ---------
      Co-authored-by: default avatarSayak Paul <spsayakpaul@gmail.com>
      f240a936