  1. 20 Jan, 2025 1 commit
  2. 14 Jan, 2025 1 commit
    • [Sana-4K] (#10537) · 3d707773
      Junsong Chen authored
      
      
      * [Sana 4K]
      add 4K support for Sana
      
      * [Sana-4K] fix SanaPAGPipeline
      
      * add VAE automatically tiling function;
      
      * set clean_caption to False;
      
      * add warnings for VAE OOM.
      
      * style
      
      ---------
      Co-authored-by: yiyixuxu <yixu310@gmail.com>
  3. 12 Jan, 2025 1 commit
  4. 11 Jan, 2025 1 commit
    • [DC-AE] support tiling for DC-AE (#10510) · e7db062e
      Junyu Chen authored
      
      
      * autoencoder_dc tiling
      
      * add tiling and slicing support in SANA pipelines
      
      * create variables for padding length because the line becomes too long
      
      * add tiling and slicing support in pag SANA pipelines
      
      * revert changes to tile size
      
      * make style
      
      * add vae tiling test
      
      ---------
      Co-authored-by: Aryan <aryan@huggingface.co>
  5. 10 Jan, 2025 1 commit
  6. 08 Jan, 2025 2 commits
  7. 07 Jan, 2025 2 commits
  8. 23 Dec, 2024 1 commit
  9. 18 Dec, 2024 2 commits
    • Use `torch` in `get_2d_rotary_pos_embed` (#10155) · 0ac52d6f
      hlky authored
      * Use `torch` in `get_2d_rotary_pos_embed`
      
      * Add deprecation
    • [LoRA] feat: lora support for SANA. (#10234) · 9408aa2d
      Sayak Paul authored
      
      
      * feat: lora support for SANA.
      
      * make fix-copies
      
      * rename test class.
      
      * attention_kwargs -> cross_attention_kwargs.
      
      * Revert "attention_kwargs -> cross_attention_kwargs."
      
      This reverts commit 23433bf9bccc12e0f2f55df26bae58a894e8b43b.
      
      * exhaust 119 max line limit
      
      * sana lora fine-tuning script.
      
      * readme
      
      * add a note about the supported models.
      
      * Apply suggestions from code review
      Co-authored-by: Aryan <aryan@huggingface.co>
      
      * style
      
      * docs for attention_kwargs.
      
      * remove lora_scale from pag pipeline.
      
      * copy fix
      
      ---------
      Co-authored-by: Aryan <aryan@huggingface.co>
  10. 16 Dec, 2024 1 commit
  11. 15 Dec, 2024 1 commit
    • [Sana] Add Sana, including `SanaPipeline`, `SanaPAGPipeline`,... · 5a196e3d
      Junsong Chen authored
      
      [Sana] Add Sana, including `SanaPipeline`, `SanaPAGPipeline`, `LinearAttentionProcessor`, `Flow-based DPM-solver` and so on. (#9982)
      
      * first add a script for DC-AE;
      
      * DC-AE init
      
      * replace triton with custom implementation
      
      * 1. rename file and remove unused code;
      
      * no longer rely on omegaconf and dataclass
      
      * replace custom activation with diffusers activation
      
      * remove dc_ae attention in attention_processor.py
      
      * inherit from ModelMixin
      
      * inherit from ConfigMixin
      
      * dc-ae reduce to one file
      
      * update downsample and upsample
      
      * clean code
      
      * support DecoderOutput
      
      * remove get_same_padding and val2tuple
      
      * remove autocast and some assert
      
      * update ResBlock
      
      * remove contents within super().__init__
      
      * Update src/diffusers/models/autoencoders/dc_ae.py
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      
      * remove opsequential
      
      * update other blocks to support the removal of build_norm
      
      * remove build encoder/decoder project in/out
      
      * remove inheritance of RMSNorm2d from LayerNorm
      
      * remove reset_parameters for RMSNorm2d
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      
      * remove device and dtype in RMSNorm2d __init__
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      
      * Update src/diffusers/models/autoencoders/dc_ae.py
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      
      * Update src/diffusers/models/autoencoders/dc_ae.py
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      
      * Update src/diffusers/models/autoencoders/dc_ae.py
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      
      * remove op_list & build_block
      
      * remove build_stage_main
      
      * change file name to autoencoder_dc
      
      * move LiteMLA to attention.py
      
      * align with other vae decode output;
      
      * add DC-AE into init files;
      
      * update
      
      * make quality && make style;
      
      * quick push before dgx disappears again
      
      * update
      
      * make style
      
      * update
      
      * update
      
      * fix
      
      * refactor
      
      * refactor
      
      * refactor
      
      * update
      
      * possibly change to nn.Linear
      
      * refactor
      
      * make fix-copies
      
      * replace vae with ae
      
      * replace get_block_from_block_type to get_block
      
      * replace downsample_block_type from Conv to conv for consistency
      
      * add scaling factors
      
      * incorporate changes for all checkpoints
      
      * make style
      
      * move mla to attention processor file; split qkv conv to linears
      
      * refactor
      
      * add tests
      
      * from original file loader
      
      * add docs
      
      * add standard autoencoder methods
      
      * combine attention processor
      
      * fix tests
      
      * update
      
      * minor fix
      
      * minor fix
      
      * minor fix & in/out shortcut rename
      
      * minor fix
      
      * make style
      
      * fix paper link
      
      * update docs
      
      * update single file loading
      
      * make style
      
      * remove single file loading support; todo for DN6
      
      * Apply suggestions from code review
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * add abstract
      
      * 1. add DCAE into diffusers;
      2. make style and make quality;
      
      * add DCAE_HF into diffusers;
      
      * bug fixed;
      
      * add SanaPipeline, SanaTransformer2D into diffusers;
      
      * add sanaLinearAttnProcessor2_0;
      
      * first update for SanaTransformer;
      
      * first update for SanaPipeline;
      
      * first success run SanaPipeline;
      
      * model output finally matches the original model given the same input;
      
      * code update;
      
      * code update;
      
      * add a flow dpm-solver script
      
      * 🎉[important update]
      1. Integrate flow-dpm-solver into diffusers;
      2. finally run successfully on both `FlowMatchEulerDiscreteScheduler` and `FlowDPMSolverMultistepScheduler`;
      
      * 🎉🔧
      
      [important update & fix huge bugs!!]
      1. add SanaPAGPipeline & several related Sana linear attention operators;
      2. `SanaTransformer2DModel` now supports multi-resolution input;
      3. fix the multi-scale HW bugs in SanaPipeline and SanaPAGPipeline;
      4. fix the flow-dpm-solver set_timestep() init `model_output` and `lower_order_nums` bugs;
      
      * remove prints;
      
      * add convert sana official checkpoint to diffusers format Safetensor.
      
      * Update src/diffusers/models/transformers/sana_transformer_2d.py
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Update src/diffusers/models/transformers/sana_transformer_2d.py
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Update src/diffusers/models/transformers/sana_transformer_2d.py
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Update src/diffusers/pipelines/pag/pipeline_pag_sana.py
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Update src/diffusers/models/transformers/sana_transformer_2d.py
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Update src/diffusers/models/transformers/sana_transformer_2d.py
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Update src/diffusers/pipelines/sana/pipeline_sana.py
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * Update src/diffusers/pipelines/sana/pipeline_sana.py
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      
      * update Sana for DC-AE's recent commit;
      
      * make style && make quality
      
      * Add StableDiffusion3PAGImg2Img Pipeline + Fix SD3 Unconditional PAG (#9932)
      
      * fix progress bar updates in SD 1.5 PAG Img2Img pipeline
      
      ---------
      Co-authored-by: Vinh H. Pham <phamvinh257@gmail.com>
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      
      * allow the vae to be None in `__init__` of `SanaPipeline`
      
      * Update src/diffusers/models/transformers/sana_transformer_2d.py
      Co-authored-by: hlky <hlky@hlky.ac>
      
      * change the ae related code due to the latest update of DCAE branch;
      
      * change the ae related code due to the latest update of DCAE branch;
      
      * 1. change code based on AutoencoderDC;
      2. fix the bug of new GLUMBConv;
      3. run success;
      
      * update for solving conversation.
      
      * 1. fix bugs and run convert script success;
      2. Downloading ckpt from hub automatically;
      
      * make style && make quality;
      
      * 1. remove unused parameters in init;
      2. code update;
      
      * remove test file
      
      * refactor; add docs; add tests; update conversion script
      
      * make style
      
      * make fix-copies
      
      * refactor
      
      * update pipelines
      
      * pag tests and refactor
      
      * remove sana pag conversion script
      
      * handle weight casting in conversion script
      
      * update conversion script
      
      * add a processor
      
      * 1. add bf16 pth file path;
      2. add complex human instruct in pipeline;
      
      * fix fast tests
      
      * change gemma-2-2b-it ckpt to a non-gated repo;
      
      * fix the pth path bug in conversion script;
      
      * change grad ckpt to original; make style
      
      * fix the complex_human_instruct bug and typo;
      
      * remove dpmsolver flow scheduler
      
      * apply review suggestions
      
      * change the `FlowMatchEulerDiscreteScheduler` to default `DPMSolverMultistepScheduler` with flow matching scheduler.
      
      * fix the tokenizer.padding_side='right' bug;
      
      * update docs
      
      * make fix-copies
      
      * fix imports
      
      * fix docs
      
      * add integration test
      
      * update docs
      
      * update examples
      
      * fix convert_model_output in schedulers
      
      * fix failing tests
      
      ---------
      Co-authored-by: Junyu Chen <chenjydl2003@gmail.com>
      Co-authored-by: YiYi Xu <yixu310@gmail.com>
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
      Co-authored-by: chenjy2003 <70215701+chenjy2003@users.noreply.github.com>
      Co-authored-by: Aryan <aryan@huggingface.co>
      Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
      Co-authored-by: hlky <hlky@hlky.ac>
  12. 12 Dec, 2024 1 commit
  13. 10 Dec, 2024 1 commit
  14. 04 Dec, 2024 1 commit
  15. 03 Dec, 2024 2 commits
  16. 20 Nov, 2024 1 commit
  17. 14 Nov, 2024 1 commit
  18. 21 Oct, 2024 1 commit
  19. 09 Oct, 2024 1 commit
  20. 08 Oct, 2024 1 commit
  21. 03 Oct, 2024 1 commit
  22. 01 Oct, 2024 1 commit
  23. 09 Sep, 2024 1 commit
  24. 28 Aug, 2024 1 commit
    • AnimateDiff prompt travel (#9231) · cbc2ec8f
      Aryan authored
      * update
      
      * implement prompt interpolation
      
      * make style
      
      * resnet memory optimizations
      
      * more memory optimizations; todo: refactor
      
      * update
      
      * update animatediff controlnet with latest changes
      
      * refactor chunked inference changes
      
      * remove print statements
      
      * undo memory optimization changes
      
      * update docstrings
      
      * fix tests
      
      * fix pia tests
      
      * apply suggestions from review
      
      * add tests
      
      * update comment
  25. 21 Aug, 2024 1 commit
  26. 20 Aug, 2024 1 commit
  27. 07 Aug, 2024 3 commits
    • Freenoise change `vae_batch_size` to `decode_chunk_size` (#9110) · e3568d14
      Dhruv Nair authored
      * update
      
      * update
    • [core] FreeNoise (#8948) · 16a93f1a
      Aryan authored
      
      
      * initial work draft for freenoise; needs massive cleanup
      
      * fix freeinit bug
      
      * add animatediff controlnet implementation
      
      * revert attention changes
      
      * add freenoise
      
      * remove old helper functions
      
      * add decode batch size param to all pipelines
      
      * make style
      
      * fix copied from comments
      
      * make fix-copies
      
      * make style
      
      * copy animatediff controlnet implementation from #8972
      
      * add experimental support for num_frames not perfectly fitting context length, context stride
      
      * make unet motion model lora work again based on #8995
      
      * copy load video utils from #8972
      
      * copied from AnimateDiff::prepare_latents
      
      * address the case where last batch of frames does not match length of indices in prepare latents
      
      * decode_batch_size->vae_batch_size; batch vae encode support in animatediff vid2vid
      
      * revert sparsectrl and sdxl freenoise changes
      
      * revert pia
      
      * add freenoise tests
      
      * make fix-copies
      
      * improve docstrings
      
      * add freenoise tests to animatediff controlnet
      
      * update tests
      
      * Update src/diffusers/models/unets/unet_motion_model.py
      
      * add freenoise to animatediff pag
      
      * address review comments
      
      * make style
      
      * update tests
      
      * make fix-copies
      
      * fix error message
      
      * remove copied from comment
      
      * fix imports in tests
      
      * update
      
      ---------
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
    • [Kolors] Add PAG (#8934) · 39e1f7ea
      Álvaro Somoza authored
      
      
      * txt2img pag added
      
      * autopipe added, fixed case
      
      * style
      
      * apply suggestions
      
      * added fast tests, added todo tests
      
      * revert dummy objects for kolors
      
      * fix pag dummies
      
      * fix test imports
      
      * update pag tests
      
      * add kolor pag to docs
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
  28. 06 Aug, 2024 1 commit
  29. 05 Aug, 2024 1 commit
    • PAG variant for HunyuanDiT, PAG refactor (#8936) · b7058d14
      Aryan authored
      
      
      * copy hunyuandit pipeline
      
      * pag variant of hunyuan dit
      
      * add tests
      
      * update docs
      
      * make style
      
      * make fix-copies
      
      * Update src/diffusers/pipelines/pag/pag_utils.py
      
      * remove incorrect copied from
      
      * remove pag hunyuan attn procs to resolve conflicts
      
      * add pag attn procs again
      
      * new implementation for pag_utils
      
      * revert pag changes
      
      * add pag refactor back; update pixart sigma
      
      * update pixart pag tests
      
      * apply suggestions from review
      
      Co-Authored-By: yixu310@gmail.com
      
      * make style
      
      * update docs, fix tests
      
      * fix tests
      
      * fix test_components_function since list not accepted as valid __init__ param
      
      * apply patch to fix broken tests
      Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com>
      
      * make style
      
      * fix hunyuan tests
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
  30. 03 Aug, 2024 1 commit
  31. 02 Aug, 2024 1 commit
    • [Core] Add PAG support for PixArtSigma (#8921) · 7b98c4cc
      Sayak Paul authored
      * feat: add pixart sigma pag.
      
      * inits.
      
      * fixes
      
      * fix
      
      * remove print.
      
      * copy paste methods to the pixart pag mixin
      
      * fix-copies
      
      * add documentation.
      
      * add tests.
      
      * remove correction file.
      
      * remove pag_applied_layers
      
      * empty
  32. 01 Aug, 2024 1 commit
    • PAG variant for AnimateDiff (#8789) · 05b706c0
      Aryan authored
      * add animatediff pag pipeline
      
      * remove unnecessary print
      
      * make fix-copies
      
      * fix ip-adapter bug
      
      * update docs
      
      * add fast tests and fix bugs
      
      * update
      
      * update
      
      * address review comments
      
      * update ip adapter single test expected slice
      
      * implement test_from_pipe_consistent_config; fix expected slice values
      
      * LoraLoaderMixin->StableDiffusionLoraLoaderMixin; add latest freeinit test
  33. 26 Jul, 2024 1 commit
    • [Chore] add `LoraLoaderMixin` to the inits (#8981) · d87fe95f
      Sayak Paul authored
      
      
      * introduce  to promote reusability.
      
      * up
      
      * add more tests
      
      * up
      
      * remove comments.
      
      * fix fuse_nan test
      
      * clarify the scope of fuse_lora and unfuse_lora
      
      * remove space
      
      * rewrite fuse_lora a bit.
      
      * feedback
      
      * copy over load_lora_into_text_encoder.
      
      * address dhruv's feedback.
      
      * fix-copies
      
      * fix issubclass.
      
      * num_fused_loras
      
      * fix
      
      * fix
      
      * remove mapping
      
      * up
      
      * fix
      
      * style
      
      * fix-copies
      
      * change to SD3TransformerLoRALoadersMixin
      
      * Apply suggestions from code review
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
      
      * up
      
      * handle wuerstchen
      
      * up
      
      * move lora to lora_pipeline.py
      
      * up
      
      * fix-copies
      
      * fix documentation.
      
      * comment set_adapters().
      
      * fix-copies
      
      * fix set_adapters() at the model level.
      
      * fix?
      
      * fix
      
      * loraloadermixin.
      
      ---------
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
  34. 25 Jul, 2024 1 commit