1. 09 Apr, 2025 1 commit
  2. 08 Apr, 2025 3 commits
  3. 05 Apr, 2025 1 commit
  4. 04 Apr, 2025 1 commit
  5. 03 Apr, 2025 1 commit
  6. 02 Apr, 2025 1 commit
  7. 31 Mar, 2025 1 commit
  8. 23 Mar, 2025 1 commit
    • Update README and example code for AnyText usage (#11028) · 0213179b
      Tolga Cangöz authored
      * [Documentation] Update README and example code with additional usage instructions for AnyText
      
      * [Documentation] Update README for AnyTextPipeline and improve logging in code
      
      * Remove wget command for font file from example docstring in anytext.py
  9. 20 Mar, 2025 1 commit
  10. 19 Mar, 2025 1 commit
  11. 18 Mar, 2025 1 commit
  12. 15 Mar, 2025 1 commit
  13. 14 Mar, 2025 2 commits
  14. 13 Mar, 2025 1 commit
  15. 11 Mar, 2025 2 commits
  16. 10 Mar, 2025 1 commit
    • [`Research Project`] Add AnyText: Multilingual Visual Text Generation And Editing (#8998) · b88fef47
      Tolga Cangöz authored
      * Add initial template
      
      * Second template
      
      * feat: Add TextEmbeddingModule to AnyTextPipeline
      
      * feat: Add AuxiliaryLatentModule template to AnyTextPipeline
      
      * Add bert tokenizer from the anytext repo for now
      
      * feat: Update AnyTextPipeline's modify_prompt method
      
      This commit adds improvements to the modify_prompt method in the AnyTextPipeline class. The method now handles special characters and replaces selected string prompts with a placeholder. Additionally, it includes a check for Chinese text and translation using the trans_pipe.
      
      * Fill in the `forward` pass of `AuxiliaryLatentModule`
      
      * `make style && make quality`
      
      * `chore: Update bert_tokenizer.py with a TODO comment suggesting the use of the transformers library`
      
      * Update error handling to raise and logging
      
      * Add `create_glyph_lines` function into `TextEmbeddingModule`
      
      * make style
      
      * Up
      
      * Up
      
      * Up
      
      * Up
      
      * Remove several comments
      
      * refactor: Remove ControlNetConditioningEmbedding and update code accordingly
      
      * Up
      
      * Up
      
      * up
      
      * refactor: Update AnyTextPipeline to include new optional parameters
      
      * up
      
      * feat: Add OCR model and its components
      
      * chore: Update `TextEmbeddingModule` to include OCR model components and dependencies
      
      * chore: Update `AuxiliaryLatentModule` to include VAE model and its dependencies for masked image in the editing task
      
      * `make style`
      
      * refactor: Update `AnyTextPipeline`'s docstring
      
      * Update `AuxiliaryLatentModule` to include info dictionary so that text processing is done once
      
      * simplify
      
      * `make style`
      
      * Converting `TextEmbeddingModule` to ordinary `encode_prompt()` function
      
      * Simplify for now
      
      * `make style`
      
      * Up
      
      * feat: Add scripts to convert AnyText controlnet to diffusers
      
      * `make style`
      
      * Fix: Move glyph rendering to `TextEmbeddingModule` from `AuxiliaryLatentModule`
      
      * make style
      
      * Up
      
      * Simplify
      
      * Up
      
      * feat: Add safetensors module for loading model file
      
      * Fix device issues
      
      * Up
      
      * Up
      
      * refactor: Simplify
      
      * refactor: Simplify code for loading models and handling data types
      
      * `make style`
      
      * refactor: Update to() method in FrozenCLIPEmbedderT3 and TextEmbeddingModule
      
      * refactor: Update dtype in embedding_manager.py to match proj.weight
      
      * Up
      
      * Add attribution and adaptation information to pipeline_anytext.py
      
      * Update usage example
      
      * Will refactor `controlnet_cond_embedding` initialization
      
      * Add `AnyTextControlNetConditioningEmbedding` template
      
      * Refactor organization
      
      * style
      
      * style
      
      * Move custom blocks from `AuxiliaryLatentModule` to `AnyTextControlNetConditioningEmbedding`
      
      * Follow one-file policy
      
      * style
      
      * [Docs] Update README and pipeline_anytext.py to use AnyTextControlNetModel
      
      * [Docs] Update import statement for AnyTextControlNetModel in pipeline_anytext.py
      
      * [Fix] Update import path for ControlNetModel, ControlNetOutput in anytext_controlnet.py
      
      * Refactor AnyTextControlNet to use configurable conditioning embedding channels
      
      * Complete control net conditioning embedding in AnyTextControlNetModel
      
      * up
      
      * [FIX] Ensure embeddings use correct device in AnyTextControlNetModel
      
      * up
      
      * up
      
      * style
      
      * [UPDATE] Revise README and example code for AnyTextPipeline integration with DiffusionPipeline
      
      * [UPDATE] Update example code in anytext.py to use correct font file and improve clarity
      
      * down
      
      * [UPDATE] Refactor BasicTokenizer usage to a new Checker class for text processing
      
      * update pillow
      
      * [UPDATE] Remove commented-out code and unnecessary docstring in anytext.py and anytext_controlnet.py for improved clarity
      
      * [REMOVE] Delete frozen_clip_embedder_t3.py as it is in the anytext.py file
      
      * [UPDATE] Replace edict with dict for configuration in anytext.py and RecModel.py for consistency
      
      * 🆙
      
      
      
      * style
      
      * [UPDATE] Revise README.md for clarity, remove unused imports in anytext.py, and add author credits in anytext_controlnet.py
      
      * style
      
      * Update examples/research_projects/anytext/README.md
      Co-authored-by: Aryan <contact.aryanvs@gmail.com>
      
      * Remove commented-out image preparation code in AnyTextPipeline
      
      * Remove unnecessary blank line in README.md
  17. 07 Mar, 2025 1 commit
    • Add STG to community pipelines (#10960) · b38450d5
      Kinam Kim authored
      
      
      * Support STG for video pipelines
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update README.md
      
      * Update pipeline_stg_cogvideox.py
      
      * Update pipeline_stg_hunyuan_video.py
      
      * Update pipeline_stg_ltx.py
      
      * Update pipeline_stg_ltx_image2video.py
      
      * Update pipeline_stg_mochi.py
      
      * Update pipeline_stg_hunyuan_video.py
      
      * Update pipeline_stg_ltx.py
      
      * Update pipeline_stg_ltx_image2video.py
      
      * Update pipeline_stg_mochi.py
      
      * update
      
      * remove rescaling
      
      * Apply style fixes
      
      ---------
      Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
  18. 06 Mar, 2025 3 commits
  19. 05 Mar, 2025 1 commit
  20. 04 Mar, 2025 2 commits
  21. 24 Feb, 2025 2 commits
  22. 20 Feb, 2025 2 commits
  23. 18 Feb, 2025 1 commit
    • Fix max_shift value in flux and related functions to 1.15 (issue #10675) (#10807) · b75b204a
      puhuk authored
      This PR updates the max_shift value in flux to 1.15 for consistency across the codebase. In addition to modifying max_shift in flux, all related functions that copy and use this logic, such as calculate_shift in `src/diffusers/pipelines/stable_diffusion_3/pipeline_stable_diffusion_3_img2img.py`, have also been updated to ensure uniform behavior.
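      The shift computation this commit refers to can be sketched as a linear interpolation between a base shift and `max_shift` over the image sequence length. This is a minimal sketch, not the repository's code: only the `max_shift` value of 1.15 and the function name `calculate_shift` come from the commit message, while the other parameter names and defaults (`base_seq_len`, `max_seq_len`, `base_shift`) are assumptions for illustration.

```python
def calculate_shift(
    image_seq_len: int,
    base_seq_len: int = 256,    # assumed default, not stated in the commit message
    max_seq_len: int = 4096,    # assumed default, not stated in the commit message
    base_shift: float = 0.5,    # assumed default, not stated in the commit message
    max_shift: float = 1.15,    # the value this commit standardizes on
) -> float:
    """Linearly interpolate the shift between base_shift and max_shift."""
    # Slope and intercept of the line through
    # (base_seq_len, base_shift) and (max_seq_len, max_shift).
    m = (max_shift - base_shift) / (max_seq_len - base_seq_len)
    b = base_shift - m * base_seq_len
    return image_seq_len * m + b


if __name__ == "__main__":
    # The shift grows with sequence length and reaches max_shift at max_seq_len.
    for seq_len in (256, 1024, 4096):
        print(seq_len, round(calculate_shift(seq_len), 4))
```

      Under these assumptions, a copy of the function that keeps a different `max_shift` default would produce a different shift for the same sequence length, which is the inconsistency the commit describes fixing.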
  24. 16 Feb, 2025 1 commit
  25. 12 Feb, 2025 1 commit
    • fix: [Community pipeline] Fix flattened elements on image (#10774) · 051ebc3c
      Eliseu Silva authored
      * feat: new community mixture_tiling_sdxl pipeline for SDXL mixture-of-diffusers support
      
      * fix use of variable latents to tile_latents
      
      * removed references to modules that are not being used in this pipeline
      
      * make style, make quality
      
      * fixfeat: added _get_crops_coords_list function to pipeline to automatically define ctop,cleft coord to focus on image generation, helps to better harmonize the image and corrects the problem of flattened elements.
  26. 11 Feb, 2025 1 commit
  27. 06 Feb, 2025 2 commits
    • [bugfix] NPU Adaption for Sana (#10724) · cd0a4a82
      Leo Jiang authored
      
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * NPU Adaption for Sanna
      
      * [bugfix]NPU Adaption for Sanna
      
      ---------
      Co-authored-by: J石页 <jiangshuo9@h-partners.com>
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
    • [Community] Enhanced `Model Search` (#10417) · 145522cb
      suzukimain authored
      * Added `auto_load_textual_inversion` and `auto_load_lora_weights`
      
      * update README.md
      
      * fix
      
      * make quality
      
      * Fix and `make style`
  28. 04 Feb, 2025 2 commits
  29. 31 Jan, 2025 1 commit