1. 17 Oct, 2025 1 commit
  2. 11 Sep, 2025 1 commit
  3. 03 Sep, 2025 1 commit
  4. 25 Aug, 2025 1 commit
  5. 11 Aug, 2025 1 commit
  6. 29 Jul, 2025 1 commit
  7. 19 Jun, 2025 1 commit
  8. 30 May, 2025 1 commit
  9. 19 May, 2025 1 commit
  10. 13 May, 2025 1 commit
  11. 08 May, 2025 1 commit
  12. 01 May, 2025 1 commit
  13. 24 Apr, 2025 1 commit
  14. 09 Apr, 2025 1 commit
  15. 04 Apr, 2025 1 commit
  16. 23 Mar, 2025 1 commit
    • Update README and example code for AnyText usage (#11028) · 0213179b
      Tolga Cangöz authored
      * [Documentation] Update README and example code with additional usage instructions for AnyText
      
      * [Documentation] Update README for AnyTextPipeline and improve logging in code
      
      * Remove wget command for font file from example docstring in anytext.py
  17. 19 Mar, 2025 1 commit
  18. 18 Mar, 2025 1 commit
  19. 14 Mar, 2025 1 commit
  20. 13 Mar, 2025 1 commit
  21. 10 Mar, 2025 1 commit
    • [`Research Project`] Add AnyText: Multilingual Visual Text Generation And Editing (#8998) · b88fef47
      Tolga Cangöz authored
      * Add initial template
      
      * Second template
      
      * feat: Add TextEmbeddingModule to AnyTextPipeline
      
      * feat: Add AuxiliaryLatentModule template to AnyTextPipeline
      
      * Add bert tokenizer from the anytext repo for now
      
      * feat: Update AnyTextPipeline's modify_prompt method
      
      This commit adds improvements to the modify_prompt method in the AnyTextPipeline class. The method now handles special characters and replaces selected string prompts with a placeholder. Additionally, it includes a check for Chinese text and translation using the trans_pipe.
      
      * Fill in the `forward` pass of `AuxiliaryLatentModule`
      
      * `make style && make quality`
      
      * `chore: Update bert_tokenizer.py with a TODO comment suggesting the use of the transformers library`
      
      * Update error handling to raise and logging
      
      * Add `create_glyph_lines` function into `TextEmbeddingModule`
      
      * make style
      
      * Up
      
      * Up
      
      * Up
      
      * Up
      
      * Remove several comments
      
      * refactor: Remove ControlNetConditioningEmbedding and update code accordingly
      
      * Up
      
      * Up
      
      * up
      
      * refactor: Update AnyTextPipeline to include new optional parameters
      
      * up
      
      * feat: Add OCR model and its components
      
      * chore: Update `TextEmbeddingModule` to include OCR model components and dependencies
      
      * chore: Update `AuxiliaryLatentModule` to include VAE model and its dependencies for masked image in the editing task
      
      * `make style`
      
      * refactor: Update `AnyTextPipeline`'s docstring
      
      * Update `AuxiliaryLatentModule` to include info dictionary so that text processing is done once
      
      * simplify
      
      * `make style`
      
      * Converting `TextEmbeddingModule` to ordinary `encode_prompt()` function
      
      * Simplify for now
      
      * `make style`
      
      * Up
      
      * feat: Add scripts to convert AnyText controlnet to diffusers
      
      * `make style`
      
      * Fix: Move glyph rendering to `TextEmbeddingModule` from `AuxiliaryLatentModule`
      
      * make style
      
      * Up
      
      * Simplify
      
      * Up
      
      * feat: Add safetensors module for loading model file
      
      * Fix device issues
      
      * Up
      
      * Up
      
      * refactor: Simplify
      
      * refactor: Simplify code for loading models and handling data types
      
      * `make style`
      
      * refactor: Update to() method in FrozenCLIPEmbedderT3 and TextEmbeddingModule
      
      * refactor: Update dtype in embedding_manager.py to match proj.weight
      
      * Up
      
      * Add attribution and adaptation information to pipeline_anytext.py
      
      * Update usage example
      
      * Will refactor `controlnet_cond_embedding` initialization
      
      * Add `AnyTextControlNetConditioningEmbedding` template
      
      * Refactor organization
      
      * style
      
      * style
      
      * Move custom blocks from `AuxiliaryLatentModule` to `AnyTextControlNetConditioningEmbedding`
      
      * Follow one-file policy
      
      * style
      
      * [Docs] Update README and pipeline_anytext.py to use AnyTextControlNetModel
      
      * [Docs] Update import statement for AnyTextControlNetModel in pipeline_anytext.py
      
      * [Fix] Update import path for ControlNetModel, ControlNetOutput in anytext_controlnet.py
      
      * Refactor AnyTextControlNet to use configurable conditioning embedding channels
      
      * Complete control net conditioning embedding in AnyTextControlNetModel
      
      * up
      
      * [FIX] Ensure embeddings use correct device in AnyTextControlNetModel
      
      * up
      
      * up
      
      * style
      
      * [UPDATE] Revise README and example code for AnyTextPipeline integration with DiffusionPipeline
      
      * [UPDATE] Update example code in anytext.py to use correct font file and improve clarity
      
      * down
      
      * [UPDATE] Refactor BasicTokenizer usage to a new Checker class for text processing
      
      * update pillow
      
      * [UPDATE] Remove commented-out code and unnecessary docstring in anytext.py and anytext_controlnet.py for improved clarity
      
      * [REMOVE] Delete frozen_clip_embedder_t3.py as it is in the anytext.py file
      
      * [UPDATE] Replace edict with dict for configuration in anytext.py and RecModel.py for consistency
      
      * 🆙
      
      * style
      
      * [UPDATE] Revise README.md for clarity, remove unused imports in anytext.py, and add author credits in anytext_controlnet.py
      
      * style
      
      * Update examples/research_projects/anytext/README.md
      Co-authored-by: Aryan <contact.aryanvs@gmail.com>
      
      * Remove commented-out image preparation code in AnyTextPipeline
      
      * Remove unnecessary blank line in README.md
  22. 06 Mar, 2025 1 commit
  23. 28 Jan, 2025 1 commit
    • Refactor gradient checkpointing (#10611) · c4d4ac21
      Aryan authored
      * update
      
      * remove unused fn
      
      * apply suggestions based on review
      
      * update + cleanup 🧹
      
      * more cleanup 🧹
      
      * make fix-copies
      
      * update test
  24. 27 Jan, 2025 1 commit
    • create a script to train autoencoderkl (#10605) · 4fa24591
      Yuqian Hong authored
      * create a script to train vae
      
      * update main.py
      
      * update train_autoencoderkl.py
      
      * update train_autoencoderkl.py
      
      * add a check of --pretrained_model_name_or_path and --model_config_name_or_path
      
      * remove the comment, remove diffusers in requirements.txt, add validation_image note
      
      * update autoencoderkl.py
      
      * quality
      
      ---------
      Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
  25. 21 Jan, 2025 1 commit
  26. 20 Jan, 2025 1 commit
  27. 16 Jan, 2025 1 commit
  28. 13 Jan, 2025 1 commit
  29. 07 Jan, 2025 3 commits
  30. 06 Dec, 2024 1 commit
  31. 19 Nov, 2024 1 commit
  32. 16 Nov, 2024 1 commit
  33. 08 Nov, 2024 1 commit
  34. 07 Nov, 2024 1 commit
    • [Core] introduce `controlnet` module (#8768) · ded3db16
      Sayak Paul authored
      * move vae flax module.
      
      * controlnet module.
      
      * prepare for PR.
      
      * revert a commit
      
      * gracefully deprecate controlnet deps.
      
      * fix
      
      * fix doc path
      
      * fix-copies
      
      * fix path
      
      * style
      
      * style
      
      * conflicts
      
      * fix
      
      * fix-copies
      
      * sparsectrl.
      
      * updates
      
      * fix
      
      * updates
      
      * updates
      
      * updates
      
      * fix
      
      ---------
      Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
  35. 31 Oct, 2024 2 commits
  36. 28 Oct, 2024 1 commit
  37. 26 Oct, 2024 1 commit