Commits · ccc8321651ebb879f70e563274b2d03c84c18f2f · renzhc / diffusers_dcu

13 Mar, 2025 3 commits

Fix aclnnRepeatInterleaveIntWithDim error on NPU for get_1d_rotary_pos_embed (#10820) · ccc83216

ZhengKai91 authored Mar 14, 2025



* get_1d_rotary_pos_embed support npu

* Update src/diffusers/models/embeddings.py

---------
Co-authored-by: Kai zheng <kaizheng@KaideMacBook-Pro.local>
Co-authored-by: hlky <hlky@hlky.ac>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

ccc83216

making ```formatted_images``` initialization compact (#10801) · 5e48cd27

Yaniv Galron authored Mar 13, 2025



compact writing
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

5e48cd27

Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline (#10827) · 5551506b

hlky authored Mar 13, 2025



* Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline


---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>

5551506b

12 Mar, 2025 5 commits
- [LoRA] change to warning from info when notifying the users about a LoRA no-op (#11044) · 20e4b6a6
  Sayak Paul authored Mar 12, 2025
```
* move to warning.

* test related changes.
```
  20e4b6a6
- Wan Pipeline scaling fix, type hint warning, multi generator fix (#11007) · 4ea9f89b
  hlky authored Mar 12, 2025
```
* Wan Pipeline scaling fix, type hint warning, multi generator fix

* Apply suggestions from code review
```
  4ea9f89b
- [hybrid inference 🍯🐝] Add VAE encode (#11017) · 733b44ac
  hlky authored Mar 12, 2025
```
* [hybrid inference 🍯🐝

] Add VAE encode

* _toctree: add vae encode

* Add endpoints, tests

* vae_encode docs

* vae encode benchmarks

* api reference

* changelog

* Update docs/source/en/hybrid_inference/overview.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  733b44ac
- Use `output_size` in `repeat_interleave` (#11030) · 8b4f8ba7
  hlky authored Mar 12, 2025
  
  8b4f8ba7
- [Refactor] Clean up import utils boilerplate (#11026) · 54280464
  Dhruv Nair authored Mar 12, 2025
```
* update

* update

* update
```
  54280464
11 Mar, 2025 7 commits
- Fix for multi-GPU WAN inference (#10997) · e7ffeae0
  39th president of the United States, probably authored Mar 11, 2025
```
Ensure that hidden_state and shift/scale are on the same device when running with multiple GPUs

Co-authored-by: Jimmy <39@🇺🇸.com>
```
  e7ffeae0
- Fix missing **kwargs in lora_pipeline.py (#11011) · d87ce2ce
  CyberVy authored Mar 12, 2025
```
* Update lora_pipeline.py

* Apply style fixes

* fix-copies

---------
Co-authored-by: hlky <hlky@hlky.ac>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
```
  d87ce2ce
- chore: fix help messages in advanced diffusion examples (#10923) · 36d0553a
  wonderfan authored Mar 12, 2025
  
  36d0553a
- Fix SD3 IPAdapter feature extractor (#11027) · 7e0db46f
  hlky authored Mar 11, 2025
  
  7e0db46f
- [LoRA] support wan i2v loras from the world. (#11025) · e4b056fe
  Sayak Paul authored Mar 11, 2025
```
* support wan i2v loras from the world.

* remove copied from.

* upates

* add lora.
```
  e4b056fe
- fix: mixture tiling sdxl pipeline - adjust gerating time_ids & embeddings (#11012) · 4e3ddd5a
  Eliseu Silva authored Mar 11, 2025
```
small fix on generating time_ids & embeddings
```
  4e3ddd5a
- [Quantization] Allow loading TorchAO serialized Tensor objects with torch>=2.6 (#11018) · 9add0715
  Dhruv Nair authored Mar 11, 2025
```
* update

* update

* update

* update

* update

* update

* update

* update

* update
```
  9add0715
10 Mar, 2025 7 commits

[`Research Project`] Add AnyText: Multilingual Visual Text Generation And Editing (#8998) · b88fef47

Tolga Cangöz authored Mar 10, 2025

* Add initial template

* Second template

* feat: Add TextEmbeddingModule to AnyTextPipeline

* feat: Add AuxiliaryLatentModule template to AnyTextPipeline

* Add bert tokenizer from the anytext repo for now

* feat: Update AnyTextPipeline's modify_prompt method

This commit adds improvements to the modify_prompt method in the AnyTextPipeline class. The method now handles special characters and replaces selected string prompts with a placeholder. Additionally, it includes a check for Chinese text and translation using the trans_pipe.

* Fill in the `forward` pass of `AuxiliaryLatentModule`

* `make style && make quality`

* `chore: Update bert_tokenizer.py with a TODO comment suggesting the use of the transformers library`

* Update error handling to raise and logging

* Add `create_glyph_lines` function into `TextEmbeddingModule`

* make style

* Up

* Up

* Up

* Up

* Remove several comments

* refactor: Remove ControlNetConditioningEmbedding and update code accordingly

* Up

* Up

* up

* refactor: Update AnyTextPipeline to include new optional parameters

* up

* feat: Add OCR model and its components

* chore: Update `TextEmbeddingModule` to include OCR model components and dependencies

* chore: Update `AuxiliaryLatentModule` to include VAE model and its dependencies for masked image in the editing task

* `make style`

* refactor: Update `AnyTextPipeline`'s docstring

* Update `AuxiliaryLatentModule` to include info dictionary so that text processing is done once

* simplify

* `make style`

* Converting `TextEmbeddingModule` to ordinary `encode_prompt()` function

* Simplify for now

* `make style`

* Up

* feat: Add scripts to convert AnyText controlnet to diffusers

* `make style`

* Fix: Move glyph rendering to `TextEmbeddingModule` from `AuxiliaryLatentModule`

* make style

* Up

* Simplify

* Up

* feat: Add safetensors module for loading model file

* Fix device issues

* Up

* Up

* refactor: Simplify

* refactor: Simplify code for loading models and handling data types

* `make style`

* refactor: Update to() method in FrozenCLIPEmbedderT3 and TextEmbeddingModule

* refactor: Update dtype in embedding_manager.py to match proj.weight

* Up

* Add attribution and adaptation information to pipeline_anytext.py

* Update usage example

* Will refactor `controlnet_cond_embedding` initialization

* Add `AnyTextControlNetConditioningEmbedding` template

* Refactor organization

* style

* style

* Move custom blocks from `AuxiliaryLatentModule` to `AnyTextControlNetConditioningEmbedding`

* Follow one-file policy

* style

* [Docs] Update README and pipeline_anytext.py to use AnyTextControlNetModel

* [Docs] Update import statement for AnyTextControlNetModel in pipeline_anytext.py

* [Fix] Update import path for ControlNetModel, ControlNetOutput in anytext_controlnet.py

* Refactor AnyTextControlNet to use configurable conditioning embedding channels

* Complete control net conditioning embedding in AnyTextControlNetModel

* up

* [FIX] Ensure embeddings use correct device in AnyTextControlNetModel

* up

* up

* style

* [UPDATE] Revise README and example code for AnyTextPipeline integration with DiffusionPipeline

* [UPDATE] Update example code in anytext.py to use correct font file and improve clarity

* down

* [UPDATE] Refactor BasicTokenizer usage to a new Checker class for text processing

* update pillow

* [UPDATE] Remove commented-out code and unnecessary docstring in anytext.py and anytext_controlnet.py for improved clarity

* [REMOVE] Delete frozen_clip_embedder_t3.py as it is in the anytext.py file

* [UPDATE] Replace edict with dict for configuration in anytext.py and RecModel.py for consistency

* 🆙



* style

* [UPDATE] Revise README.md for clarity, remove unused imports in anytext.py, and add author credits in anytext_controlnet.py

* style

* Update examples/research_projects/anytext/README.md
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Remove commented-out image preparation code in AnyTextPipeline

* Remove unnecessary blank line in README.md

b88fef47

[Tests] improve quantization tests by additionally measuring the inference memory savings (#11021) · e7e6d852
Sayak Paul authored Mar 10, 2025
```
* memory usage tests

* fixes

* gguf
```
e7e6d852
[LoRA] CogView4 (#10981) · 8eefed65
Aryan authored Mar 10, 2025
```
* update

* make fix-copies

* update
```
8eefed65

[LoRA] Improve warning messages when LoRA loading becomes a no-op (#10187) · 26149c0e

Sayak Paul authored Mar 10, 2025



* updates

* updates

* updates

* updates

* notebooks revert

* fix-copies.

* seeing

* fix

* revert

* fixes

* fixes

* fixes

* remove print

* fix

* conflicts ii.

* updates

* fixes

* better filtering of prefix.

---------
Co-authored-by: hlky <hlky@hlky.ac>

26149c0e

[Single File] Add single file loading for SANA Transformer (#10947) · 0703ce88

Ishan Modi authored Mar 10, 2025



* added support for from_single_file

* added diffusers mapping script

* added testcase

* bug fix

* updated tests

* corrected code quality

* corrected code quality

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

0703ce88

[Quantization] Add Quanto backend (#10756) · f5edaa78

Dhruv Nair authored Mar 10, 2025



* update

* updaet

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* Update docs/source/en/quantization/quanto.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* Update src/diffusers/quantizers/quanto/utils.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* update

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

f5edaa78

Fix for fetching variants only (#10646) · 9a1810f0

Dhruv Nair authored Mar 10, 2025

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

9a1810f0

08 Mar, 2025 1 commit
- [LoRA] Improve copied from comments in the LoRA loader classes (#10995) · 1fddee21
  Sayak Paul authored Mar 08, 2025
```
* more sanity of mind with copied from ...

* better

* better
```
  1fddee21
07 Mar, 2025 7 commits

Add STG to community pipelines (#10960) · b38450d5

Kinam Kim authored Mar 08, 2025



* Support STG for video pipelines

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update pipeline_stg_cogvideox.py

* Update pipeline_stg_hunyuan_video.py

* Update pipeline_stg_ltx.py

* Update pipeline_stg_ltx_image2video.py

* Update pipeline_stg_mochi.py

* Update pipeline_stg_hunyuan_video.py

* Update pipeline_stg_ltx.py

* Update pipeline_stg_ltx_image2video.py

* Update pipeline_stg_mochi.py

* update

* remove rescaling

* Apply style fixes

---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

b38450d5

[Single File] Add single file support for Wan T2V/I2V (#10991) · 1357931d
Dhruv Nair authored Mar 07, 2025
```
* update

* update

* update

* update

* update

* update

* update
```
1357931d
[LoRA] remove full key prefix from peft. (#11004) · a2d3d6af
Sayak Paul authored Mar 07, 2025
```
remove full key prefix from peft.
```
a2d3d6af
Wan VAE move scaling to pipeline (#10998) · 363d1ab7
hlky authored Mar 07, 2025

363d1ab7

Fix Graph Breaks When Compiling CogView4 (#10959) · 6a0137eb

C authored Mar 07, 2025



* Fix Graph Breaks When Compiling CogView4

Eliminate this:

```
t]V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles] Recompiling function forward in /home/zeyi/repos/diffusers/src/diffusers/models/transformers/transformer_cogview4.py:374
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles]     triggered by the following guard failure(s):
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles]     - 0/3: ___check_obj_id(L['self'].rope.freqs_h, 139976127328032)    
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles]     - 0/2: ___check_obj_id(L['self'].rope.freqs_h, 139976107780960)    
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles]     - 0/1: ___check_obj_id(L['self'].rope.freqs_h, 140022511848960)    
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles]     - 0/0: ___check_obj_id(L['self'].rope.freqs_h, 140024081342416)   
```

* Update transformer_cogview4.py

* fix cogview4 rotary pos embed

* Apply style fixes

---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

6a0137eb

Hunyuan I2V (#10983) · 2e5203be

Aryan authored Mar 07, 2025

* update

* update

* update

* add tests

* update

* add model tests

* update docs

* update

* update example

* fix defaults

* update

2e5203be

fix wan i2v pipeline bugs (#10975) · d55f4110

yupeng1111 authored Mar 07, 2025



* fix wan i2v pipeline bugs

---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

d55f4110

06 Mar, 2025 9 commits

Add CogVideoX DDIM Inversion to Community Pipelines (#10956) · 748cb0fa

LittleNyima authored Mar 07, 2025



* add cogvideox ddim inversion script

* implement as a pipeline, and add documentation

---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>

748cb0fa

[Single File] Add user agent to SF download requests. (#10979) · 790a909b
Dhruv Nair authored Mar 07, 2025
```
update
```
790a909b

Fix Flux Controlnet Pipeline _callback_tensor_inputs Missing Some Elements (#10974) · 54ab4753

CyberVy authored Mar 07, 2025

* Update pipeline_flux_controlnet.py

* Update pipeline_flux_controlnet_image_to_image.py

* Update pipeline_flux_controlnet_inpainting.py

* Update pipeline_flux_controlnet_inpainting.py

* Update pipeline_flux_controlnet_inpainting.py

54ab4753

Bump jinja2 from 3.1.5 to 3.1.6 in /examples/research_projects/realfill (#10984) · f1039930

dependabot[bot] authored Mar 06, 2025

Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.5 to 3.1.6.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/3.1.5...3.1.6

)

---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

f1039930

[CI] remove synchornized. (#10980) · 1be02025
Sayak Paul authored Mar 06, 2025
```
removed synchornized.
```
1be02025
fix default values of Flux guidance_scale in docstrings (#10982) · ea81a422
Pierre Chapuis authored Mar 06, 2025

ea81a422
Fix loading OneTrainer Flux LoRA (#10978) · b1502763
hlky authored Mar 06, 2025
```
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
b1502763
[tests] fix tests for save load components (#10977) · 6e2a93de
Sayak Paul authored Mar 06, 2025
```
fix tests
```
6e2a93de

[train_dreambooth_lora.py] Fix the LR Schedulers when `num_train_epochs` is... · 37b8edfb

Jun Yeop Na authored Mar 06, 2025

[train_dreambooth_lora.py] Fix the LR Schedulers when `num_train_epochs` is passed in a distributed training env (#10973)

* updated train_dreambooth_lora to fix the LR schedulers for `num_train_epochs` in distributed training env

* fixed formatting

* remove trailing newlines

* fixed style error

37b8edfb

05 Mar, 2025 1 commit
- use style bot GH Action from `huggingface_hub` (#10970) · fbf6b856
  Célina authored Mar 05, 2025
```
use style bot GH action from hfh
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  fbf6b856