Commits · 552cd32058660573c14ac481d88417e828c68756 · renzhc / diffusers_dcu

09 Apr, 2025 1 commit
- fix: SD3 ControlNet validation so that it runs on a A100. (#11238) · fd02aad4
  Sayak Paul authored Apr 09, 2025
```
* fix: SD3 ControlNet validation so that it runs on a A100.

* use backend-agnostic cache and pass devide.
```
  fd02aad4
08 Apr, 2025 3 commits

[Flux LoRA] fix issues in flux lora scripts (#11111) · 71f34fc5

Linoy Tsaban authored Apr 08, 2025



* remove custom scheduler

* update requirements.txt

* log_validation with mixed precision

* add intermediate embeddings saving when checkpointing is enabled

* remove comment

* fix validation

* add unwrap_model for accelerator, torch.no_grad context for validation, fix accelerator.accumulate call in advanced script

* revert unwrap_model change temp

* add .module to address distributed training bug + replace accelerator.unwrap_model with unwrap model

* changes to align advanced script with canonical script

* make changes for distributed training + unify unwrap_model calls in advanced script

* add module.dtype fix to dreambooth script

* unify unwrap_model calls in dreambooth script

* fix condition in validation run

* mixed precision

* Update examples/advanced_diffusion_training/train_dreambooth_lora_flux_advanced.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* smol style change

* change autocast

* Apply style fixes

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

71f34fc5

[Training] Better image interpolation in training scripts (#11206) · 723dbdd3

Álvaro Somoza authored Apr 08, 2025



* initial

* Update examples/dreambooth/train_dreambooth_lora_sdxl.py
Co-authored-by: hlky <hlky@hlky.ac>

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: hlky <hlky@hlky.ac>

723dbdd3

[train_controlnet.py] Fix the LR schedulers when num_train_epochs is passed in... · fbf61f46

Bhavay Malhotra authored Apr 08, 2025


[train_controlnet.py] Fix the LR schedulers when num_train_epochs is passed in a distributed training env (#8461)

* Create diffusers.yml

* fix num_train_epochs

* Delete diffusers.yml

* Fixed Changes

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

fbf61f46

05 Apr, 2025 1 commit

Add Wan with STG as a community pipeline (#11184) · 41afb669

Edna authored Apr 04, 2025



* Add stg wan to community pipelines

* remove debug prints

* remove unused comment

* Update doc

* Add credit + fix typo

* Apply style fixes

---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

41afb669

04 Apr, 2025 1 commit

Fixed requests.get function call by adding timeout parameter. (#11156) · f10775b1

Kenneth Gerald Hamilton authored Apr 04, 2025



* Fixed requests.get function call by adding timeout parameter.

* declare DIFFUSERS_REQUEST_TIMEOUT in constants and import when needed

* remove unneeded os import

* Apply style fixes

---------
Co-authored-by: Sai-Suraj-27 <sai.suraj.27.729@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

f10775b1

03 Apr, 2025 1 commit

[Model Card] standardize advanced diffusion training sdxl lora (#7615) · d9023a67

Abhipsha Das authored Apr 02, 2025



* model card gen code

* push modelcard creation

* remove optional from params

* add import

* add use_dora check

* correct lora var use in tags

* make style && make quality

---------
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

d9023a67

02 Apr, 2025 1 commit
- feat: [Community Pipeline] - FaithDiff Stable Diffusion XL Pipeline (#11188) · c4646a39
  Eliseu Silva authored Apr 02, 2025
```
* feat: [Community Pipeline] - FaithDiff Stable Diffusion XL Pipeline for Image SR.

* added pipeline
```
  c4646a39
31 Mar, 2025 1 commit
- Add `latents_mean` and `latents_std` to `SDXLLongPromptWeightingPipeline` (#11034) · d6f4774c
  hlky authored Mar 31, 2025
  
  d6f4774c
23 Mar, 2025 1 commit

Update README and example code for AnyText usage (#11028) · 0213179b

Tolga Cangöz authored Mar 23, 2025

* [Documentation] Update README and example code with additional usage instructions for AnyText

* [Documentation] Update README for AnyTextPipeline and improve logging in code

* Remove wget command for font file from example docstring in anytext.py

0213179b

20 Mar, 2025 1 commit
- Notebooks for Community Scripts-8 (#11128) · f424b1b0
  Parag Ekbote authored Mar 21, 2025
```
Add 4 Notebooks and update the missing links for the
example README.
```
  f424b1b0
19 Mar, 2025 1 commit
- [BUG] Fix Autoencoderkl train script (#11113) · fc28791f
  Yuqian Hong authored Mar 19, 2025
```
* add disc_optimizer step (not fix)

* support syncbatchnorm in discriminator
```
  fc28791f
18 Mar, 2025 1 commit
- update readme instructions. (#11096) · 27916822
  Juan Acevedo authored Mar 17, 2025
```
Co-authored-by: Juan Acevedo <jfacevedo@google.com>
```
  27916822
15 Mar, 2025 1 commit

CogView4 Control Block (#10809) · 82188cef

Yuxuan Zhang authored Mar 16, 2025




* cogview4 control training


---------
Co-authored-by: OleehyO <leehy0357@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>

82188cef

14 Mar, 2025 2 commits

reverts accidental change that removes attn_mask in attn. Improves fl… (#11065) · 6b9a3334

Juan Acevedo authored Mar 14, 2025



reverts accidental change that removes attn_mask in attn. Improves flux ptxla by using flash block sizes. Moves encoding outside the for loop.
Co-authored-by: Juan Acevedo <jfacevedo@google.com>

6b9a3334

[examples/controlnet/train_controlnet_sd3.py] Fixes #11050 - Cast... · 8ead643b

Andreas Jörg authored Mar 14, 2025


[examples/controlnet/train_controlnet_sd3.py] Fixes #11050 - Cast prompt_embeds and pooled_prompt_embeds to weight_dtype to prevent dtype mismatch (#11051)

Fix: dtype mismatch of prompt embeddings in sd3 controlnet training
Co-authored-by: Andreas Jörg <andreasjoerg@MacBook-Pro-von-Andreas-2.fritz.box>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

8ead643b

13 Mar, 2025 1 commit

making ```formatted_images``` initialization compact (#10801) · 5e48cd27

Yaniv Galron authored Mar 13, 2025



compact writing
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

5e48cd27

11 Mar, 2025 2 commits
- chore: fix help messages in advanced diffusion examples (#10923) · 36d0553a
  wonderfan authored Mar 12, 2025
  
  36d0553a
- fix: mixture tiling sdxl pipeline - adjust gerating time_ids & embeddings (#11012) · 4e3ddd5a
  Eliseu Silva authored Mar 11, 2025
```
small fix on generating time_ids & embeddings
```
  4e3ddd5a
10 Mar, 2025 1 commit

[`Research Project`] Add AnyText: Multilingual Visual Text Generation And Editing (#8998) · b88fef47

Tolga Cangöz authored Mar 10, 2025

* Add initial template

* Second template

* feat: Add TextEmbeddingModule to AnyTextPipeline

* feat: Add AuxiliaryLatentModule template to AnyTextPipeline

* Add bert tokenizer from the anytext repo for now

* feat: Update AnyTextPipeline's modify_prompt method

This commit adds improvements to the modify_prompt method in the AnyTextPipeline class. The method now handles special characters and replaces selected string prompts with a placeholder. Additionally, it includes a check for Chinese text and translation using the trans_pipe.

* Fill in the `forward` pass of `AuxiliaryLatentModule`

* `make style && make quality`

* `chore: Update bert_tokenizer.py with a TODO comment suggesting the use of the transformers library`

* Update error handling to raise and logging

* Add `create_glyph_lines` function into `TextEmbeddingModule`

* make style

* Up

* Up

* Up

* Up

* Remove several comments

* refactor: Remove ControlNetConditioningEmbedding and update code accordingly

* Up

* Up

* up

* refactor: Update AnyTextPipeline to include new optional parameters

* up

* feat: Add OCR model and its components

* chore: Update `TextEmbeddingModule` to include OCR model components and dependencies

* chore: Update `AuxiliaryLatentModule` to include VAE model and its dependencies for masked image in the editing task

* `make style`

* refactor: Update `AnyTextPipeline`'s docstring

* Update `AuxiliaryLatentModule` to include info dictionary so that text processing is done once

* simplify

* `make style`

* Converting `TextEmbeddingModule` to ordinary `encode_prompt()` function

* Simplify for now

* `make style`

* Up

* feat: Add scripts to convert AnyText controlnet to diffusers

* `make style`

* Fix: Move glyph rendering to `TextEmbeddingModule` from `AuxiliaryLatentModule`

* make style

* Up

* Simplify

* Up

* feat: Add safetensors module for loading model file

* Fix device issues

* Up

* Up

* refactor: Simplify

* refactor: Simplify code for loading models and handling data types

* `make style`

* refactor: Update to() method in FrozenCLIPEmbedderT3 and TextEmbeddingModule

* refactor: Update dtype in embedding_manager.py to match proj.weight

* Up

* Add attribution and adaptation information to pipeline_anytext.py

* Update usage example

* Will refactor `controlnet_cond_embedding` initialization

* Add `AnyTextControlNetConditioningEmbedding` template

* Refactor organization

* style

* style

* Move custom blocks from `AuxiliaryLatentModule` to `AnyTextControlNetConditioningEmbedding`

* Follow one-file policy

* style

* [Docs] Update README and pipeline_anytext.py to use AnyTextControlNetModel

* [Docs] Update import statement for AnyTextControlNetModel in pipeline_anytext.py

* [Fix] Update import path for ControlNetModel, ControlNetOutput in anytext_controlnet.py

* Refactor AnyTextControlNet to use configurable conditioning embedding channels

* Complete control net conditioning embedding in AnyTextControlNetModel

* up

* [FIX] Ensure embeddings use correct device in AnyTextControlNetModel

* up

* up

* style

* [UPDATE] Revise README and example code for AnyTextPipeline integration with DiffusionPipeline

* [UPDATE] Update example code in anytext.py to use correct font file and improve clarity

* down

* [UPDATE] Refactor BasicTokenizer usage to a new Checker class for text processing

* update pillow

* [UPDATE] Remove commented-out code and unnecessary docstring in anytext.py and anytext_controlnet.py for improved clarity

* [REMOVE] Delete frozen_clip_embedder_t3.py as it is in the anytext.py file

* [UPDATE] Replace edict with dict for configuration in anytext.py and RecModel.py for consistency

* 🆙



* style

* [UPDATE] Revise README.md for clarity, remove unused imports in anytext.py, and add author credits in anytext_controlnet.py

* style

* Update examples/research_projects/anytext/README.md
Co-authored-by: Aryan <contact.aryanvs@gmail.com>

* Remove commented-out image preparation code in AnyTextPipeline

* Remove unnecessary blank line in README.md

b88fef47

07 Mar, 2025 1 commit

Add STG to community pipelines (#10960) · b38450d5

Kinam Kim authored Mar 08, 2025



* Support STG for video pipelines

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update pipeline_stg_cogvideox.py

* Update pipeline_stg_hunyuan_video.py

* Update pipeline_stg_ltx.py

* Update pipeline_stg_ltx_image2video.py

* Update pipeline_stg_mochi.py

* Update pipeline_stg_hunyuan_video.py

* Update pipeline_stg_ltx.py

* Update pipeline_stg_ltx_image2video.py

* Update pipeline_stg_mochi.py

* update

* remove rescaling

* Apply style fixes

---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

b38450d5

06 Mar, 2025 3 commits

Add CogVideoX DDIM Inversion to Community Pipelines (#10956) · 748cb0fa

LittleNyima authored Mar 07, 2025



* add cogvideox ddim inversion script

* implement as a pipeline, and add documentation

---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>

748cb0fa

Bump jinja2 from 3.1.5 to 3.1.6 in /examples/research_projects/realfill (#10984) · f1039930

dependabot[bot] authored Mar 06, 2025

Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.5 to 3.1.6.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/3.1.5...3.1.6

)

---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

f1039930

[train_dreambooth_lora.py] Fix the LR Schedulers when `num_train_epochs` is... · 37b8edfb

Jun Yeop Na authored Mar 06, 2025

[train_dreambooth_lora.py] Fix the LR Schedulers when `num_train_epochs` is passed in a distributed training env (#10973)

* updated train_dreambooth_lora to fix the LR schedulers for `num_train_epochs` in distributed training env

* fixed formatting

* remove trailing newlines

* fixed style error

37b8edfb

05 Mar, 2025 1 commit

[flux lora training] fix t5 training bug (#10845) · e031caf4

Linoy Tsaban authored Mar 05, 2025



* fix t5 training bug

* Apply style fixes

---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

e031caf4

04 Mar, 2025 2 commits

feat: add Mixture-of-Diffusers ControlNet Tile upscaler Pipeline for SDXL (#10951) · 66bf7ea5
Eliseu Silva authored Mar 04, 2025
```
* feat: add Mixture-of-Diffusers ControlNet Tile upscaler Pipeline for SDXL

* make style make quality
```
66bf7ea5

Fix incorrect seed initialization when args.seed is 0 (#10964) · b8215b1c

Alexey Zolotenkov authored Mar 04, 2025



* Fix seed initialization to handle args.seed = 0 correctly

* Apply style fixes

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

b8215b1c

24 Feb, 2025 2 commits
- [Fix] fp16 unscaling in train_dreambooth_lora_sdxl (#10889) · 170833c2
  SahilCarterr authored Feb 24, 2025
```
Fix fp16 bug
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  170833c2
- Fix `torch_dtype` in Kolors text encoder with `transformers` v4.49 (#10816) · 6f74ef55
  hlky authored Feb 24, 2025
```
* Fix `torch_dtype` in Kolors text encoder with `transformers` v4.49

* Default torch_dtype and warning
```
  6f74ef55
20 Feb, 2025 2 commits

Notebooks for Community Scripts-7 (#10846) · 51941387

Parag Ekbote authored Feb 20, 2025

Add 5 Notebooks, improve their example
scripts and update the missing links for the
example README.

51941387

[LoRA] add LoRA support to Lumina2 and fine-tuning script (#10818) · f10d3c6d

Sayak Paul authored Feb 20, 2025

* feat: lora support for Lumina2.

* fix-copies.

* updates

* updates

* docs.

* fix

* add: training script.

* tests

* updates

* updates

* major updates.

* updates

* fixes

* docs.

* updates

* updates

f10d3c6d

18 Feb, 2025 1 commit

Fix max_shift value in flux and related functions to 1.15 (issue #10675) (#10807) · b75b204a

puhuk authored Feb 18, 2025

This PR updates the max_shift value in flux to 1.15 for consistency across the codebase. In addition to modifying max_shift in flux, all related functions that copy and use this logic, such as calculate_shift in `src/diffusers/pipelines/stable_diffusion_3/pipeline_stable_diffusion_3_img2img.py`, have also been updated to ensure uniform behavior.

b75b204a

16 Feb, 2025 1 commit
- typo fix (#10802) · 952b9131
  Yaniv Galron authored Feb 16, 2025
  
  952b9131
12 Feb, 2025 1 commit

fix: [Community pipeline] Fix flattened elements on image (#10774) · 051ebc3c

Eliseu Silva authored Feb 12, 2025

* feat: new community mixture_tiling_sdxl pipeline for SDXL mixture-of-diffusers support

* fix use of variable latents to tile_latents

* removed references to modules that are not being used in this pipeline

* make style, make quality

* fixfeat: added _get_crops_coords_list function to pipeline to automatically define ctop,cleft coord to focus on image generation, helps to better harmonize the image and corrects the problem of flattened elements.

051ebc3c

11 Feb, 2025 1 commit

feat: new community mixture_tiling_sdxl pipeline for SDXL (#10759) · c4702748

Eliseu Silva authored Feb 11, 2025

* feat: new community mixture_tiling_sdxl pipeline for SDXL mixture-of-diffusers support

* fix use of variable latents to tile_latents

* removed references to modules that are not being used in this pipeline

* make style, make quality

c4702748

06 Feb, 2025 2 commits

[bugfix] NPU Adaption for Sana (#10724) · cd0a4a82

Leo Jiang authored Feb 06, 2025



* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* NPU Adaption for Sanna

* [bugfix]NPU Adaption for Sanna

---------
Co-authored-by: J石页 <jiangshuo9@h-partners.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

cd0a4a82

[Community] Enhanced `Model Search` (#10417) · 145522cb

suzukimain authored Feb 06, 2025

* Added `auto_load_textual_inversion` and `auto_load_lora_weights`

* update README.md

* fix

* make quality

* Fix and `make style`

145522cb

04 Feb, 2025 2 commits
- Notebooks for Community Scripts-6 (#10713) · dbe0094e
  Parag Ekbote authored Feb 04, 2025
```
* Fix Doc Tutorial.

* Add 4 Notebooks and improve their example
scripts.
```
  dbe0094e
- Fix train_text_to_image.py --help (#10711) · f63d3223
  Nicolas authored Feb 03, 2025
  
  f63d3223
31 Jan, 2025 1 commit

Fix inconsistent random transform in instruct pix2pix (#10698) · 5d2d2398

Thanh Le authored Jan 31, 2025

* Update train_instruct_pix2pix.py

Fix inconsistent random transform in instruct_pix2pix

* Update train_instruct_pix2pix_sdxl.py

5d2d2398