Commits · 04717fd861f897012f1239c2951ea21cb7184749 · renzhc / diffusers_dcu

12 Jun, 2024 3 commits

Add Stable Diffusion 3 (#8483) · 04717fd8

Dhruv Nair authored Jun 13, 2024



* up

* add sd3

* update

* update

* add tests

* fix copies

* fix docs

* update

* add dreambooth lora

* add LoRA

* update

* update

* update

* update

* import fix

* update

* Update src/diffusers/pipelines/stable_diffusion_3/pipeline_stable_diffusion_3.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* import fix 2

* update

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* update

* update

* update

* fix ckpt id

* fix more ids

* update

* missing doc

* Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>

* Update docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Update docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* update'

* fix

* update

* Update src/diffusers/models/autoencoders/autoencoder_kl.py

* Update src/diffusers/models/autoencoders/autoencoder_kl.py

* note on gated access.

* requirements

* licensing

---------
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

04717fd8

🤫 Quiet IP Adapter Mask Warning (#8475) · 1066fe4c

Greg Hunkins authored Jun 12, 2024



* quiet attn parameters

* fix lint

* make style && make quality

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

1066fe4c

change max_shard_size to 10GB (#8445) · d38f69ea

Sayak Paul authored Jun 12, 2024



* change max_shard_size to 10GB

* add notes to the documentation

* Update src/diffusers/models/modeling_utils.py
Co-authored-by: Lucain <lucainp@gmail.com>

* change to abs limit

---------
Co-authored-by: Lucain <lucainp@gmail.com>

d38f69ea

11 Jun, 2024 2 commits

image_processor.py: Fixed an error in ValueError's message (#8447) · 0a1c13af

Patrick authored Jun 11, 2024



* image_processor.py: Fixed an error in ValueError's message, as the string's join method tried to join types instead of strings

Bug that occurred:

f"Input is in incorrect format. Currently, we only support {', '.join(supported_formats)}"
TypeError: sequence item 0: expected str instance, type found

* Fixed: C417 Unnecessary `map` usage (rewrite using a generator expression)

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

0a1c13af
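
The failure quoted in this commit message can be reproduced in isolation. The following is a minimal sketch, assuming a tuple of built-in types as a stand-in for the real `supported_formats` (its actual contents in `image_processor.py` are not shown in this log). It illustrates the `TypeError` and the generator-expression form that the C417 lint note asks for; it is not the code from the commit itself.

```python
# Hypothetical stand-in: the real tuple in image_processor.py is not shown in this log.
supported_formats = (int, str, list)

# str.join only accepts strings, so joining type objects raises TypeError.
try:
    ", ".join(supported_formats)
    join_error = None
except TypeError as err:
    join_error = str(err)

# Convert each type to a string first, using a generator expression rather than map().
formatted = ", ".join(str(fmt) for fmt in supported_formats)
message = f"Input is in incorrect format. Currently, we only support {formatted}"

print(join_error)
print(message)
```

The generator expression keeps the conversion next to the `join` call and avoids the `map` usage flagged by the linter.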

fix SEGA pipeline (#8467) · 0028c344

YiYi Xu authored Jun 11, 2024



* fix

* style

---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

0028c344

10 Jun, 2024 1 commit

🔧 chore: use modeling_outputs.Transformer2DModelOutput (#8436) · 1d9a6a81

Jianqi Pan authored Jun 10, 2024

* 🔧 chore: use modeling_outputs.Transformer2DModelOutput

* 🔧 chore: isort

* 🔧 chore: isort

* style

---------
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>

1d9a6a81

07 Jun, 2024 2 commits

Move away from `cached_download` (#8419) · 0d68ddf3
Lucain authored Jun 07, 2024
```
* Move away from

* unused constant

* Add custom error
```
0d68ddf3

[Core] support saving and loading of sharded checkpoints (#7830) · 7d887118

Sayak Paul authored Jun 07, 2024



* feat: support saving a model in sharded checkpoints.

* feat: make loading of sharded checkpoints work.

* add tests

* cleanse the loading logic a bit more.

* more resilience while loading from the Hub.

* parallelize shard downloads by using snapshot_download()/

* default to a shard size.

* more fix

* Empty-Commit

* debug

* fix

* quality

* more debugging

* fix more

* initial comments from Benjamin

* move certain methods to loading_utils

* add test to check if the correct number of shards are present.

* add a test to check if loading of sharded checkpoints from the Hub is okay

* clarify the unit when passed as an int.

* use hf_hub for sharding.

* remove unnecessary code

* remove unnecessary function

* lucain's comments.

* fixes

* address high-level comments.

* fix test

* subfolder shenanigans./

* Update src/diffusers/utils/hub_utils.py
Co-authored-by: Lucain <lucainp@gmail.com>

* Apply suggestions from code review
Co-authored-by: Lucain <lucainp@gmail.com>

* remove _huggingface_hub_version as not needed.

* address more feedback.

* add a test for local_files_only=True/

* need hf hub to be at least 0.23.2

* style

* final comment.

* clean up subfolder.

* deal with suffixes in code.

* _add_variant default.

* use weights_name_pattern

* remove add_suffix_keyword

* clean up downloading of sharded ckpts.

* don't return something special when using index.json

* fix more

* don't use bare except

* remove comments and catch the errors better

* fix a couple of things when using is_file()

* empty

---------
Co-authored-by: Lucain <lucainp@gmail.com>

7d887118

06 Jun, 2024 1 commit
- [Core] fix: legacy model mapping (#8416) · a3faf3f2
  Sayak Paul authored Jun 06, 2024
```
* fix: legacy model mapping

* remove print
```
  a3faf3f2
05 Jun, 2024 7 commits

Errata (#8322) · 98730c5d

Tolga Cangöz authored Jun 05, 2024

* Fix typos

* Trim trailing whitespaces

* Remove a trailing whitespace

* chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0

* Revert "chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0"

This reverts commit fd742b30b4258106008a6af4d0dd4664904f8595.

* pokemon -> naruto

* `DPMSolverMultistep` -> `DPMSolverMultistepScheduler`

* Improve Markdown stylization

* Improve style

* Improve style

* Refactor pipeline variable names for consistency

* up style

98730c5d

[Scheduler] fix: EDM schedulers when using the exp sigma schedule. (#8385) · 48207d66
Sayak Paul authored Jun 05, 2024
```
* fix: euledm when using the exp sigma schedule.

* fix-copies

* remove print.

* reduce friction

* yiyi's suggestions
```
48207d66
[Hunyuan] allow Hunyuan DiT to run under 6GB for GPU VRAM (#8399) · 2f6f426f
Sayak Paul authored Jun 05, 2024
```
* allow hunyuan dit to run under 6GB for GPU VRAM

* add section in the docs/
```
2f6f426f

[LoRA] Remove legacy LoRA code and related adjustments (#8316) · a0542c19

Sayak Paul authored Jun 05, 2024

* remove legacy code from load_attn_procs.

* finish first draft

* fix more.

* fix more

* add test

* add serialization support.

* fix-copies

* require peft backend for lora tests

* style

* fix test

* fix loading.

* empty

* address benjamin's feedback.

a0542c19

[Hunyuan] feat: support chunked ff. (#8397) · a8ad6664
Sayak Paul authored Jun 05, 2024
```
feat: support chunked ff.
```
a8ad6664
[Hunyuan DiT] feat: enable fusing qkv projections when doing attention (#8396) · 14f7b545
Sayak Paul authored Jun 05, 2024
```
* feat: introduce qkv fusion for Hunyuan

* fix copies
```
14f7b545
Update code example in pipeline_stable_unclip_img2img.py EXAMPLE_DOC_STRING (#8401) · 07cd2004
leaps authored Jun 04, 2024
```
Update code example in pipeline_stable_unclip_img2img.py

Previous code caused an error when run
```
07cd2004

04 Jun, 2024 3 commits

[Transformer2DModel] Handle `norm_type` safely while remapping (#8370) · 6ddbf622

Sayak Paul authored Jun 04, 2024



* handle norm_type of transformer2d_model safely.

* log an info when old model class is being returned.

* Apply suggestions from code review
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* remove extra stuff

---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

6ddbf622

[HunyuanDiT] minor docs changes in hunyuandit (#8395) · 3ff39e8e
Sayak Paul authored Jun 04, 2024
```
minor docs changes in hunyuandit
```
3ff39e8e
Fix AsymmetricAutoencoderKL forward (#8378) · 6be43bd8
townwish4git authored Jun 04, 2024

6be43bd8

01 Jun, 2024 2 commits

Tencent Hunyuan Team: add HunyuanDiT related updates (#8240) · 41360440

XCL authored Jun 02, 2024



* Hunyuan Team: add HunyuanDiT related updates


---------
Co-authored-by: XCLiu <liuxc1996@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>

41360440

Fix DREAM training (#8302) · bc108e15

39th president of the United States, probably authored Jun 01, 2024

Co-authored-by: Jimmy <39@🇺🇸.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

bc108e15

31 May, 2024 2 commits

[Core] Introduce class variants for `Transformer2DModel` (#7647) · 983dec3b

Sayak Paul authored May 31, 2024

* init for patches

* finish patched model.

* continuous transformer

* vectorized transformer2d.

* style.

* inits.

* fix-copies.

* introduce DiTTransformer2DModel.

* fixes

* use REMAPPING as suggested by @DN6

* better logging.

* add pixart transformer model.

* inits.

* caption_channels.

* attention masking.

* fix use_additional_conditions.

* remove print.

* debug

* flatten

* fix: assertion for sigma

* handle remapping for modeling_utils

* add tests for dit transformer2d

* quality

* placeholder for pixart tests

* pixart tests

* add _no_split_modules

* add docs.

* check

* check

* check

* check

* fix tests

* fix tests

* move Transformer output to modeling_output

* move errors better and bring back use_additional_conditions attribute.

* add unnecessary things from DiT.

* clean up pixart

* fix remapping

* fix device_map things in pixart2d.

* replace Transformer2DModel with appropriate classes in dit, pixart tests

* empty

* legacy mixin classes./

* use a remapping dict for fetching class names.

* change to specific model types in the pipeline implementations.

* move _fetch_remapped_cls_from_config to modeling_loading_utils.py

* fix dependency problems.

* add deprecation note.

983dec3b

Change checkpoint key used to identify CLIP models in single file checkpoints (#8319) · f9fa8a86
Dhruv Nair authored May 31, 2024
```
update
```
f9fa8a86

30 May, 2024 1 commit
- Fix depth pipeline "input/weight type should be the same" error at fp16 (#8321) · 05be622b
  Jonah authored May 30, 2024
```
Fix "input/weight type should be the same"
Co-authored-by: YiYi Xu <yixu310@gmail.com>
```
  05be622b
29 May, 2024 7 commits
- Fix StableDiffusionPipeline when `text_encoder=None` (#8297) · 42cae93b
  Dhruv Nair authored May 30, 2024
```
* update

* update

---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  42cae93b
- Fix Copying Mechanism typo/bug (#8232) · a2ecce26
  Tolga Cangöz authored May 29, 2024
```
* Fix copying mechanism typos

* fix copying mecha

* Revert, since they are in TODO

* Fix copying mechanism
```
  a2ecce26
- Simplify `platform_info` assignment in `diffusers-cli env` (#8298) · f4a44b77
  Tolga Cangöz authored May 29, 2024
```
chore: Simplify `platform_info` assignment
```
  f4a44b77
- post release v0.28.0 (#8286) · 581d8aac
  Sayak Paul authored May 29, 2024
```
* post release v0.28.0

* style
```
  581d8aac
- [Core] Refactor `IPAdapterPlusImageProjection` a bit (#7994) · ba1bfac2
  Sayak Paul authored May 29, 2024
```
* use IPAdapterPlusImageProjectionBlock in IPAdapterPlusImageProjection

* reposition IPAdapterPlusImageProjection

* refactor complete?

* fix heads param retrieval.

* update test dict creation method.
```
  ba1bfac2
- move `vqmodel` to `models.autoencoders`. (#8292) · 5edd0b34
  Sayak Paul authored May 29, 2024
```
move vqmodel to models.autoencoders.
```
  5edd0b34
- [Post release 0.28.0] remove deprecated blocks. (#8291) · 3a28e36a
  Sayak Paul authored May 29, 2024
```
* remove deprecated blocks.

* update the location paths.
```
  3a28e36a
28 May, 2024 4 commits

fix pixart-sigma negative prompt handling (#8299) · 3393c01c

Vladimir Mandic authored May 28, 2024



* fix negative prompt

* fix

---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>

3393c01c

Fix object has no attribute 'flush' when using without a console (#8271) · b2030a24
Álvaro Somoza authored May 28, 2024
```
fix
```
b2030a24
[LoRA] attempt at fixing onetrainer lora. (#8242) · e6df8eda
Sayak Paul authored May 28, 2024
```
* attempt at fixing onetrainer lora.

* fix
```
e6df8eda

[docs] Add controlnet example to marigold (#8289) · ba824141

Álvaro Somoza authored May 28, 2024



* initial doc

* fix wrong LCM sentence

* implement binary colormap without requiring matplotlib
update section about Marigold for ControlNet
update formatting of marigold_usage.md

* fix indentation

---------
Co-authored-by: anton <anton.obukhov@gmail.com>

ba824141

27 May, 2024 1 commit

[Pipeline] Marigold depth and normals estimation (#7847) · b3d10d6d

Anton Obukhov authored May 27, 2024



* implement marigold depth and normals pipelines in diffusers core

* remove bibtex

* remove deprecations

* remove save_memory argument

* remove validate_vae

* remove config output

* remove batch_size autodetection

* remove presets logic
move default denoising_steps and processing_resolution into the model config
make default ensemble_size 1

* remove no_grad

* add fp16 to the example usage

* implement is_matplotlib_available
use is_matplotlib_available, is_scipy_available for conditional imports in the marigold depth pipeline

* move colormap, visualize_depth, and visualize_normals into export_utils.py

* make the denoising loop more lucid
fix the outputs to always be 4d tensors or lists of pil images
support a 4d input_image case
attempt to support model_cpu_offload_seq
move check_inputs into a separate function
change default batch_size to 1, remove any logic to make it bigger implicitly

* style

* rename denoising_steps into num_inference_steps

* rename input_image into image

* rename input_latent into latents

* remove decode_image
change decode_prediction to use the AutoencoderKL.decode method

* move clean_latent outside of progress_bar

* refactor marigold-reusable image processing bits into MarigoldImageProcessor class

* clean up the usage example docstring

* make ensemble functions members of the pipelines

* add early checks in check_inputs
rename E into ensemble_size in depth ensembling

* fix vae_scale_factor computation

* better compatibility with torch.compile
better variable naming

* move export_depth_to_png to export_utils

* remove encode_prediction

* improve visualize_depth and visualize_normals to accept multi-dimensional data and lists
remove visualization functions from the pipelines
move exporting depth as 16-bit PNGs functionality from the depth pipeline
update example docstrings

* do not shortcut vae.config variables

* change all asserts to raise ValueError

* rename output_prediction_type to output_type

* better variable names
clean up variable deletion code

* better variable names

* pass desc and leave kwargs into the diffusers progress_bar
implement nested progress bar for images and steps loops

* implement scale_invariant and shift_invariant flags in the ensemble_depth function
add scale_invariant and shift_invariant flags readout from the model config
further refactor ensemble_depth
support ensembling without alignment
add ensemble_depth docstring

* fix generator device placement checks

* move encode_empty_text body into the pipeline call

* minor empty text encoding simplifications

* adjust pipelines' class docstrings to explain the added construction arguments

* improve the scipy failure condition
add comments
improve docstrings
change the default use_full_z_range to True

* make input image values range check configurable in the preprocessor
refactor load_image_canonical in preprocessor to reject unknown types and return the image in the expected 4D format of tensor and on right device
support a list of everything as inputs to the pipeline, change type to PipelineImageInput
implement a check that all input list elements have the same dimensions
improve docstrings of pipeline outputs
remove check_input pipeline argument

* remove forgotten print

* add prediction_type model config

* add uncertainty visualization into export utils
fix NaN values in normals uncertainties

* change default of output_uncertainty to False
better handle the case of an attempt to export or visualize none

* fix `output_uncertainty=False`

* remove kwargs
fix check_inputs according to the new inputs of the pipeline

* rename prepare_latent into prepare_latents as in other pipelines
annotate prepare_latents in normals pipeline with "Copied from"
annotate encode_image in normals pipeline with "Copied from"

* move nested-capable `progress_bar` method into the pipelines
revert the original `progress_bar` method in pipeline_utils

* minor message improvement

* fix cpu offloading

* move colormap, visualize_depth, export_depth_to_16bit_png, visualize_normals, visualize_uncertainty to marigold_image_processing.py
update example docstrings

* fix missing comma

* change torch.FloatTensor to torch.Tensor

* fix importing of MarigoldImageProcessor

* fix vae offloading
fix batched image encoding
remove separate encode_image function and use vae.encode instead

* implement marigold's initial tests
relax generator checks in line with other pipelines
implement return_dict __call__ argument in line with other pipelines

* fix num_images computation

* remove MarigoldImageProcessor and outputs from import structure
update tests

* update docstrings

* update init

* update

* style

* fix

* fix

* up

* up

* up

* add simple test

* up

* update expected np input/output to be channel last

* move expand_tensor_or_array into the MarigoldImageProcessor

* rewrite tests to follow conventions - hardcoded slices instead of image artifacts
write more smoke tests

* add basic docs.

* add anton's contribution statement

* remove todos.

* fix assertion values for marigold depth slow tests

* fix assertion values for depth normals.

* remove print

* support AutoencoderTiny in the pipelines

* update documentation page
add Available Pipelines section
add Available Checkpoints section
add warning about num_inference_steps

* fix missing import in docstring
fix wrong value in visualize_depth docstring

* [doc] add marigold to pipelines overview

* [doc] add section "usage examples"

* fix an issue with latents check in the pipelines

* add "Frame-by-frame Video Processing with Consistency" section

* grammarly

* replace tables with images with css-styled images (blindly)

* style

* print

* fix the assertions.

* take from the github runner.

* take the slices from action artifacts

* style.

* update with the slices from the runner.

* remove unnecessary code blocks.

* Revert "[doc] add marigold to pipelines overview"

This reverts commit a505165150afd8dab23c474d1a054ea505a56a5f.

* remove invitation for new modalities

* split out marigold usage examples

* doc cleanup

---------
Co-authored-by: yiyixuxu <yixu310@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail,com>
Co-authored-by: sayakpaul <spsayakpaul@gmail.com>

b3d10d6d

24 May, 2024 3 commits
- Fix a grammatical error in the `raise` messages (#8272) · db33af06
  Tolga Cangöz authored May 24, 2024
```
Fix grammatical error
```
  db33af06
- Respect `resume_download` deprecation V2 (#8267) · edf5ba6a
  Lucain authored May 24, 2024
```
* Fix resume_download FutureWarning

* only resume download
```
  edf5ba6a
- Use `freedesktop_os_release()` in diffusers cli for Python >=3.10 (#8235) · 370146e4
  Dhruv Nair authored May 24, 2024
```
* update

* update
```
  370146e4
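
For context on the commit title above, the following is a rough sketch of how a version-gated call to `platform.freedesktop_os_release()` might look. The helper name `describe_platform` and the output format are assumptions made for illustration; the log does not show how `diffusers-cli env` actually uses the result.

```python
import platform
import sys


def describe_platform() -> str:
    """Return a platform string, enriched with os-release data when available.

    Hypothetical helper for illustration; not taken from the diffusers CLI.
    """
    info = platform.platform()
    # freedesktop_os_release() was added to the standard library in Python 3.10.
    if sys.version_info >= (3, 10):
        try:
            release = platform.freedesktop_os_release()
        except OSError:
            # No readable os-release file (e.g. macOS, Windows, minimal containers).
            return info
        pretty = release.get("PRETTY_NAME") or release.get("NAME")
        if pretty:
            info = f"{info} ({pretty})"
    return info


print(describe_platform())
```

Catching `OSError` keeps the helper usable on systems without an os-release file, where the function falls back to the plain `platform.platform()` string.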
23 May, 2024 1 commit
- Fix resize issue in SVD pipeline with VideoProcessor (#8229) · 67b3fe0a
  Dhruv Nair authored May 23, 2024
```
update
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
```
  67b3fe0a