"docs/vscode:/vscode.git/clone" did not exist on "d32a5980ca553f70fd55600b28dd8bb1017704c6"
- 03 Jun, 2024 1 commit
-
-
XCL authored
* add hunyuandit doc * update hunyuandit doc * update hunyuandit 2d model * update toctree.yml for hunyuandit
-
- 31 May, 2024 2 commits
-
-
Anton Obukhov authored
* rename prs-eth/marigold-lcm-v1-0 into prs-eth/marigold-depth-lcm-v1-0 * update image paths in https://huggingface.co/datasets/huggingface/documentation-images to use main branch * fix relative paths to other diffusers pages * Update docs/source/en/using-diffusers/marigold_usage.md Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
-
Sayak Paul authored
* init for patches * finish patched model. * continuous transformer * vectorized transformer2d. * style. * inits. * fix-copies. * introduce DiTTransformer2DModel. * fixes * use REMAPPING as suggested by @DN6 * better logging. * add pixart transformer model. * inits. * caption_channels. * attention masking. * fix use_additional_conditions. * remove print. * debug * flatten * fix: assertion for sigma * handle remapping for modeling_utils * add tests for dit transformer2d * quality * placeholder for pixart tests * pixart tests * add _no_split_modules * add docs. * check * check * check * check * fix tests * fix tests * move Transformer output to modeling_output * move errors better and bring back use_additional_conditions attribute. * add unnecessary things from DiT. * clean up pixart * fix remapping * fix device_map things in pixart2d. * replace Transformer2DModel with appropriate classes in dit, pixart tests * empty * legacy mixin classes./ * use a remapping dict for fetching class names. * change to specifc model types in the pipeline implementations. * move _fetch_remapped_cls_from_config to modeling_loading_utils.py * fix dependency problems. * add deprecation note.
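A rough sketch of the class split described in this commit (checkpoint names below are assumed examples, not taken from the commit): legacy checkpoints saved with the generic Transformer2DModel class are remapped at load time to the new model-specific classes via the REMAPPING dict, while new code can target the specific classes directly.

```python
import torch
from diffusers import Transformer2DModel, PixArtTransformer2DModel

# Loading a legacy DiT checkpoint through the old generic class should be remapped to
# the new model-specific class (with a deprecation note), per the REMAPPING dict.
model = Transformer2DModel.from_pretrained("facebook/DiT-XL-2-256", subfolder="transformer")
print(type(model).__name__)  # expected: DiTTransformer2DModel

# New code can use the specific classes directly, e.g. for PixArt checkpoints.
pixart = PixArtTransformer2DModel.from_pretrained(
    "PixArt-alpha/PixArt-XL-2-1024-MS", subfolder="transformer", torch_dtype=torch.float16
)
```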
-
- 29 May, 2024 3 commits
-
-
Steven Liu authored
* files and formats * fix callout * feedback * code sample * feedback
-
Steven Liu authored
deepfloyd training Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
-
Sayak Paul authored
move vqmodel to models.autoencoders.
-
- 28 May, 2024 4 commits
-
-
Steven Liu authored
* first draft * edits
-
Steven Liu authored
* noise schedule * sigmas and zero snr * feedback * feedback
-
Jiwook Han authored
fix typo in philosophy.md
-
Álvaro Somoza authored
* initial doc * fix wrong LCM sentence * implement binary colormap without requiring matplotlib * update section about Marigold for ControlNet * update formatting of marigold_usage.md * fix indentation --------- Co-authored-by: anton <anton.obukhov@gmail.com>
-
- 27 May, 2024 2 commits
-
-
Anton Obukhov authored
* implement marigold depth and normals pipelines in diffusers core * remove bibtex * remove deprecations * remove save_memory argument * remove validate_vae * remove config output * remove batch_size autodetection * remove presets logic move default denoising_steps and processing_resolution into the model config make default ensemble_size 1 * remove no_grad * add fp16 to the example usage * implement is_matplotlib_available use is_matplotlib_available, is_scipy_available for conditional imports in the marigold depth pipeline * move colormap, visualize_depth, and visualize_normals into export_utils.py * make the denoising loop more lucid fix the outputs to always be 4d tensors or lists of pil images support a 4d input_image case attempt to support model_cpu_offload_seq move check_inputs into a separate function change default batch_size to 1, remove any logic to make it bigger implicitly * style * rename denoising_steps into num_inference_steps * rename input_image into image * rename input_latent into latents * remove decode_image change decode_prediction to use the AutoencoderKL.decode method * move clean_latent outside of progress_bar * refactor marigold-reusable image processing bits into MarigoldImageProcessor class * clean up the usage example docstring * make ensemble functions members of the pipelines * add early checks in check_inputs rename E into ensemble_size in depth ensembling * fix vae_scale_factor computation * better compatibility with torch.compile better variable naming * move export_depth_to_png to export_utils * remove encode_prediction * improve visualize_depth and visualize_normals to accept multi-dimensional data and lists remove visualization functions from the pipelines move exporting depth as 16-bit PNGs functionality from the depth pipeline update example docstrings * do not shortcut vae.config variables * change all asserts to raise ValueError * rename output_prediction_type to output_type * better variable names clean up variable deletion code * better variable names * pass desc and leave kwargs into the diffusers progress_bar implement nested progress bar for images and steps loops * implement scale_invariant and shift_invariant flags in the ensemble_depth function add scale_invariant and shift_invariant flags readout from the model config further refactor ensemble_depth support ensembling without alignment add ensemble_depth docstring * fix generator device placement checks * move encode_empty_text body into the pipeline call * minor empty text encoding simplifications * adjust pipelines' class docstrings to explain the added construction arguments * improve the scipy failure condition add comments improve docstrings change the default use_full_z_range to True * make input image values range check configurable in the preprocessor refactor load_image_canonical in preprocessor to reject unknown types and return the image in the expected 4D format of tensor and on right device support a list of everything as inputs to the pipeline, change type to PipelineImageInput implement a check that all input list elements have the same dimensions improve docstrings of pipeline outputs remove check_input pipeline argument * remove forgotten print * add prediction_type model config * add uncertainty visualization into export utils fix NaN values in normals uncertainties * change default of output_uncertainty to False better handle the case of an attempt to export or visualize none * fix `output_uncertainty=False` * remove kwargs fix check_inputs according to the new inputs 
of the pipeline * rename prepare_latent into prepare_latents as in other pipelines annotate prepare_latents in normals pipeline with "Copied from" annotate encode_image in normals pipeline with "Copied from" * move nested-capable `progress_bar` method into the pipelines revert the original `progress_bar` method in pipeline_utils * minor message improvement * fix cpu offloading * move colormap, visualize_depth, export_depth_to_16bit_png, visualize_normals, visualize_uncertainty to marigold_image_processing.py update example docstrings * fix missing comma * change torch.FloatTensor to torch.Tensor * fix importing of MarigoldImageProcessor * fix vae offloading fix batched image encoding remove separate encode_image function and use vae.encode instead * implement marigold's intial tests relax generator checks in line with other pipelines implement return_dict __call__ argument in line with other pipelines * fix num_images computation * remove MarigoldImageProcessor and outputs from import structure update tests * update docstrings * update init * update * style * fix * fix * up * up * up * add simple test * up * update expected np input/output to be channel last * move expand_tensor_or_array into the MarigoldImageProcessor * rewrite tests to follow conventions - hardcoded slices instead of image artifacts write more smoke tests * add basic docs. * add anton's contribution statement * remove todos. * fix assertion values for marigold depth slow tests * fix assertion values for depth normals. * remove print * support AutoencoderTiny in the pipelines * update documentation page add Available Pipelines section add Available Checkpoints section add warning about num_inference_steps * fix missing import in docstring fix wrong value in visualize_depth docstring * [doc] add marigold to pipelines overview * [doc] add section "usage examples" * fix an issue with latents check in the pipelines * add "Frame-by-frame Video Processing with Consistency" section * grammarly * replace tables with images with css-styled images (blindly) * style * print * fix the assertions. * take from the github runner. * take the slices from action artifacts * style. * update with the slices from the runner. * remove unnecessary code blocks. * Revert "[doc] add marigold to pipelines overview" This reverts commit a505165150afd8dab23c474d1a054ea505a56a5f. * remove invitation for new modalities * split out marigold usage examples * doc cleanup --------- Co-authored-by:
yiyixuxu <yixu310@gmail.com> Co-authored-by:
yiyixuxu <yixu310@gmail.com> Co-authored-by:
sayakpaul <spsayakpaul@gmail.com>
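A minimal usage sketch of the depth pipeline added here, assuming the prs-eth/marigold-depth-lcm-v1-0 checkpoint and the post-refactor helper names on the pipeline's image processor (visualize_depth, export_depth_to_16bit_png); the input image URL is an assumed example.

```python
import torch
import diffusers
from diffusers.utils import load_image

pipe = diffusers.MarigoldDepthPipeline.from_pretrained(
    "prs-eth/marigold-depth-lcm-v1-0", variant="fp16", torch_dtype=torch.float16
).to("cuda")

image = load_image("https://marigoldmonodepth.github.io/images/einstein.jpg")
depth = pipe(image)  # MarigoldDepthOutput with a `prediction` field

# Visualization and 16-bit PNG export live on the MarigoldImageProcessor.
vis = pipe.image_processor.visualize_depth(depth.prediction)
vis[0].save("einstein_depth_colored.png")

depth_16bit = pipe.image_processor.export_depth_to_16bit_png(depth.prediction)
depth_16bit[0].save("einstein_depth_16bit.png")
```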
-
Dhaivat Bhatt authored
* Add details about 1-stage implementation * Add details about 1-stage implementation
-
- 24 May, 2024 3 commits
-
-
Tolga Cangöz authored
* Fix typos * Fix `pipe.enable_model_cpu_offload()` usage * Fix cpu offloading * Update numbers
-
Yue Wu authored
sampling bug fix in basic_training.md. In the diffusers basic training tutorial, setting the manual seed argument (generator=torch.manual_seed(config.seed)) in the pipeline call inside the evaluate() function rewinds the dataloader shuffling, leading to overfitting because the model sees the same sequence of training examples after every evaluation call. Using generator=torch.Generator(device='cpu').manual_seed(config.seed) avoids this.
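A sketch of the corrected evaluate() call, assuming the tutorial's config and DDPMPipeline objects; the only substantive change is how the generator is constructed.

```python
import torch
from diffusers import DDPMPipeline

def evaluate(config, epoch, pipeline: DDPMPipeline):
    # A dedicated Generator keeps evaluation sampling reproducible without touching
    # the global RNG; torch.manual_seed(config.seed) would reseed the global RNG
    # and rewind the training dataloader's shuffling, as described above.
    generator = torch.Generator(device="cpu").manual_seed(config.seed)
    images = pipeline(
        batch_size=config.eval_batch_size,
        generator=generator,
    ).images
    return images
```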
-
Dhruv Nair authored
* update * update
-
- 21 May, 2024 1 commit
-
-
Steven Liu authored
* fix? * fix? * fix
-
- 20 May, 2024 2 commits
-
-
Junsong Chen authored
* 1. add doc for PixArtSigmaPipeline; --------- Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com> Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by:
Guillaume LEGENDRE <glegendre01@gmail.com> Co-authored-by:
Álvaro Somoza <asomoza@users.noreply.github.com> Co-authored-by:
Bagheera <59658056+bghira@users.noreply.github.com> Co-authored-by:
bghira <bghira@users.github.com> Co-authored-by:
Hyoungwon Cho <jhw9811@korea.ac.kr> Co-authored-by:
yiyixuxu <yixu310@gmail.com> Co-authored-by:
Tolga Cangöz <46008593+standardAI@users.noreply.github.com> Co-authored-by:
Philip Pham <phillypham@google.com>
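A minimal text-to-image sketch for the documented pipeline, assuming the PixArt-alpha/PixArt-Sigma-XL-2-1024-MS checkpoint.

```python
import torch
from diffusers import PixArtSigmaPipeline

pipe = PixArtSigmaPipeline.from_pretrained(
    "PixArt-alpha/PixArt-Sigma-XL-2-1024-MS", torch_dtype=torch.float16
).to("cuda")

image = pipe("A small cactus with a happy face in the Sahara desert").images[0]
image.save("cactus.png")
```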
-
Jacob Marks authored
-
- 13 May, 2024 1 commit
-
-
Álvaro Somoza authored
-
- 10 May, 2024 4 commits
-
-
Mark Van Aken authored
* find & replace all FloatTensors to Tensor * apply formatting * Update torch.FloatTensor to torch.Tensor in the remaining files * formatting * Fix the rest of the places where FloatTensor is used as well as in documentation * formatting * Update new file from FloatTensor to Tensor
-
Sayak Paul authored
* introduce videoprocessor. * fix quality * address yiyi's feedback * fix preprocess_video call. * video_processor -> image_processor * fix * fix more. * quality * image_processor -> video_processor * support List[List[PIL.Image.Image]] * change to video_processor. * documentation * Apply suggestions from code review * changes * remove print. * refactor video processor (part # 7776) (#7861) * update * update remove deprecate * Update src/diffusers/video_processor.py * update * Apply suggestions from code review * deprecate list of 5d for video and list of 4d for image + apply other feedbacks * up --------- Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com> * add doc. * tensor2vid -> postprocess_video. * refactor preprocess with preprocess_video * set default values. * empty commit * more refactoring of prepare_latents in animatediff vid2vid * checking documentation * remove documentation for now. * fix animatediff sdxl * fix test failure [part of video processor PR] (#7905) up * remove preceed_with_frames. * doc * fix * fix * remove video input as a single-frame video. --------- Co-authored-by:
YiYi Xu <yixu310@gmail.com>
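A rough sketch of the shared helper introduced here, assuming VideoProcessor lives in diffusers.video_processor and accepts a list of PIL frames; postprocess_video (the renamed tensor2vid) is the inverse step the pipelines use to turn decoded tensors back into frames.

```python
import numpy as np
from PIL import Image
from diffusers.video_processor import VideoProcessor

video_processor = VideoProcessor(vae_scale_factor=8)

# A dummy 8-frame video as a list of PIL images; numpy arrays and torch tensors
# are also accepted inputs.
frames = [Image.fromarray(np.zeros((256, 256, 3), dtype=np.uint8)) for _ in range(8)]

# Resize/normalize the frames into a batched tensor ready for VAE encoding.
video = video_processor.preprocess_video(frames)
print(video.shape, video.dtype)
```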
-
Sayak Paul authored
add missing processors.
-
Sayak Paul authored
* upgrade to python 3.10 * fix * try https://askubuntu.com/questions/1459694/can-not-find-python3-10-after-apt-get-installation * fix * up * yes * okay * up * up * up * up * up * check * okay * up * i[ * fix
-
- 09 May, 2024 2 commits
-
-
YiYi Xu authored
* support custom sigmas and timesteps, dpm euler --------- Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com> Co-authored-by:
Benjamin Bossan <BenjaminBossan@users.noreply.github.com> Co-authored-by:
Steven Liu <59462357+stevhliu@users.noreply.github.com>
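A sketch of passing a custom schedule at call time, with an illustrative (not tuned) sigma list and an assumed SDXL checkpoint; exact argument support depends on the pipeline and scheduler.

```python
import torch
from diffusers import StableDiffusionXLPipeline

pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16, variant="fp16"
).to("cuda")

# Custom 10-step sigma schedule (illustrative values only), ending at 0.0.
sigmas = [14.615, 6.315, 3.771, 2.181, 1.342, 0.862, 0.555, 0.380, 0.234, 0.113, 0.0]
image = pipe("a photo of an astronaut riding a horse on mars", sigmas=sigmas).images[0]
image.save("astronaut.png")
```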
-
Dhruv Nair authored
* refactor unet single file loading a bit. * retrieve the unet from create_diffusers_unet_model_from_ldm * update * update * updae * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * tests * update * update * update * Update docs/source/en/api/single_file.md Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/api/single_file.md Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com> * update * update * update * update * update * update * update * update * update * update * update * update * update * Update docs/source/en/api/loaders/single_file.md Co-authored-by:
YiYi Xu <yixu310@gmail.com> * Update src/diffusers/loaders/single_file.py Co-authored-by:
YiYi Xu <yixu310@gmail.com> * Update docs/source/en/api/loaders/single_file.md Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/api/loaders/single_file.md Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/api/loaders/single_file.md Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/api/loaders/single_file.md Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com> * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update --------- Co-authored-by:
sayakpaul <spsayakpaul@gmail.com> Co-authored-by:
YiYi Xu <yixu310@gmail.com>
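For reference, a minimal single-file loading sketch; the checkpoint URL below is an assumed example, not taken from the commit.

```python
import torch
from diffusers import StableDiffusionXLPipeline

ckpt_url = "https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/blob/main/sd_xl_base_1.0.safetensors"

# from_single_file builds the full pipeline directly from an original-format checkpoint.
pipe = StableDiffusionXLPipeline.from_single_file(ckpt_url, torch_dtype=torch.float16).to("cuda")
```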
-
- 08 May, 2024 1 commit
-
-
Aryan authored
* update conversion script to handle motion adapter sdxl checkpoint * add animatediff xl * handle addition_embed_type * fix output * update * add imports * make fix-copies * add decode latents * update docstrings * add animatediff sdxl to docs * remove unnecessary lines * update example * add test * revert conv_in conv_out kernel param * remove unused param addition_embed_type_num_heads * latest IPAdapter impl * make fix-copies * fix return * add IPAdapterTesterMixin to tests * fix return * revert based on suggestion * add freeinit * fix test_to_dtype test * use StableDiffusionMixin instead of different helper methods * fix progress bar iterations * apply suggestions from review * hardcode flip_sin_to_cos and freq_shift * make fix-copies * fix ip adapter implementation * fix last failing test * make style * Update docs/source/en/api/pipelines/animatediff.md Co-authored-by:
Dhruv Nair <dhruv.nair@gmail.com> * remove todo * fix doc-builder errors --------- Co-authored-by:
Dhruv Nair <dhruv.nair@gmail.com>
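A short sketch of the new SDXL video pipeline, assuming the guoyww/animatediff-motion-adapter-sdxl-beta adapter and an SDXL base checkpoint.

```python
import torch
from diffusers import AnimateDiffSDXLPipeline, MotionAdapter
from diffusers.utils import export_to_gif

adapter = MotionAdapter.from_pretrained(
    "guoyww/animatediff-motion-adapter-sdxl-beta", torch_dtype=torch.float16
)
pipe = AnimateDiffSDXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    motion_adapter=adapter,
    torch_dtype=torch.float16,
    variant="fp16",
).to("cuda")

output = pipe("a panda surfing on a wave, high quality", num_frames=16, guidance_scale=8.0)
export_to_gif(output.frames[0], "panda_surfing.gif")
```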
-
- 07 May, 2024 1 commit
-
-
Bagheera authored
* 7879 - adjust documentation to use naruto dataset, since pokemon is now gated * replace references to pokemon in docs * more references to pokemon replaced * Japanese translation update --------- Co-authored-by: bghira <bghira@users.github.com>
-
- 06 May, 2024 1 commit
-
-
Steven Liu authored
* combine * edits
-
- 03 May, 2024 2 commits
-
-
Steven Liu authored
* lcm * lcm lora * fix * fix hfoption * edits
-
HelloWorldBeginner authored
Add Ascend NPU support for SDXL fine-tuning and fix the model saving bug when using DeepSpeed. (#7816) * Add Ascend NPU support for SDXL fine-tuning and fix the model saving bug when using DeepSpeed. * fix check code quality * Decouple the NPU flash attention and make it an independent module. * add doc and unit tests for npu flash attention. --------- Co-authored-by:
mhh001 <mahonghao1@huawei.com> Co-authored-by:
Sayak Paul <spsayakpaul@gmail.com>
-
- 30 Apr, 2024 1 commit
-
-
Steven Liu authored
* community pipelines * feedback * consolidate
-
- 28 Apr, 2024 1 commit
-
-
Jenyuan-Huang authored
* enable control ip-adapter per-transformer block on-the-fly --------- Co-authored-by:
sayakpaul <spsayakpaul@gmail.com> Co-authored-by:
ResearcherXman <xhs.research@gmail.com> Co-authored-by:
YiYi Xu <yixu310@gmail.com>
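A sketch of the per-block control added here, assuming an SDXL checkpoint and the h94/IP-Adapter weights: passing a nested dict to set_ip_adapter_scale applies different scales to individual transformer blocks (the style image URL is a placeholder).

```python
import torch
from diffusers import StableDiffusionXLPipeline
from diffusers.utils import load_image

pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")
pipe.load_ip_adapter("h94/IP-Adapter", subfolder="sdxl_models", weight_name="ip-adapter_sdxl.bin")

# Inject the image prompt only into selected blocks (style-only control);
# blocks that are not listed fall back to a scale of 0.0.
pipe.set_ip_adapter_scale({
    "down": {"block_2": [0.0, 1.0]},
    "up": {"block_0": [0.0, 1.0, 0.0]},
})

style_image = load_image("https://example.com/style_reference.png")  # placeholder URL
image = pipe("a cat, masterpiece, best quality", ip_adapter_image=style_image).images[0]
```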
-
- 26 Apr, 2024 1 commit
-
-
Fabio Rigano authored
* [Docs] Update image masking and face id example * Update docs * Fix docs
-
- 25 Apr, 2024 2 commits
-
-
Steven Liu authored
* refactor * code snippets * fix path * fix path in guide * code outputs * align toctree title * title * fix title
-
Steven Liu authored
* reproducibility * feedback * feedback * fix path * github link
-
- 23 Apr, 2024 1 commit
-
-
Steven Liu authored
* toctree * optim * feedback * improve overview
-
- 22 Apr, 2024 2 commits
-
-
Jenyuan-Huang authored
* enable control ip-adapter per-transformer block on-the-fly --------- Co-authored-by:
sayakpaul <spsayakpaul@gmail.com> Co-authored-by:
ResearcherXman <xhs.research@gmail.com> Co-authored-by:
YiYi Xu <yixu310@gmail.com>
-
Steven Liu authored
* autopipeline * edits * feedback --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
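A brief sketch of the AutoPipeline API covered by this doc pass (checkpoint name assumed): the auto classes resolve the task-specific pipeline from the checkpoint, and from_pipe reuses the already-loaded components for another task.

```python
import torch
from diffusers import AutoPipelineForText2Image, AutoPipelineForImage2Image

pipe_t2i = AutoPipelineForText2Image.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")
image = pipe_t2i("a watercolor painting of a lighthouse at dusk").images[0]

# Switch tasks without re-downloading or re-loading the weights.
pipe_i2i = AutoPipelineForImage2Image.from_pipe(pipe_t2i)
image = pipe_i2i("make the sky stormy", image=image, strength=0.6).images[0]
```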
-
- 19 Apr, 2024 1 commit
-
-
Fabio Rigano authored
* Switch to peft and multi proj layers * Move Face ID loading and inference to core --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
-
- 17 Apr, 2024 1 commit
-
-
Steven Liu authored
* pipelines * schedulers and models * community pipelines * feedback
-