Refer to [this](https://huggingface.co/collections/Efficient-Large-Model/sana-673efba2a57ed99843f11f9e) collection for more information.
Note: The recommended dtype mentioned above applies to the transformer weights. The text encoder and VAE weights must stay in `torch.bfloat16` or `torch.float32` for the model to work correctly. Please refer to the inference example below to see how to load the model with the recommended dtype.
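As a minimal sketch of this pattern (the checkpoint id is only illustrative; see the full inference example below for the exact checkpoint to use), the transformer can be loaded in its recommended dtype while the text encoder and VAE are kept in `torch.bfloat16`:

```python
import torch
from diffusers import SanaPipeline

# Load the pipeline with the transformer weights in the recommended dtype.
pipe = SanaPipeline.from_pretrained(
    "Efficient-Large-Model/Sana_600M_1024px_diffusers",  # illustrative checkpoint id
    variant="fp16",
    torch_dtype=torch.float16,
)

# Keep the text encoder and VAE in torch.bfloat16 (or torch.float32).
pipe.text_encoder.to(torch.bfloat16)
pipe.vae.to(torch.bfloat16)
pipe.to("cuda")

image = pipe(prompt="a cyberpunk cat with a neon sign that says Sana").images[0]
image.save("sana.png")
```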
<Tip>
Make sure to pass the `variant` argument when downloading checkpoints to save disk space. Set it to `"fp16"` for models whose recommended dtype is `torch.float16`, and `"bf16"` for models whose recommended dtype is `torch.bfloat16`. By default, `torch.float32` weights are downloaded, which take up twice as much disk space. Additionally, `torch.float32` weights can be downcast on the fly by specifying the `torch_dtype` argument. Read about it in the [docs](https://huggingface.co/docs/diffusers/v0.31.0/en/api/pipelines/overview#diffusers.DiffusionPipeline.from_pretrained). A short sketch of both options is shown below.
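For example, a checkpoint with recommended dtype `torch.bfloat16` could be fetched in either of these two ways (a sketch; the checkpoint id is illustrative):

```python
import torch
from diffusers import SanaPipeline

# Passing `variant` downloads the bf16 weight files directly,
# using half the disk space of the default fp32 files.
pipe = SanaPipeline.from_pretrained(
    "Efficient-Large-Model/Sana_1600M_1024px_BF16_diffusers",  # illustrative checkpoint id
    variant="bf16",
    torch_dtype=torch.bfloat16,
)

# Omitting `variant` downloads the fp32 weight files and
# downcasts them on the fly via `torch_dtype`.
pipe_from_fp32 = SanaPipeline.from_pretrained(
    "Efficient-Large-Model/Sana_1600M_1024px_BF16_diffusers",  # illustrative checkpoint id
    torch_dtype=torch.bfloat16,
)
```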