Unverified Commit 9f3c0fdc authored by Pavle Padjin, committed by GitHub

Avoiding graph break by changing the way we infer dtype in vae.decoder (#12512)

* Changing the way we infer dtype to avoid forcing evaluation of lazy tensors

* Changing the way we infer dtype to ensure type consistency

* More robust inferring of dtype

* Removing the upscale dtype entirely
parent 84e16575
@@ -286,11 +286,9 @@ class Decoder(nn.Module):
         sample = self.conv_in(sample)

-        upscale_dtype = next(iter(self.up_blocks.parameters())).dtype
         if torch.is_grad_enabled() and self.gradient_checkpointing:
             # middle
             sample = self._gradient_checkpointing_func(self.mid_block, sample, latent_embeds)
-            sample = sample.to(upscale_dtype)

             # up
             for up_block in self.up_blocks:
@@ -298,7 +296,6 @@ class Decoder(nn.Module):
         else:
             # middle
             sample = self.mid_block(sample, latent_embeds)
-            sample = sample.to(upscale_dtype)

             # up
             for up_block in self.up_blocks:
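To illustrate the change, here is a minimal sketch (a hypothetical `TinyDecoder`, not the actual diffusers class) of the pattern being removed: reading a dtype out of `up_blocks` parameters forces materialization of lazy/fake tensors, which can break the graph under `torch.compile`. Since `mid_block`'s output already shares a dtype with the `up_blocks` parameters in normal use, the explicit cast can simply be dropped.

```python
# Minimal sketch of the patched pattern; TinyDecoder is an illustrative
# stand-in for diffusers' Decoder, not the real implementation.
import torch
import torch.nn as nn


class TinyDecoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.mid_block = nn.Linear(4, 4)
        self.up_blocks = nn.ModuleList([nn.Linear(4, 4)])

    def forward(self, sample):
        # Removed pattern: reading a parameter just for its dtype forces
        # evaluation of lazy tensors and can cause a graph break:
        #   upscale_dtype = next(iter(self.up_blocks.parameters())).dtype
        #   sample = sample.to(upscale_dtype)
        sample = self.mid_block(sample)
        # The up blocks receive the sample in its existing dtype, which is
        # already consistent with their parameters.
        for up_block in self.up_blocks:
            sample = up_block(sample)
        return sample


decoder = TinyDecoder()
out = decoder(torch.randn(2, 4))
```

The sample's dtype flows through unchanged, so no extra cast (and no parameter read) is needed on the hot path.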