Commit e619db24 authored by Pedro Cuenca, committed by GitHub

mps cross-attention hack: don't crash on fp16 (#2258)

* mps cross-attention hack: don't crash on fp16

* Make conversion explicit.
parent 111228cb
...
@@ -251,7 +251,7 @@ class CrossAttention(nn.Module):
             # HACK: MPS: Does not support padding by greater than dimension of input tensor.
             # Instead, we can manually construct the padding tensor.
             padding_shape = (attention_mask.shape[0], attention_mask.shape[1], target_length)
-            padding = torch.zeros(padding_shape, device=attention_mask.device)
+            padding = torch.zeros(padding_shape, dtype=attention_mask.dtype, device=attention_mask.device)
             attention_mask = torch.concat([attention_mask, padding], dim=2)
         else:
             attention_mask = F.pad(attention_mask, (0, target_length), value=0.0)
...
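
For context, here is a minimal standalone sketch of the same padding construction, runnable outside diffusers. The mask shape and target_length value are made up for illustration; the point of the change is that torch.zeros defaults to float32, so the padding must be created with the mask's dtype (float16 under fp16 inference) for the concatenation not to crash on MPS.

import torch

# Hypothetical half-precision attention mask: (batch, heads, current_length).
attention_mask = torch.zeros(2, 8, 64, dtype=torch.float16)
target_length = 13  # hypothetical number of positions to pad on the last dim

# Build the padding manually instead of using F.pad (MPS cannot pad by more
# than the input tensor's dimension), matching both dtype and device of the
# mask so the concat does not mix float16 with float32.
padding_shape = (attention_mask.shape[0], attention_mask.shape[1], target_length)
padding = torch.zeros(padding_shape, dtype=attention_mask.dtype, device=attention_mask.device)
attention_mask = torch.concat([attention_mask, padding], dim=2)

assert attention_mask.dtype == torch.float16
assert attention_mask.shape == (2, 8, 64 + 13)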