Unverified Commit 1c47d1fc authored by Fabio Rigano, committed by GitHub

Fix head_to_batch_dim for IPAdapterAttnProcessor (#7077)

* Fix IPAdapterAttnProcessor

* Fix batch_to_head_dim and revert reshape
parent bbf70c87
@@ -559,12 +559,16 @@ class Attention(nn.Module):
             `torch.Tensor`: The reshaped tensor.
         """
         head_size = self.heads
-        batch_size, seq_len, dim = tensor.shape
-        tensor = tensor.reshape(batch_size, seq_len, head_size, dim // head_size)
+        if tensor.ndim == 3:
+            batch_size, seq_len, dim = tensor.shape
+            extra_dim = 1
+        else:
+            batch_size, extra_dim, seq_len, dim = tensor.shape
+        tensor = tensor.reshape(batch_size, seq_len * extra_dim, head_size, dim // head_size)
         tensor = tensor.permute(0, 2, 1, 3)
         if out_dim == 3:
-            tensor = tensor.reshape(batch_size * head_size, seq_len, dim // head_size)
+            tensor = tensor.reshape(batch_size * head_size, seq_len * extra_dim, dim // head_size)
         return tensor
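For context, a minimal standalone sketch of the patched reshape logic. The free function head_to_batch_dim, the heads value of 8, and the example shapes below are illustrative assumptions for this sketch only; in diffusers this logic is a method on Attention, and per the commit title the 4-D case (the extra_dim branch) arises from inputs passed by IPAdapterAttnProcessor.

import torch

def head_to_batch_dim(tensor: torch.Tensor, heads: int, out_dim: int = 3) -> torch.Tensor:
    # Mirrors the patched method: accepts 3-D [batch, seq, dim] or
    # 4-D [batch, extra, seq, dim] inputs and splits `dim` across heads.
    head_size = heads
    if tensor.ndim == 3:
        batch_size, seq_len, dim = tensor.shape
        extra_dim = 1
    else:
        # The extra dimension is folded into the sequence dimension.
        batch_size, extra_dim, seq_len, dim = tensor.shape
    tensor = tensor.reshape(batch_size, seq_len * extra_dim, head_size, dim // head_size)
    tensor = tensor.permute(0, 2, 1, 3)
    if out_dim == 3:
        tensor = tensor.reshape(batch_size * head_size, seq_len * extra_dim, dim // head_size)
    return tensor

# 3-D input (the pre-existing path): [2, 77, 320] with 8 heads -> [16, 77, 40]
print(head_to_batch_dim(torch.randn(2, 77, 320), heads=8).shape)

# 4-D input (the case this commit fixes): [2, 4, 77, 320] -> [16, 308, 40]
# Before the fix, unpacking tensor.shape into three names raised a ValueError here.
print(head_to_batch_dim(torch.randn(2, 4, 77, 320), heads=8).shape)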