Unverified Commit 5adb0a7b authored by Suraj Patil, committed by GitHub

use torch.matmul instead of einsum in attention. (#445)

* use torch.matmul instead of einsum

* fix softmax
parent b2b3b1a8
...
@@ -275,11 +275,9 @@ class CrossAttention(nn.Module):
         for i in range(hidden_states.shape[0] // slice_size):
             start_idx = i * slice_size
             end_idx = (i + 1) * slice_size
-            attn_slice = (
-                torch.einsum("b i d, b j d -> b i j", query[start_idx:end_idx], key[start_idx:end_idx]) * self.scale
-            )
+            attn_slice = torch.matmul(query[start_idx:end_idx], key[start_idx:end_idx].transpose(1, 2)) * self.scale
             attn_slice = attn_slice.softmax(dim=-1)
-            attn_slice = torch.einsum("b i j, b j d -> b i d", attn_slice, value[start_idx:end_idx])
+            attn_slice = torch.matmul(attn_slice, value[start_idx:end_idx])
             hidden_states[start_idx:end_idx] = attn_slice
...
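For reference, a minimal sketch (not part of the commit) checking that the new matmul formulation matches the einsum it replaces. The tensor shapes and scale are assumed to follow the sliced query/key/value layout of (batch * heads, seq_len, head_dim) used in the diff above; the variable names here are illustrative only.

import torch

# Assumed shapes: (batch * heads, seq_len, head_dim), mirroring one slice
# of query/key/value in the attention loop shown in the diff.
q = torch.randn(2, 8, 64)
k = torch.randn(2, 8, 64)
v = torch.randn(2, 8, 64)
scale = 64 ** -0.5

# Old formulation: einsum contracting over the shared head_dim axis.
attn_einsum = torch.einsum("b i d, b j d -> b i j", q, k) * scale
out_einsum = torch.einsum("b i j, b j d -> b i d", attn_einsum.softmax(dim=-1), v)

# New formulation: plain batched matmul with an explicit transpose of key.
attn_matmul = torch.matmul(q, k.transpose(1, 2)) * scale
out_matmul = torch.matmul(attn_matmul.softmax(dim=-1), v)

# Both paths compute the same scaled dot-product attention output.
assert torch.allclose(out_einsum, out_matmul, atol=1e-6)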