Unverified Commit b2cfc7a0 authored by Nouamane Tazi's avatar Nouamane Tazi Committed by GitHub
Browse files

Fix slow tests (#689)

* revert using baddbmm in attention
- to fix `test_stable_diffusion_memory_chunking` test

* styling
parent 552b9670
...@@ -274,13 +274,8 @@ class CrossAttention(nn.Module): ...@@ -274,13 +274,8 @@ class CrossAttention(nn.Module):
return self.to_out(hidden_states) return self.to_out(hidden_states)
def _attention(self, query, key, value): def _attention(self, query, key, value):
attention_scores = torch.baddbmm( # TODO: use baddbmm for better performance
torch.empty(query.shape[0], query.shape[1], key.shape[1], dtype=query.dtype, device=query.device), attention_scores = torch.matmul(query, key.transpose(-1, -2)) * self.scale
query,
key.transpose(-1, -2),
beta=0,
alpha=self.scale,
)
attention_probs = attention_scores.softmax(dim=-1) attention_probs = attention_scores.softmax(dim=-1)
# compute attention output # compute attention output
hidden_states = torch.matmul(attention_probs, value) hidden_states = torch.matmul(attention_probs, value)
......
Markdown is supported
Attach a file by dragging & dropping or click to upload.
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or sign in to comment