Unverified commit 1ae9b059, authored by Max Podkorytov and committed by GitHub

Fix enabling memory-efficient attention on ROCm (#10564)

* Fix enabling memory-efficient attention on ROCm while calling the CK implementation

* Update attention_processor.py: refactor how an element is picked from a set
parent aad69ac2
@@ -405,11 +405,12 @@ class Attention(nn.Module):
             else:
                 try:
                     # Make sure we can run the memory efficient attention
-                    _ = xformers.ops.memory_efficient_attention(
-                        torch.randn((1, 2, 40), device="cuda"),
-                        torch.randn((1, 2, 40), device="cuda"),
-                        torch.randn((1, 2, 40), device="cuda"),
-                    )
+                    dtype = None
+                    if attention_op is not None:
+                        op_fw, op_bw = attention_op
+                        dtype, *_ = op_fw.SUPPORTED_DTYPES
+                    q = torch.randn((1, 2, 40), device="cuda", dtype=dtype)
+                    _ = xformers.ops.memory_efficient_attention(q, q, q)
                 except Exception as e:
                     raise e
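
For context, a minimal standalone sketch of the new probing logic (not part of the commit): on ROCm, xformers routes memory-efficient attention to the Composable Kernel (CK) backend, whose forward op does not support float32, so probing with default-dtype tensors fails. The fix reads a supported dtype off the forward op before building the probe tensor. `SUPPORTED_DTYPES` is a set, hence the star-unpacking to grab an arbitrary element (the "picking a set element" refactor mentioned in the commit message). The flash-attention op pair below is an illustrative stand-in for whatever `attention_op` a caller passes; a CUDA/ROCm device is assumed.

```python
import torch
import xformers.ops

# Illustrative stand-in for the optional (forward, backward) op pair that
# diffusers accepts as `attention_op`; on ROCm builds the analogous CK ops
# would appear here instead.
attention_op = xformers.ops.MemoryEfficientAttentionFlashAttentionOp

dtype = None
if attention_op is not None:
    op_fw, op_bw = attention_op
    # SUPPORTED_DTYPES is a set, so it cannot be indexed; star-unpacking
    # picks an arbitrary dtype the forward op actually supports.
    dtype, *_ = op_fw.SUPPORTED_DTYPES

# Probe with a tensor the chosen op can handle. With dtype=None the probe
# falls back to torch's default (float32), which the CK backend on ROCm
# rejects -- the failure this commit fixes.
q = torch.randn((1, 2, 40), device="cuda", dtype=dtype)
_ = xformers.ops.memory_efficient_attention(q, q, q)
```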