Fixes LoRAXFormersCrossAttnProcessor (#2207)

Related to #2124. The current implementation throws a shape mismatch error. That makes sense: compared to `XFormersCrossAttnProcessor` and `LoRACrossAttnProcessor`, the `attn.batch_to_head_dim(hidden_states)` call after the attention op is missing. I don't have formal tests, but I compared `LoRACrossAttnProcessor` and `LoRAXFormersCrossAttnProcessor` ad hoc, and they produce the same results with this fix.

Commit 58c416ab (unverified), authored Feb 03, 2023 by Jorge C. Gomes, committed by GitHub Feb 03, 2023

Showing with 1 addition and 0 deletions

src/diffusers/models/cross_attention.py +1 -0

--- a/src/diffusers/models/cross_attention.py
+++ b/src/diffusers/models/cross_attention.py
@@ -431,6 +431,7 @@ class LoRAXFormersCrossAttnProcessor(nn.Module):
        value = attn.head_to_batch_dim(value).contiguous()

        hidden_states = xformers.ops.memory_efficient_attention(query, key, value, attn_bias=attention_mask)
+        hidden_states = attn.batch_to_head_dim(hidden_states)

        # linear proj
        hidden_states = attn.to_out[0](hidden_states) + scale * self.to_out_lora(hidden_states)
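
The shape mismatch that this one-line change addresses can be illustrated with a small standalone sketch. It uses NumPy rather than PyTorch, and the two helper functions are hypothetical stand-ins written for this example; they assume that `head_to_batch_dim` folds the heads into the batch axis and that `batch_to_head_dim` undoes that, which is what the names and the diff suggest, but they are not the library's code. The weight matrix `w_out` is likewise only a placeholder for the `attn.to_out[0]` projection.

```python
import numpy as np


def head_to_batch_dim(x, heads):
    """(batch, seq_len, heads * head_dim) -> (batch * heads, seq_len, head_dim)."""
    batch, seq_len, dim = x.shape
    x = x.reshape(batch, seq_len, heads, dim // heads)
    return x.transpose(0, 2, 1, 3).reshape(batch * heads, seq_len, dim // heads)


def batch_to_head_dim(x, heads):
    """(batch * heads, seq_len, head_dim) -> (batch, seq_len, heads * head_dim)."""
    batch_heads, seq_len, head_dim = x.shape
    x = x.reshape(batch_heads // heads, heads, seq_len, head_dim)
    return x.transpose(0, 2, 1, 3).reshape(batch_heads // heads, seq_len, heads * head_dim)


batch, seq_len, heads, head_dim = 2, 5, 4, 8
inner_dim = heads * head_dim

rng = np.random.default_rng(0)
hidden = rng.standard_normal((batch, seq_len, inner_dim))
# Placeholder for the output projection, which expects inner_dim input features.
w_out = rng.standard_normal((inner_dim, inner_dim))

# The attention op returns a tensor in the per-head layout; the per-head
# tensor itself is used here as a stand-in for that output.
per_head = head_to_batch_dim(hidden, heads)

# Without merging the heads back, the projection sees head_dim features
# instead of inner_dim, so the matrix product fails.
try:
    per_head @ w_out
    mismatch = False
except ValueError:
    mismatch = True

# With the heads merged back, the projection works again.
merged = batch_to_head_dim(per_head, heads)
projected = merged @ w_out
```

In this sketch the per-head tensor has `head_dim` features on its last axis while the projection expects `heads * head_dim`, which is the kind of mismatch the commit message describes; merging the heads back first restores the expected layout.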