update flash_attn.py

Commit 58042164 authored May 30, 2025 by zhuwenwen

Showing 1 changed file with 0 additions and 1 deletion

vllm/attention/backends/flash_attn.py +0 -1

--- a/vllm/attention/backends/flash_attn.py
+++ b/vllm/attention/backends/flash_attn.py
@@ -977,7 +977,6 @@ class FlashAttentionImpl(AttentionImpl):
                        v_descale=layer._v_scale.expand(descale_shape),
                    )
                else:
-                    decode_output = decode_output.unsqueeze(1)
                    decode_output = flash_attn_with_kvcache(
                        q=decode_query.unsqueeze(1),
                        k_cache=key_cache,