OpenDAS / vllm_cscc
Commit 6d4f9d3a (Unverified)
Authored Feb 27, 2026 by haosdent
Committed by GitHub, Feb 27, 2026
[Bugfix] Fix DCP + FA3 crash due to missing num_splits in _forward_with_dcp (#35082)
Signed-off-by: haosdent <haosdent@gmail.com>
Parent: fbe3f012
Showing 1 changed file with 2 additions and 0 deletions
vllm/v1/attention/backends/flash_attn.py (+2, -0)
@@ -847,6 +847,7 @@ class FlashAttentionImpl(AttentionImpl):
                 q_descale=q_descale,
                 k_descale=k_descale,
                 v_descale=v_descale,
+                num_splits=attn_metadata.max_num_splits,
             )
             # FA returns LSE in shape [ H, B ] but cp_lse_ag_out_rs wants [ B, H ]
             context_attn_out_cor, context_lse_cor = cp_lse_ag_out_rs(
@@ -876,6 +877,7 @@ class FlashAttentionImpl(AttentionImpl):
             q_descale=q_descale,
             k_descale=k_descale,
             v_descale=v_descale,
+            num_splits=attn_metadata.max_num_splits,
         )
         assert context_attn_out_cor.shape == query_attn_out.shape
         assert context_lse_cor.shape == query_lse.shape
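The code comment in the first hunk notes a layout mismatch: the attention call returns the log-sum-exp (LSE) in shape [H, B], while cp_lse_ag_out_rs expects [B, H]. The following is a minimal pure-Python sketch of that layout change, using nested lists in place of tensors. The helper name `transpose_lse` and the sample values are hypothetical and are not part of vLLM; the real code presumably performs the equivalent transpose on a tensor.

```python
def transpose_lse(lse_hb):
    """Convert a nested-list LSE from [H, B] layout to [B, H] layout.

    Hypothetical illustration only: plain lists stand in for tensors.
    """
    num_heads = len(lse_hb)
    num_rows = len(lse_hb[0]) if num_heads else 0
    # Element (h, b) of the input becomes element (b, h) of the output.
    return [[lse_hb[h][b] for h in range(num_heads)] for b in range(num_rows)]


# Sample input with H = 2 heads and B = 3 rows (made-up values).
lse_hb = [
    [0.1, 0.2, 0.3],  # head 0
    [1.1, 1.2, 1.3],  # head 1
]
lse_bh = transpose_lse(lse_hb)
```

After the call, `lse_bh` has 3 rows of 2 values each, so `lse_bh[b][h]` equals `lse_hb[h][b]` for every index pair.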