OpenDAS / vllm_cscc
Commit 006aea17 (Unverified)
Authored Mar 09, 2026 by Woosuk Kwon
Committed by GitHub, Mar 09, 2026

[BugFix] Remove incorrect assert in split_decodes_and_prefills (#36553)

Signed-off-by: Woosuk Kwon <woosuk@inferact.ai>

Parent: 0836be3b

Showing 1 changed file with 0 additions and 1 deletion

vllm/v1/attention/backends/utils.py (+0 -1)

@@ -528,7 +528,6 @@ def split_decodes_and_prefills(
         # requests may have a query length of 0 but since they are padding its fine
         # to treat them as decodes (ensures num_decodes matches the captured size)
         if torch.all((query_lens == query_lens[0]) | (query_lens == 0)):
-            assert num_reqs * query_lens[0] == num_tokens, "tokens not padded correctly"
             return num_reqs, 0, num_tokens, 0  # all decodes
         is_prefill = query_lens != query_lens[0]
     else:
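
The commit message does not say why the assert was incorrect. One possible reading, offered as an assumption rather than the author's stated rationale: the `if` condition deliberately accepts padding requests with a query length of 0, yet the deleted assert multiplies `num_reqs` by `query_lens[0]` as though every request carried that many tokens. The sketch below mirrors both checks on plain Python lists instead of torch tensors. The helper names are hypothetical, and treating `num_tokens` as the sum of the query lengths is also an assumption.

```python
def is_padded_uniform(query_lens):
    """Plain-list mirror of the torch.all(...) condition shown in the diff."""
    first = query_lens[0]
    return all(q == first or q == 0 for q in query_lens)


def removed_assert_holds(query_lens):
    """Evaluate the condition of the assert that this commit deletes."""
    num_reqs = len(query_lens)
    # Assumption: num_tokens counts only real query tokens, so zero-length
    # padding requests contribute nothing to it.
    num_tokens = sum(query_lens)
    return num_reqs * query_lens[0] == num_tokens


uniform = [2, 2, 2, 2]  # every request has the same query length
padded = [2, 2, 0, 0]   # two real requests followed by two padding requests

print(is_padded_uniform(uniform), removed_assert_holds(uniform))  # True True
print(is_padded_uniform(padded), removed_assert_holds(padded))    # True False
```

Under these assumptions the padded batch passes the uniformity check and still trips the assert, because 4 * 2 is 8 while only 4 tokens are present. Whether this is the case that motivated the fix is not confirmed by the commit page.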