OpenDAS / vllm_cscc
Commit d4aa1443 (Unverified)
Authored Oct 29, 2025 by Nick Hill; committed by GitHub on Oct 29, 2025
[BugFix] Fix handling of resumed reqs in `SharedStorageConnector` (#27719)
Signed-off-by: Nick Hill <nhill@redhat.com>
Parent: fcb1d570
Changes: 1 changed file with 24 additions and 26 deletions
vllm/distributed/kv_transfer/kv_connector/v1/shared_storage_connector.py
@@ -336,16 +336,14 @@ class SharedStorageConnector(KVConnectorBase_V1):
         cached_reqs = scheduler_output.scheduled_cached_reqs
         for i, req_id in enumerate(cached_reqs.req_ids):
+            resumed_from_preemption = cached_reqs.resumed_from_preemption[i]
+            if not resumed_from_preemption or req_id not in self._requests_need_load:
+                continue
             num_computed_tokens = cached_reqs.num_computed_tokens[i]
             num_new_tokens = scheduler_output.num_scheduled_tokens[req_id]
             new_block_ids = cached_reqs.new_block_ids[i]
-            resumed_from_preemption = cached_reqs.resumed_from_preemption[i]
-            # NOTE(rob): here we rely on the resumed requests being
-            # the first N requests in the list scheduled_cache_reqs.
-            if not resumed_from_preemption:
-                break
-            if req_id in self._requests_need_load:
             # NOTE(rob): cached_req_data does not have the full
             # list of token ids (only new tokens). So we look it
             # up in the actual request object.
...
...
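The behavioral difference in the hunk above can be illustrated with a minimal, self-contained sketch. It is not the connector's code: `CachedReqs`, `select_with_break`, and `select_with_continue` are hypothetical names introduced only for illustration, and the sample data assumes a batch where a non-resumed request appears before the resumed ones. Under that assumption, the old `break` stops at the first non-resumed request and misses the later resumed requests, while the new `continue` skips only the entries that do not qualify.

```python
from dataclasses import dataclass


@dataclass
class CachedReqs:
    """Hypothetical stand-in for scheduler_output.scheduled_cached_reqs."""
    req_ids: list[str]
    resumed_from_preemption: list[bool]


def select_with_break(cached_reqs, requests_need_load):
    """Old pattern: assumes resumed requests are the first N entries."""
    selected = []
    for i, req_id in enumerate(cached_reqs.req_ids):
        if not cached_reqs.resumed_from_preemption[i]:
            break  # stops the scan at the first non-resumed request
        if req_id in requests_need_load:
            selected.append(req_id)
    return selected


def select_with_continue(cached_reqs, requests_need_load):
    """New pattern: checks every entry, regardless of ordering."""
    selected = []
    for i, req_id in enumerate(cached_reqs.req_ids):
        resumed = cached_reqs.resumed_from_preemption[i]
        if not resumed or req_id not in requests_need_load:
            continue  # skips only this entry
        selected.append(req_id)
    return selected


# Assumed ordering: a non-resumed request comes first.
cached = CachedReqs(
    req_ids=["a", "b", "c"],
    resumed_from_preemption=[False, True, True],
)
need_load = {"b", "c"}

old_result = select_with_break(cached, need_load)     # []
new_result = select_with_continue(cached, need_load)  # ["b", "c"]
```

When the resumed requests do come first, both helpers return the same list, so the `continue` form only changes the outcome for orderings the old comment ruled out.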