OpenDAS / vllm_cscc
Commit 3344fbec
Authored Nov 25, 2025 by lizhigong
Add receiving of MTP token IDs and req_ids on PP rank 0 from PP rank 1
Parent: a183111e

Showing 1 changed file with 7 additions and 0 deletions

vllm/zero_overhead/v1/PP2mtp/gpu_model_runner.py (+7 -0)
@@ -401,6 +401,13 @@ class V1ZeroModelRunner(GPUModelRunner):
            return self.kv_connector_no_forward(scheduler_output)
        if get_pp_group().is_first_rank and self.last_draft_token_ids != None:
            recv_draft_dict = get_pp_group().recv_tensor_dict(all_gather_group=get_tp_group())
            self.last_draft_token_ids = recv_draft_dict['draft_token_ids']
            self.last_sampled_req_ids = recv_draft_dict['sampled_req_ids ']
        # Prepare the decoder inputs.
        (attn_metadata, attention_cuda_graphs, logits_indices,
         spec_decode_metadata,
...
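The receive path in this hunk can be illustrated in isolation. The following is a minimal sketch, not the code from this commit: `FakePPGroup`, `last_rank_publish`, and `first_rank_receive` are hypothetical names, and the in-process mailbox only stands in for the real pipeline-parallel group (it does not block and moves plain Python lists rather than tensors). It assumes the last rank sends a dict whose keys are exactly `draft_token_ids` and `sampled_req_ids`, and it uses `is not None` for the guard.

```python
from collections import deque
from typing import Any, Deque, Dict, List, Optional


class FakePPGroup:
    """Hypothetical in-process stand-in for a pipeline-parallel group."""

    def __init__(self) -> None:
        self._mailbox: Deque[Dict[str, Any]] = deque()

    def send_tensor_dict(self, tensor_dict: Dict[str, Any]) -> None:
        # Copy the dict so later changes on the sender side are not visible.
        self._mailbox.append(dict(tensor_dict))

    def recv_tensor_dict(self) -> Optional[Dict[str, Any]]:
        # Unlike a real receive, this returns None instead of blocking.
        return self._mailbox.popleft() if self._mailbox else None


def last_rank_publish(group: FakePPGroup,
                      draft_token_ids: List[List[int]],
                      sampled_req_ids: List[str]) -> None:
    """Sender side (assumed): package drafts and request IDs under fixed keys."""
    group.send_tensor_dict({
        "draft_token_ids": draft_token_ids,
        "sampled_req_ids": sampled_req_ids,
    })


def first_rank_receive(group: FakePPGroup,
                       state: Dict[str, Any]) -> Dict[str, Any]:
    """Receiver side: refresh cached drafts only if a previous step set them."""
    if state.get("last_draft_token_ids") is None:
        return state
    received = group.recv_tensor_dict()
    if received is None:
        return state
    # The lookup keys must match the sender's keys character for character.
    state["last_draft_token_ids"] = received["draft_token_ids"]
    state["last_sampled_req_ids"] = received["sampled_req_ids"]
    return state


group = FakePPGroup()
last_rank_publish(group, [[11, 12], [21, 22]], ["req-a", "req-b"])
state = {"last_draft_token_ids": [[0, 0]], "last_sampled_req_ids": ["old"]}
state = first_rank_receive(group, state)
print(state["last_sampled_req_ids"])
```

Because the payload is an ordinary dict lookup on the receiving side, a key that differs by even one character, such as a trailing space inside the quotes, would raise `KeyError` unless the sender uses the identical string.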