OpenDAS / vllm_cscc
Commit 37d2b35d, authored Dec 31, 2025 by 王敏
[feat] Fix hang in deepep auto mode when mtp > 1
Parent: cb64c6bc
Showing 1 changed file with 5 additions and 5 deletions
vllm/v1/spec_decode/eagle.py
@@ -562,15 +562,15 @@ class EagleProposer:
         )
         if self.dp_size > 1 and self.enable_expert_parallel and self.num_speculative_tokens > 1:
+            num_tokens = 1
             for _ in range(self.num_speculative_tokens - 1):
                 with set_forward_context(attn_metadata, self.vllm_config,
-                                         num_tokens=num_input_tokens,
-                                         num_tokens_across_dp=num_tokens_across_dp):
+                                         num_tokens=num_tokens,):
                     self.model(
-                        self.input_ids[:num_input_tokens],
-                        self.positions[:num_input_tokens],
-                        self.hidden_states[:num_input_tokens],
+                        self.input_ids[:num_tokens],
+                        self.positions[:num_tokens],
+                        self.hidden_states[:num_tokens],
                     )
     def validate_same_kv_cache_group(self,
...
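As a reading aid, the control flow of the patched branch can be sketched in plain Python. This is a minimal sketch under stated assumptions, not the vLLM implementation: `FakeProposer`, `fake_model`, `extra_steps`, and the list-based buffers are hypothetical stand-ins, and `set_forward_context` is replaced by a dummy context manager (`fake_forward_context`) that only records the `num_tokens` it receives. It illustrates what the hunk shows: when `dp_size > 1`, expert parallelism is enabled, and `num_speculative_tokens > 1`, the model is invoked `num_speculative_tokens - 1` times on buffers sliced to a single token.

```python
from contextlib import contextmanager

# Hypothetical stand-in for vLLM's set_forward_context; it only records
# the num_tokens value it was given so the sketch can be inspected.
recorded_num_tokens = []


@contextmanager
def fake_forward_context(num_tokens):
    recorded_num_tokens.append(num_tokens)
    yield


class FakeProposer:
    """Toy model of the branch touched by the hunk (all names are assumptions)."""

    def __init__(self, dp_size, enable_expert_parallel, num_speculative_tokens):
        self.dp_size = dp_size
        self.enable_expert_parallel = enable_expert_parallel
        self.num_speculative_tokens = num_speculative_tokens
        # Preallocated buffers, larger than any single step needs.
        self.input_ids = list(range(8))
        self.positions = list(range(8))
        self.hidden_states = [0.0] * 8
        self.calls = []  # lengths of the slices passed to the model

    def fake_model(self, input_ids, positions, hidden_states):
        # Record slice lengths instead of running a real forward pass.
        self.calls.append((len(input_ids), len(positions), len(hidden_states)))

    def extra_steps(self):
        if (self.dp_size > 1 and self.enable_expert_parallel
                and self.num_speculative_tokens > 1):
            num_tokens = 1
            for _ in range(self.num_speculative_tokens - 1):
                # Context size and slice length use the same value.
                with fake_forward_context(num_tokens=num_tokens):
                    self.fake_model(
                        self.input_ids[:num_tokens],
                        self.positions[:num_tokens],
                        self.hidden_states[:num_tokens],
                    )


proposer = FakeProposer(dp_size=2, enable_expert_parallel=True,
                        num_speculative_tokens=3)
proposer.extra_steps()
print(proposer.calls)
```

In this sketch the value passed to the forward context and the slice length are the same `num_tokens`, which mirrors the new side of the hunk; the old side took them from `num_input_tokens` and `num_tokens_across_dp` instead.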