Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
76e22965
Commit
76e22965
authored
Dec 31, 2025
by
jujl1
Browse files
feat: 兼容MTP零消耗和主模型+MTP零消耗(VLLM_ZERO_OVERHEAD_ENHANCE=1)开启
parent
8a413453
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
1 deletion
+2
-1
vllm/zero_overhead/v1/gpu_model_runner.py
vllm/zero_overhead/v1/gpu_model_runner.py
+2
-1
No files found.
vllm/zero_overhead/v1/gpu_model_runner.py
View file @
76e22965
...
...
@@ -703,7 +703,8 @@ class V1ZeroModelRunner(GPUModelRunner):
is_output_valid
=
False
# Get the valid generated tokens.
sampled_token_ids
=
sampler_output
.
sampled_token_ids
over_head_enhance
=
envs
.
VLLM_ZERO_OVERHEAD_ENHANCE
and
self
.
speculative_config
over_head_enhance
=
(
envs
.
VLLM_ZERO_OVERHEAD_ENHANCE
and
self
.
speculative_config
is
not
None
)
if
over_head_enhance
:
# if not self.speculative_config:
# self.fix_req_ids = self.last_sampled_req_ids
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment