Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
808a9ed9
Commit
808a9ed9
authored
Feb 26, 2026
by
jujl1
Browse files
fix: zero overhead KERNEL VMFault
parent
fa3bae2e
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
2 deletions
+3
-2
vllm/zero_overhead/v1/eagle.py
vllm/zero_overhead/v1/eagle.py
+3
-2
No files found.
vllm/zero_overhead/v1/eagle.py
View file @
808a9ed9
...
...
@@ -154,10 +154,11 @@ class V1ZeroEagleProposer(EagleProposer):
self
.
attn_metadata_cudagraph
.
decode
.
seq_lens
[:
attn_metadata
.
num_decode_tokens
]
=
(
attn_metadata
.
decode
.
seq_lens
)
use_ep
=
self
.
vllm_config
.
parallel_config
.
enable_expert_parallel
with
set_forward_context
(
per_layer_attn_metadata
,
self
.
vllm_config
,
num_tokens
=
num_input_tokens
,
):
#
skip_cuda_graphs=not decoding):
num_tokens
=
num_input_tokens
,
skip_cuda_graphs
=
not
(
decoding
or
use_ep
)
):
ret_hidden_states
=
self
.
model
(
self
.
input_ids
[:
num_input_tokens
],
self
.
positions
[:
num_input_tokens
],
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment