Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
fc67613a
Commit
fc67613a
authored
Apr 18, 2026
by
zhuwenwen
Browse files
Merge tag 'v0.19.1' into v0.19.0
parents
31aec25b
b1388b1f
Changes
82
Expand all
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
10 additions
and
3 deletions
+10
-3
vllm/v1/core/single_type_kv_cache_manager.py
vllm/v1/core/single_type_kv_cache_manager.py
+9
-3
vllm/v1/spec_decode/eagle.py
vllm/v1/spec_decode/eagle.py
+1
-0
No files found.
vllm/v1/core/single_type_kv_cache_manager.py
View file @
fc67613a
...
@@ -565,11 +565,17 @@ class SlidingWindowManager(SingleTypeKVCacheManager):
...
@@ -565,11 +565,17 @@ class SlidingWindowManager(SingleTypeKVCacheManager):
for
computed
in
computed_blocks
:
for
computed
in
computed_blocks
:
computed
.
pop
()
computed
.
pop
()
if
use_eagle
and
computed_blocks
[
0
]:
if
use_eagle
and
computed_blocks
[
0
]:
assert
kv_cache_spec
.
block_size
==
alignment_tokens
,
(
"aligned_length is not compatible with eagle now"
)
for
computed
in
computed_blocks
:
for
computed
in
computed_blocks
:
computed
.
pop
()
computed
.
pop
()
# Re-align after eagle pop: the pop may break the alignment
# when block_size != alignment_tokens (hybrid models with
# different page sizes, e.g. Gemma4).
while
(
block_size
!=
alignment_tokens
and
len
(
computed_blocks
[
0
])
*
block_size
%
alignment_tokens
!=
0
):
for
computed
in
computed_blocks
:
computed
.
pop
()
return
computed_blocks
return
computed_blocks
def
get_num_skipped_tokens
(
self
,
num_computed_tokens
:
int
)
->
int
:
def
get_num_skipped_tokens
(
self
,
num_computed_tokens
:
int
)
->
int
:
...
...
vllm/v1/spec_decode/eagle.py
View file @
fc67613a
This diff is collapsed.
Click to expand it.
Prev
1
2
3
4
5
Next
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment