Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
24b7fb45
Unverified
Commit
24b7fb45
authored
Apr 01, 2025
by
Ekagra Ranjan
Committed by
GitHub
Apr 01, 2025
Browse files
[Spec Decode] Fix input triton kernel for eagle (#15909)
parent
58f5a597
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
2 deletions
+1
-2
vllm/v1/spec_decode/eagle.py
vllm/v1/spec_decode/eagle.py
+1
-2
No files found.
vllm/v1/spec_decode/eagle.py
View file @
24b7fb45
...
...
@@ -250,13 +250,12 @@ def prepare_input_kernel(
num_tokens
=
end_pos
-
start_pos
index_start
=
tl
.
load
(
cu_query_lens_ptr
+
pid
)
indices
=
index_start
+
tl
.
arange
(
0
,
BLOCK_SIZE
)
num_blocks
=
tl
.
cdiv
(
num_tokens
,
BLOCK_SIZE
)
for
i
in
tl
.
range
(
num_blocks
):
offset
=
i
*
BLOCK_SIZE
+
tl
.
arange
(
0
,
BLOCK_SIZE
)
tl
.
store
(
out_ptr
+
start_pos
+
offset
,
ind
ices
,
ind
ex_start
+
offset
,
mask
=
offset
<
num_tokens
,
)
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment