OpenDAS / vllm_cscc
Commit 71276043
Authored Mar 26, 2026 by wanghl6
convert_req_index_to_global_index opt
Parent: a0ac95b0
Showing 1 changed file with 12 additions and 0 deletions
vllm/v1/attention/backends/mla/flashmla_sparse.py
@@ -302,6 +302,18 @@ def triton_convert_req_index_to_global_index(
         prefill_workspace_starts: int32 [num_prefills], 0-indexed workspace
             starts for each prefill request
     """
+    if (envs.USE_LIGHTOP_CONVERT_REQ_INDEX_TO_GLOBAL_INDEX):
+        from lightop import op
+        return op.convert_req_index_to_global_index(
+            req_id,
+            block_table,
+            token_indices,
+            BLOCK_SIZE,
+            NUM_TOPK_TOKENS,
+            HAS_PREFILL_WORKSPACE,
+            prefill_workspace_request_ids,
+            prefill_workspace_starts
+        )
     assert req_id.dtype == torch.int32
     assert block_table.dtype == torch.int32
     assert token_indices.dtype == torch.int32
...
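For readers checking the `lightop` fast path against the existing Triton path, a plain-Python reference of the index conversion may help. This is only a sketch under stated assumptions, not the code from this commit: it assumes the mapping is `block_table[req, idx // BLOCK_SIZE] * BLOCK_SIZE + idx % BLOCK_SIZE`, that `-1` marks a padded slot, and that out-of-range block indices also map to `-1`. None of this is visible in the hunk above. The helper name `convert_req_index_to_global_index_ref` is hypothetical, and the prefill-workspace branch is deliberately left out.

```python
from typing import List


def convert_req_index_to_global_index_ref(
    req_id: List[int],
    block_table: List[List[int]],
    token_indices: List[List[int]],
    block_size: int,
) -> List[List[int]]:
    """Hypothetical reference: map request-local token indices to global slots.

    Assumed semantics (not confirmed by the diff):
      out[t][k] = block_table[req_id[t]][idx // block_size] * block_size
                  + idx % block_size,   where idx = token_indices[t][k]
      and out[t][k] = -1 when idx is negative or its block is out of range.
    """
    out: List[List[int]] = []
    for t, row in enumerate(token_indices):
        blocks = block_table[req_id[t]]  # physical block ids for this request
        out_row: List[int] = []
        for idx in row:
            if idx < 0:
                out_row.append(-1)  # padded top-k slot stays invalid
                continue
            block_idx, offset = divmod(idx, block_size)
            if block_idx >= len(blocks):
                out_row.append(-1)  # index beyond the request's block table
                continue
            out_row.append(blocks[block_idx] * block_size + offset)
        out.append(out_row)
    return out


# Two tokens, one per request, block size 4.
example = convert_req_index_to_global_index_ref(
    req_id=[0, 1],
    block_table=[[5, 2], [7, 0]],
    token_indices=[[0, 3, 4, -1], [1, 6, -1, -1]],
    block_size=4,
)
print(example)  # [[20, 23, 8, -1], [29, 2, -1, -1]]
```

A reference like this could be used to compare the outputs of both branches on small inputs whenever `USE_LIGHTOP_CONVERT_REQ_INDEX_TO_GLOBAL_INDEX` is toggled, provided the assumed semantics match the real kernel.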