OpenDAS / vllm_cscc
Commit e0c910bb (Unverified)
Authored Nov 14, 2025 by Thomas Parnell; committed by GitHub on Nov 14, 2025
[Hybrid] [Kernel] Fix chunk scan kernel when BLOCK_SIZE_DSTATE > 128 (#28295)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
Parent: bf3ffb61
Changes: 1 changed file with 1 addition and 1 deletion (+1 -1)
vllm/model_executor/layers/mamba/ops/ssd_chunk_scan.py @ e0c910bb

@@ -245,7 +245,7 @@ def _chunk_scan_fwd_kernel(
                 )
                 if not HAS_INITSTATES and (seq_idx != seq_idx_prev):
                     prev_states = tl.zeros(
-                        (BLOCK_SIZE_DSTATE, BLOCK_SIZE_K), dtype=C_ptr.dtype.element_ty
+                        (BLOCK_SIZE_K, BLOCK_SIZE_N), dtype=C_ptr.dtype.element_ty
                     )
                 else:
                     prev_states = tl.load(
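The one-line change swaps the shape of the zero-filled `prev_states` tile from `(BLOCK_SIZE_DSTATE, BLOCK_SIZE_K)` to `(BLOCK_SIZE_K, BLOCK_SIZE_N)`. The sketch below is plain Python, not Triton, and only checks shapes. It assumes the zero tile is later multiplied by a `C` tile of shape `(BLOCK_SIZE_M, BLOCK_SIZE_K)`; that consumer is not visible in the hunk. The helper names and the concrete block sizes are hypothetical and chosen for illustration.

```python
def dot_result_shape(a_shape, b_shape):
    """Return the shape of a 2-D matrix product, or raise on mismatched inner dims."""
    (m, k_a), (k_b, n) = a_shape, b_shape
    if k_a != k_b:
        raise ValueError(f"inner dimensions differ: {k_a} vs {k_b}")
    return (m, n)


def is_compatible(a_shape, b_shape):
    """True if a_shape @ b_shape is a valid 2-D matrix product."""
    try:
        dot_result_shape(a_shape, b_shape)
        return True
    except ValueError:
        return False


# Hypothetical tile sizes, for illustration only.
BLOCK_SIZE_M, BLOCK_SIZE_N, BLOCK_SIZE_K = 64, 32, 64
BLOCK_SIZE_DSTATE = 256  # > 128, the case named in the commit title

# Assumed shape of the C tile that multiplies prev_states (not shown in the diff).
c_tile = (BLOCK_SIZE_M, BLOCK_SIZE_K)

old_zeros = (BLOCK_SIZE_DSTATE, BLOCK_SIZE_K)  # shape before the commit
new_zeros = (BLOCK_SIZE_K, BLOCK_SIZE_N)       # shape after the commit

print(is_compatible(c_tile, old_zeros))     # False: 64 vs 256
print(is_compatible(c_tile, new_zeros))     # True: 64 vs 64
print(dot_result_shape(c_tile, new_zeros))  # (64, 32)
```

Under that assumption, the old shape only lines up with the `C` tile when `BLOCK_SIZE_DSTATE` happens to equal `BLOCK_SIZE_K`, whereas the new shape always matches on the inner dimension and yields a `(BLOCK_SIZE_M, BLOCK_SIZE_N)` result.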