Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
a9e532af
Unverified
Commit
a9e532af
authored
Mar 11, 2026
by
tvirolai-amd
Committed by
GitHub
Mar 11, 2026
Browse files
[ROCm][Perf] Allow MTP lens > 1 in Sparse MLA (#36681)
Signed-off-by:
Teemu Virolainen
<
teemu.virolainen@amd.com
>
parent
f3163bba
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
4 additions
and
0 deletions
+4
-0
vllm/v1/spec_decode/eagle.py
vllm/v1/spec_decode/eagle.py
+4
-0
No files found.
vllm/v1/spec_decode/eagle.py
View file @
a9e532af
...
@@ -214,11 +214,15 @@ class SpecDecodeBaseProposer:
...
@@ -214,11 +214,15 @@ class SpecDecodeBaseProposer:
# Determine allowed attention backends once during initialization.
# Determine allowed attention backends once during initialization.
self
.
allowed_attn_types
:
tuple
|
None
=
None
self
.
allowed_attn_types
:
tuple
|
None
=
None
if
current_platform
.
is_rocm
():
if
current_platform
.
is_rocm
():
from
vllm.v1.attention.backends.mla.rocm_aiter_mla_sparse
import
(
ROCMAiterMLASparseMetadata
,
)
from
vllm.v1.attention.backends.rocm_attn
import
RocmAttentionMetadata
from
vllm.v1.attention.backends.rocm_attn
import
RocmAttentionMetadata
rocm_types
=
[
rocm_types
=
[
TritonAttentionMetadata
,
TritonAttentionMetadata
,
RocmAttentionMetadata
,
RocmAttentionMetadata
,
ROCMAiterMLASparseMetadata
,
]
]
# ROCM_AITER_FA is an optional backend
# ROCM_AITER_FA is an optional backend
# We check is_enabled() here to avoid importing the backend module during
# We check is_enabled() here to avoid importing the backend module during
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment