Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
82f3f30e
Unverified
Commit
82f3f30e
authored
Mar 11, 2026
by
Pleaplusone
Committed by
GitHub
Mar 10, 2026
Browse files
[ROCm][Perf] Enable `sparse_mla`'s cudagraph on ROCm platform (#35719)
Signed-off-by:
ganyi
<
ygan@amd.com
>
parent
9095cbbf
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
3 additions
and
4 deletions
+3
-4
vllm/v1/attention/backends/mla/rocm_aiter_mla_sparse.py
vllm/v1/attention/backends/mla/rocm_aiter_mla_sparse.py
+3
-1
vllm/v1/attention/ops/rocm_aiter_mla_sparse.py
vllm/v1/attention/ops/rocm_aiter_mla_sparse.py
+0
-3
No files found.
vllm/v1/attention/backends/mla/rocm_aiter_mla_sparse.py
View file @
82f3f30e
...
...
@@ -151,7 +151,9 @@ class ROCMAiterMLASparseMetadata(AttentionMetadata):
class
ROCMAiterMLASparseMetadataBuilder
(
AttentionMetadataBuilder
[
ROCMAiterMLASparseMetadata
]
):
_cudagraph_support
:
ClassVar
[
AttentionCGSupport
]
=
AttentionCGSupport
.
NEVER
_cudagraph_support
:
ClassVar
[
AttentionCGSupport
]
=
(
AttentionCGSupport
.
UNIFORM_SINGLE_TOKEN_DECODE
)
def
__init__
(
self
,
...
...
vllm/v1/attention/ops/rocm_aiter_mla_sparse.py
View file @
82f3f30e
...
...
@@ -327,9 +327,6 @@ def rocm_fp8_paged_mqa_logits(
aiter_paged_mqa_logits_module
=
None
if
rocm_aiter_ops
.
is_enabled
():
aiter_paged_mqa_logits_module
=
paged_mqa_logits_module
()
# FIXME(ganyi): Temporarily disable the aiter path until nightly docker
# update aiter to the fix PR.
aiter_paged_mqa_logits_module
=
None
if
aiter_paged_mqa_logits_module
is
not
None
:
deepgemm_fp8_paged_mqa_logits_stage1
=
(
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment