Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
14e2b073
Unverified
Commit
14e2b073
authored
Aug 19, 2025
by
Lucas Wilkinson
Committed by
GitHub
Aug 19, 2025
Browse files
[BugFix] fix CUTLASS MLA full cudagraph (#23200)
Signed-off-by:
Lucas Wilkinson
<
lwilkins@redhat.com
>
parent
0f4f0191
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
1 deletion
+1
-1
vllm/v1/attention/backends/mla/cutlass_mla.py
vllm/v1/attention/backends/mla/cutlass_mla.py
+1
-1
No files found.
vllm/v1/attention/backends/mla/cutlass_mla.py
View file @
14e2b073
...
@@ -21,7 +21,7 @@ logger = init_logger(__name__)
...
@@ -21,7 +21,7 @@ logger = init_logger(__name__)
class
CutlassMLAMetadataBuilder
(
MLACommonMetadataBuilder
[
MLACommonMetadata
]):
class
CutlassMLAMetadataBuilder
(
MLACommonMetadataBuilder
[
MLACommonMetadata
]):
# enable full CUDA Graph support for decode-only capture
# enable full CUDA Graph support for decode-only capture
attn_
cudagraph_support
:
ClassVar
[
cudagraph_support
:
ClassVar
[
AttentionCGSupport
]
=
AttentionCGSupport
.
UNIFORM_SINGLE_TOKEN_DECODE
AttentionCGSupport
]
=
AttentionCGSupport
.
UNIFORM_SINGLE_TOKEN_DECODE
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment