"vllm/vscode:/vscode.git/clone" did not exist on "bb5b640359cc6695cb7818a24680e226f72a4da7"
Unverified Commit 14e2b073 authored by Lucas Wilkinson's avatar Lucas Wilkinson Committed by GitHub
Browse files

[BugFix] fix CUTLASS MLA full cudagraph (#23200)


Signed-off-by: default avatarLucas Wilkinson <lwilkins@redhat.com>
parent 0f4f0191
...@@ -21,7 +21,7 @@ logger = init_logger(__name__) ...@@ -21,7 +21,7 @@ logger = init_logger(__name__)
class CutlassMLAMetadataBuilder(MLACommonMetadataBuilder[MLACommonMetadata]): class CutlassMLAMetadataBuilder(MLACommonMetadataBuilder[MLACommonMetadata]):
# enable full CUDA Graph support for decode-only capture # enable full CUDA Graph support for decode-only capture
attn_cudagraph_support: ClassVar[ cudagraph_support: ClassVar[
AttentionCGSupport] = AttentionCGSupport.UNIFORM_SINGLE_TOKEN_DECODE AttentionCGSupport] = AttentionCGSupport.UNIFORM_SINGLE_TOKEN_DECODE
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment