Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
a8102998
Unverified
Commit
a8102998
authored
Jan 22, 2026
by
Andreas Karatzas
Committed by
GitHub
Jan 21, 2026
Browse files
[ROCm][CI][Docs] Add comment explaining TRITON_ATTN fallback for ROCm (#32835)
Signed-off-by:
Andreas Karatzas
<
akaratza@amd.com
>
parent
eb1629da
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
0 deletions
+2
-0
tests/v1/spec_decode/test_acceptance_length.py
tests/v1/spec_decode/test_acceptance_length.py
+2
-0
No files found.
tests/v1/spec_decode/test_acceptance_length.py
View file @
a8102998
...
...
@@ -88,6 +88,8 @@ def get_available_attention_backends() -> list[str]:
get_valid_backends
=
getattr
(
current_platform
.
__class__
,
"get_valid_backends"
,
None
)
if
get_valid_backends
is
None
:
if
current_platform
.
is_rocm
():
# ROCm uses Triton as its default attention backend since
# Flash Attention is not supported.
return
[
"TRITON_ATTN"
]
else
:
return
[
"FLASH_ATTN"
]
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment