Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
64deead7
Unverified
Commit
64deead7
authored
Nov 25, 2025
by
vllmellm
Committed by
GitHub
Nov 25, 2025
Browse files
[Bugfix] [ROCm] [UX]: revert Flex attention backend (#29371)
Signed-off-by:
vllmellm
<
vllm.ellm@embeddedllm.com
>
parent
7992324f
Changes
2
Show whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
10 additions
and
0 deletions
+10
-0
tests/v1/attention/test_rocm_attention_backends_selection.py
tests/v1/attention/test_rocm_attention_backends_selection.py
+6
-0
vllm/platforms/rocm.py
vllm/platforms/rocm.py
+4
-0
No files found.
tests/v1/attention/test_rocm_attention_backends_selection.py
View file @
64deead7
...
@@ -36,6 +36,12 @@ def mock_on_gfx9():
...
@@ -36,6 +36,12 @@ def mock_on_gfx9():
@
pytest
.
mark
.
parametrize
(
@
pytest
.
mark
.
parametrize
(
"env_vars, selected_backend, expected_backend_path"
,
"env_vars, selected_backend, expected_backend_path"
,
[
[
# Test Case: Explicit FLEX_ATTENTION backend
(
{},
"FLEX_ATTENTION"
,
AttentionBackendEnum
.
FLEX_ATTENTION
.
get_path
(),
),
# Test Case 1: Default (no env vars, no explicit backend)
# Test Case 1: Default (no env vars, no explicit backend)
(
(
{},
{},
...
...
vllm/platforms/rocm.py
View file @
64deead7
...
@@ -262,6 +262,10 @@ class RocmPlatform(Platform):
...
@@ -262,6 +262,10 @@ class RocmPlatform(Platform):
f
"is not MLA type while requested for MLA backend."
f
"is not MLA type while requested for MLA backend."
)
)
if
selected_backend
==
AttentionBackendEnum
.
FLEX_ATTENTION
:
logger
.
info
(
"Using FlexAttention backend."
)
return
AttentionBackendEnum
.
FLEX_ATTENTION
.
get_path
()
if
selected_backend
==
AttentionBackendEnum
.
TRITON_ATTN
:
if
selected_backend
==
AttentionBackendEnum
.
TRITON_ATTN
:
logger
.
info
(
"Using Triton Attention backend on V1 engine."
)
logger
.
info
(
"Using Triton Attention backend on V1 engine."
)
return
AttentionBackendEnum
.
TRITON_ATTN
.
get_path
()
return
AttentionBackendEnum
.
TRITON_ATTN
.
get_path
()
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment