OpenDAS / vllm_cscc
Commit 527bcd14 (Unverified)
Authored Jan 31, 2026 by jennyyyyzhen; committed by GitHub Jan 31, 2026

[ROCM] Enable aiter attn backend for qwen3-next model (#32492)

Signed-off-by: jennyyyyzhen <yzhen@hmc.edu>

Parent: f68e3ea4

Showing 2 changed files with 2 additions and 2 deletions (+2 -2)

docs/design/attention_backends.md  +1 -1
vllm/v1/attention/backends/rocm_aiter_fa.py  +1 -1

docs/design/attention_backends.md
@@ -168,7 +168,7 @@ Priority is **1 = highest** (tried first).
 | `FLASH_ATTN` | FA3* | fp16, bf16 | `auto`, `bfloat16`, `fp8`, `fp8_e4m3`, `fp8_e5m2` | %16 | Any | ✅ | ❌ | All | 9.x |
 | `FLASH_ATTN_DIFFKV` | | fp16, bf16 | `auto` | Any | Any | ❌ | ❌ | Decoder | Any |
 | `FLEX_ATTENTION` | | fp16, bf16, fp32 | `auto`, `bfloat16` | Any | Any | ❌ | ✅ | Decoder, Encoder Only | Any |
-| `ROCM_AITER_FA` | | fp16, bf16 | `auto` | %16 | 64, 128, 256 | ❌ | ❌ | Decoder | N/A |
+| `ROCM_AITER_FA` | | fp16, bf16 | `auto` | 16, 32 | 64, 128, 256 | ❌ | ❌ | Decoder | N/A |
 | `ROCM_AITER_UNIFIED_ATTN` | | fp16, bf16 | `auto` | Any | Any | ❌ | ❌ | Decoder | N/A |
 | `ROCM_ATTN` | | fp16, bf16, fp32 | `auto` | 16, 32, 544 | 32, 64, 96, 128, 160, 192, 224, 256 | ❌ | ❌ | Decoder | N/A |
 | `TREE_ATTN` | | fp16, bf16 | `auto` | %16 | 32, 64, 96, 128, 160, 192, 224, 256 | ❌ | ❌ | Decoder | Any |

vllm/v1/attention/backends/rocm_aiter_fa.py
@@ -683,7 +683,7 @@ class AiterFlashAttentionBackend(AttentionBackend):
     @staticmethod
     def get_supported_kernel_block_sizes() -> list[int | MultipleOf]:
-        return [MultipleOf(16)]
+        return [16, 32]

     @classmethod
     def get_supported_head_sizes(cls) -> list[int]:
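
The change above replaces an open-ended constraint (any multiple of 16) with an explicit list of two block sizes. The sketch below illustrates the practical difference. It is not vLLM code: the `MultipleOf` dataclass is a hypothetical stand-in for the marker type named in the diff, and `is_supported` is an assumed matching rule, since the diff does not show how the returned list is consumed.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class MultipleOf:
    """Hypothetical stand-in for the MultipleOf marker seen in the diff."""

    base: int


def is_supported(block_size: int, supported: list[int | MultipleOf]) -> bool:
    """Assumed matching rule: an int entry must match exactly,
    a MultipleOf entry matches any positive multiple of its base."""
    for entry in supported:
        if isinstance(entry, MultipleOf):
            if block_size > 0 and block_size % entry.base == 0:
                return True
        elif block_size == entry:
            return True
    return False


# Return values before and after the commit, copied from the diff.
OLD_BLOCK_SIZES: list[int | MultipleOf] = [MultipleOf(16)]
NEW_BLOCK_SIZES: list[int | MultipleOf] = [16, 32]

if __name__ == "__main__":
    for size in (16, 32, 64, 544):
        print(
            f"block_size={size}: "
            f"old={is_supported(size, OLD_BLOCK_SIZES)} "
            f"new={is_supported(size, NEW_BLOCK_SIZES)}"
        )
```

Under this assumed rule, block sizes 16 and 32 pass both versions, while larger multiples of 16 such as 64 pass only the old `MultipleOf(16)` constraint. This matches the documentation change in the table, where the `ROCM_AITER_FA` block-size column goes from `%16` to `16, 32`.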