Unverified Commit 7e081ba7 authored by vllmellm's avatar vllmellm Committed by GitHub
Browse files

[BugFix] Revert ROCm Custom Paged Attention Env Flag Check (#17022)


Signed-off-by: default avatarvllmellm <vllm.ellm@embeddedllm.com>
parent 1e013fa3
......@@ -118,6 +118,7 @@ def use_rocm_custom_paged_attention(qtype: torch.dtype, head_size: int,
and (head_size == 64 or head_size == 128)
and (block_size == 16 or block_size == 32)
and (gqa_ratio >= 1 and gqa_ratio <= 16) and max_seq_len <= 32768
and (envs.VLLM_ROCM_CUSTOM_PAGED_ATTN)
and not (envs.VLLM_ROCM_USE_AITER_PAGED_ATTN
and envs.VLLM_ROCM_USE_AITER))
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment