Unverified Commit a8102998 authored by Andreas Karatzas's avatar Andreas Karatzas Committed by GitHub
Browse files

[ROCm][CI][Docs] Add comment explaining TRITON_ATTN fallback for ROCm (#32835)


Signed-off-by: default avatarAndreas Karatzas <akaratza@amd.com>
parent eb1629da
......@@ -88,6 +88,8 @@ def get_available_attention_backends() -> list[str]:
get_valid_backends = getattr(current_platform.__class__, "get_valid_backends", None)
if get_valid_backends is None:
if current_platform.is_rocm():
# ROCm uses Triton as its default attention backend since
# Flash Attention is not supported.
return ["TRITON_ATTN"]
else:
return ["FLASH_ATTN"]
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment