Unverified Commit 3468f17e authored by Matthew Bonanni's avatar Matthew Bonanni Committed by GitHub
Browse files

[V0 deprecation] Remove _VLLM_V1 suffixes from attention backend names (#25489)


Signed-off-by: default avatarMatthew Bonanni <mbonanni@redhat.com>
Signed-off-by: default avatarMatthew Bonanni <mbonanni001@gmail.com>
parent 71b25b0d
......@@ -155,7 +155,7 @@ class TritonAttentionBackend(AttentionBackend):
@staticmethod
def get_name() -> str:
return "TRITON_ATTN_VLLM_V1"
return "TRITON_ATTN"
@staticmethod
def get_impl_cls() -> type["TritonAttentionImpl"]:
......
......@@ -90,7 +90,7 @@ class XFormersAttentionBackend(AttentionBackend):
@staticmethod
def get_name() -> str:
return "XFORMERS_VLLM_V1"
return "XFORMERS"
@staticmethod
def get_impl_cls() -> type["XFormersAttentionImpl"]:
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment