Commit 1b7624bf authored by Serena, committed by GitHub

[misc] Add FlashMLA as a new option of VLLM_ATTENTION_BACKEND env (#14267)

parent ac60dc7f
...
@@ -321,6 +321,7 @@ environment_variables: dict[str, Callable[[], Any]] = {
     # - "XFORMERS": use XFormers
     # - "ROCM_FLASH": use ROCmFlashAttention
     # - "FLASHINFER": use flashinfer
+    # - "FLASHMLA": use FlashMLA
     "VLLM_ATTENTION_BACKEND":
     lambda: os.getenv("VLLM_ATTENTION_BACKEND", None),
...
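
The hunk only changes a comment, so it may help to see how an entry in this table is read. The sketch below is a standalone illustration, not the project's code: it copies the single `VLLM_ATTENTION_BACKEND` entry from the diff into a local dict, and the names `read_attention_backend` and `DOCUMENTED_BACKENDS` are hypothetical, introduced here only for the example. The tuple lists just the four backend names visible in the hunk; the earlier options are elided in the diff and are not reproduced.

```python
import os
from typing import Any, Callable, Optional

# Local stand-in for the table shown in the diff; only the one entry
# visible in the hunk is copied here.
environment_variables: dict[str, Callable[[], Any]] = {
    "VLLM_ATTENTION_BACKEND":
    lambda: os.getenv("VLLM_ATTENTION_BACKEND", None),
}

# Hypothetical constant: the backend names visible in the hunk's comments.
# The diff elides earlier options, so this tuple is not the full list.
DOCUMENTED_BACKENDS = ("XFORMERS", "ROCM_FLASH", "FLASHINFER", "FLASHMLA")


def read_attention_backend() -> Optional[str]:
    """Hypothetical helper: call the lambda so the environment is read now."""
    return environment_variables["VLLM_ATTENTION_BACKEND"]()


# Select the option that this commit documents, then read it back.
os.environ["VLLM_ATTENTION_BACKEND"] = "FLASHMLA"
backend = read_attention_backend()
print(backend, backend in DOCUMENTED_BACKENDS)
```

Because each value is a lambda, the variable is looked up when the entry is called rather than when the dict is built, so a value set after import is still picked up. When the variable is unset, the lookup returns `None`.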