Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
36239f79
Unverified
Commit
36239f79
authored
Jun 19, 2025
by
Michael Goin
Committed by
GitHub
Jun 19, 2025
Browse files
Fix FA2 fallback for Blackwell V1 (#19781)
Signed-off-by:
mgoin
<
mgoin64@gmail.com
>
parent
dfada85e
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
1 deletion
+1
-1
vllm/platforms/cuda.py
vllm/platforms/cuda.py
+1
-1
No files found.
vllm/platforms/cuda.py
View file @
36239f79
...
@@ -255,7 +255,7 @@ class CudaPlatformBase(Platform):
...
@@ -255,7 +255,7 @@ class CudaPlatformBase(Platform):
"install FlashInfer for better performance."
)
"install FlashInfer for better performance."
)
pass
pass
# FlashAttention is the default for SM 8.0+ GPUs
# FlashAttention is the default for SM 8.0+ GPUs
el
if
cls
.
has_device_capability
(
80
):
if
cls
.
has_device_capability
(
80
):
logger
.
info_once
(
"Using Flash Attention backend on V1 engine."
)
logger
.
info_once
(
"Using Flash Attention backend on V1 engine."
)
return
(
"vllm.v1.attention.backends."
return
(
"vllm.v1.attention.backends."
"flash_attn.FlashAttentionBackend"
)
"flash_attn.FlashAttentionBackend"
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment