sglang / Commit 087751a8 (Unverified)
Authored May 08, 2025 by Stefan He; committed by GitHub, May 08, 2025
Parent: 911f3ba6

Remove unnecessary is_fa3_supported check (#6112)
Showing 1 changed file with 0 additions and 4 deletions.

sgl-kernel/python/sgl_kernel/flash_attn.py (+0, -4)
```diff
@@ -144,10 +144,6 @@ def flash_attn_with_kvcache(
         logsumexp of each row of the matrix QK^T * scaling (e.g., log of the softmax
         normalization factor).
     """
-    if not is_fa3_supported():
-        raise NotImplementedError(
-            "flash_attn at sgl-kernel is only supported on sm90 and cu123 above"
-        )
     assert k_cache.stride(-1) == 1, "k_cache must have contiguous last dimension"
     assert v_cache.stride(-1) == 1, "v_cache must have contiguous last dimension"
     if softmax_scale is None:
```
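The assertions that remain after this change require both KV caches to keep a contiguous last dimension. A minimal sketch of what the `stride(-1) == 1` check accepts and rejects, assuming PyTorch (the tensor shape below is illustrative, not taken from this commit):

```python
import torch

# A contiguous tensor has stride 1 along its last dimension,
# so the assert in flash_attn_with_kvcache passes.
k_cache = torch.zeros(4, 128, 8, 64)  # hypothetical (batch, seq, heads, head_dim)
assert k_cache.stride(-1) == 1

# Transposing produces a non-contiguous view whose last-dim stride
# is no longer 1; the same assert would fail on this layout.
k_t = k_cache.transpose(1, 3)
assert k_t.stride(-1) != 1

# .contiguous() copies into a layout that satisfies the check again.
assert k_t.contiguous().stride(-1) == 1
```

In practice this means a transposed or strided cache view must be made contiguous before being passed to the kernel.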