Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
9fb3ae4e
Unverified
Commit
9fb3ae4e
authored
Oct 08, 2025
by
Wentao Ye
Committed by
GitHub
Oct 08, 2025
Browse files
[Bug] Fix DeepGEMM Attention Test (#26423)
Signed-off-by:
yewentao256
<
zhyanwentao@126.com
>
parent
76afe4ed
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
2 additions
and
2 deletions
+2
-2
pyproject.toml
pyproject.toml
+1
-0
tests/kernels/attention/test_deepgemm_attention.py
tests/kernels/attention/test_deepgemm_attention.py
+1
-2
No files found.
pyproject.toml
View file @
9fb3ae4e
...
...
@@ -184,6 +184,7 @@ ba = "ba"
[tool.typos.type.py.extend-words]
ba
=
"ba"
nd
=
"nd"
[tool.typos.type.cpp]
extend-glob
=
["*.cu"]
...
...
tests/kernels/attention/test_deepgemm_attention.py
View file @
9fb3ae4e
...
...
@@ -82,8 +82,7 @@ def _ref_fp8_mqa_logits(
torch
.
arange
(
0
,
seq_len_kv
,
device
=
"cuda"
)[
None
,
:]
<
cu_seqlen_ke
[:,
None
]
)
mask
=
mask_lo
&
mask_hi
score
=
torch
.
einsum
(
"mhd,and->hmn"
,
q
,
k
)
score
=
torch
.
einsum
(
"mhd,nd->hmn"
,
q
,
k
)
logits
=
(
score
.
relu
()
*
weights
.
unsqueeze
(
-
1
).
transpose
(
0
,
1
)).
sum
(
dim
=
0
)
logits
=
logits
.
masked_fill
(
~
mask
,
float
(
"-inf"
))
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment