Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
221bf725
Unverified
Commit
221bf725
authored
Oct 19, 2025
by
Jianyu Huang
Committed by
GitHub
Oct 19, 2025
Browse files
output type conversion fix (#27159)
parent
b3aba04e
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
4 deletions
+1
-4
vllm/model_executor/layers/batch_invariant.py
vllm/model_executor/layers/batch_invariant.py
+1
-4
No files found.
vllm/model_executor/layers/batch_invariant.py
View file @
221bf725
...
@@ -134,10 +134,7 @@ def matmul_kernel_persistent(
...
@@ -134,10 +134,7 @@ def matmul_kernel_persistent(
bias_ptrs
=
bias_ptr
+
offs_cn
bias_ptrs
=
bias_ptr
+
offs_cn
bias
=
tl
.
load
(
bias_ptrs
,
mask
=
offs_cn
<
N
,
other
=
0.0
).
to
(
tl
.
float32
)
bias
=
tl
.
load
(
bias_ptrs
,
mask
=
offs_cn
<
N
,
other
=
0.0
).
to
(
tl
.
float32
)
accumulator
+=
bias
accumulator
+=
bias
if
c_ptr
.
dtype
.
element_ty
==
tl
.
float8e4nv
:
c
=
accumulator
.
to
(
c_ptr
.
dtype
.
element_ty
)
c
=
accumulator
.
to
(
tl
.
float8e4nv
)
else
:
c
=
accumulator
.
to
(
tl
.
float16
)
tl
.
store
(
c_ptrs
,
c
,
mask
=
c_mask
)
tl
.
store
(
c_ptrs
,
c
,
mask
=
c_mask
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment