Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
fe543820
Unverified
Commit
fe543820
authored
Feb 06, 2026
by
Charlie Fu
Committed by
GitHub
Feb 06, 2026
Browse files
[Rocm][Bugfix] Fix dtype not same for gemm_a4w4 op (#33734)
Signed-off-by:
charlifu
<
charlifu@amd.com
>
parent
77c09e11
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
6 additions
and
1 deletion
+6
-1
vllm/model_executor/layers/quantization/quark/schemes/quark_ocp_mx.py
...xecutor/layers/quantization/quark/schemes/quark_ocp_mx.py
+6
-1
No files found.
vllm/model_executor/layers/quantization/quark/schemes/quark_ocp_mx.py
View file @
fe543820
...
...
@@ -118,7 +118,12 @@ try:
)
gemm_a4w4
(
x_q
,
weight
,
x_s
,
weight_scale
.
view
(
x_s
.
dtype
),
y
,
bpreshuffle
=
True
x_q
,
weight
.
view
(
x_q
.
dtype
),
x_s
,
weight_scale
.
view
(
x_s
.
dtype
),
y
,
bpreshuffle
=
True
,
)
return
y
[:
M
]
else
:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment