Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
28a6d542
Unverified
Commit
28a6d542
authored
Jul 16, 2025
by
Michael Goin
Committed by
GitHub
Jul 16, 2025
Browse files
[Bugfix] Fix Machete zero point issue for GPTQ models on SM90 (#21066)
Signed-off-by:
mgoin
<
mgoin64@gmail.com
>
parent
58760e12
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
5 additions
and
0 deletions
+5
-0
vllm/model_executor/layers/quantization/kernels/mixed_precision/machete.py
...or/layers/quantization/kernels/mixed_precision/machete.py
+5
-0
No files found.
vllm/model_executor/layers/quantization/kernels/mixed_precision/machete.py
View file @
28a6d542
...
...
@@ -126,6 +126,11 @@ class MacheteLinearKernel(MPLinearKernel):
if
c
.
has_g_idx
:
x_2d
=
self
.
act_perm
(
x_2d
)
if
c
.
zero_points
:
assert
w_zp
is
not
None
else
:
w_zp
=
None
output
=
ops
.
machete_mm
(
a
=
x_2d
,
b_q
=
w_q
,
b_type
=
c
.
weight_type
,
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment