OpenDAS / vllm_cscc
Commit 6e2c176e (Unverified)
Authored Jul 13, 2025 by Michael Goin. Committed by GitHub, Jul 12, 2025.

[Bugfix] Restrict Machete to only run on Hopper (#20830)

Signed-off-by: mgoin <mgoin64@gmail.com>

Parent: a86754a1
Showing 1 changed file with 3 additions and 0 deletions
vllm/model_executor/layers/quantization/kernels/mixed_precision/machete.py (+3 -0)

@@ -32,6 +32,9 @@ class MacheteLinearKernel(MPLinearKernel):
         if not current_platform.is_cuda():
             return False, "Machete only supported on CUDA"
 
+        if not current_platform.is_device_capability(90):
+            return False, "Machete requires compute capability of 90 (Hopper)"
+
         if c.has_g_idx and\
             c.partition_weight_shape[0] != c.full_weight_shape[0]:
             return False, "Act reordering currently not supported by Machete, "\
...
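The effect of the added guard can be illustrated with a small standalone sketch. It does not import vLLM: `can_use_machete` and `HOPPER_CAPABILITY` are hypothetical names introduced here, and the sketch assumes that `is_device_capability(90)` means an exact match on compute capability 9.0 (encoded as major * 10 + minor), which is what the commit title suggests. Only the two error messages are taken from the diff.

```python
from typing import Optional, Tuple

# Assumption: capability is encoded as major * 10 + minor, so 9.0 -> 90.
HOPPER_CAPABILITY = 90


def can_use_machete(is_cuda: bool,
                    capability: Optional[int]) -> Tuple[bool, Optional[str]]:
    """Hypothetical stand-in for the first two checks in the diff.

    Returns (ok, reason), where reason is None when the kernel is allowed.
    """
    # Existing check: the kernel is CUDA-only.
    if not is_cuda:
        return False, "Machete only supported on CUDA"

    # New check: require exactly Hopper, so both older (e.g. 80) and
    # newer (e.g. 100) capabilities are rejected under this assumption.
    if capability != HOPPER_CAPABILITY:
        return False, "Machete requires compute capability of 90 (Hopper)"

    return True, None


if __name__ == "__main__":
    for cap in (80, 90, 100):
        print(cap, can_use_machete(True, cap))
```

Under that assumption, a caller that iterates over candidate kernels would skip Machete on any non-Hopper GPU and fall through to the next kernel, using the returned reason string for logging.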