OpenDAS / vllm_cscc
Commit 09bdce60, authored Mar 03, 2026 by yangql

Fix awq-marlin bug

Parent: ef8dd155
Showing 1 changed file with 3 additions and 2 deletions
vllm/_custom_ops.py
@@ -1515,8 +1515,9 @@ def awq_marlin_moe_repack(
     output = torch.empty((num_experts, size_k // 16, size_n * (num_bits // 2)),
                          device=b_q_weight.device,
                          dtype=b_q_weight.dtype)
-    output[e] = op.awq_marlin_repack(b_q_weight[e], size_k,
-                                     size_n, num_bits)
+    for e in range(num_experts):
+        output[e] = op.awq_marlin_repack(b_q_weight[e], size_k,
+                                         size_n, num_bits)
     return output
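The change in this commit puts the per-expert assignment inside a `for e in range(num_experts)` loop; as far as the hunk shows, `e` is not bound before the assignment on the old side. The following is a minimal sketch of that pattern, not the vLLM implementation: plain Python lists stand in for tensors, and `moe_repack_shape`, `repack_all_experts`, and `repack_one` are hypothetical names introduced only for illustration (`repack_one` plays the role of `op.awq_marlin_repack`).

```python
from typing import Callable, List, Tuple

Matrix = List[List[int]]  # stand-in for one expert's 2-D weight tensor


def moe_repack_shape(num_experts: int, size_k: int, size_n: int,
                     num_bits: int) -> Tuple[int, int, int]:
    """Shape of the output buffer, mirroring the torch.empty call in the diff."""
    return (num_experts, size_k // 16, size_n * (num_bits // 2))


def repack_all_experts(b_q_weight: List[Matrix],
                       repack_one: Callable[[Matrix], Matrix]) -> List[Matrix]:
    """Repack every expert slice independently.

    repack_one is a hypothetical stand-in for the real per-expert kernel.
    """
    num_experts = len(b_q_weight)
    output: List[Matrix] = [[] for _ in range(num_experts)]
    # Without this loop, `e` would be unbound at the assignment below.
    for e in range(num_experts):
        output[e] = repack_one(b_q_weight[e])
    return output
```

For example, `moe_repack_shape(8, 4096, 1024, 4)` gives `(8, 256, 2048)`, since 4096 // 16 = 256 and 1024 * (4 // 2) = 2048.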