Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
0fc8fa75
Unverified
Commit
0fc8fa75
authored
Aug 17, 2025
by
Simon Mo
Committed by
GitHub
Aug 17, 2025
Browse files
fix: gptq marlin weight loading failure (#23066)
parent
21e39436
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
1 deletion
+1
-1
vllm/model_executor/layers/quantization/gptq_marlin.py
vllm/model_executor/layers/quantization/gptq_marlin.py
+1
-1
No files found.
vllm/model_executor/layers/quantization/gptq_marlin.py
View file @
0fc8fa75
...
@@ -56,7 +56,7 @@ def get_moe_quant_method(
...
@@ -56,7 +56,7 @@ def get_moe_quant_method(
# Dynamic per module/layer rules may override base config
# Dynamic per module/layer rules may override base config
override_config
(
cloned_config
,
prefix
=
prefix
)
override_config
(
cloned_config
,
prefix
=
prefix
)
return
moe_method_cls
(
cloned_config
)
return
moe_method_cls
(
cloned_config
,
layer
.
moe_config
)
return
None
return
None
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment