Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
OpenDAS
vllm_cscc
Commits
3de3eadf
Unverified
Commit
3de3eadf
authored
May 30, 2025
by
Wenhua Cheng
Committed by
GitHub
May 29, 2025
Browse files
improve the robustness of parsing vlms config in AutoRound (#18894)
Signed-off-by:
wenhuach21
<
wenhua.cheng@intel.com
>
parent
3132290a
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
3 additions
and
2 deletions
+3
-2
vllm/model_executor/layers/quantization/auto_round.py
vllm/model_executor/layers/quantization/auto_round.py
+3
-2
No files found.
vllm/model_executor/layers/quantization/auto_round.py
View file @
3de3eadf
...
@@ -116,8 +116,9 @@ class AutoRoundConfig(QuantizationConfig):
...
@@ -116,8 +116,9 @@ class AutoRoundConfig(QuantizationConfig):
quantized
=
True
quantized
=
True
if
self
.
block_name_to_quantize
:
if
self
.
block_name_to_quantize
:
quantized
=
any
(
name
in
layer_name
quantized
=
any
(
for
name
in
self
.
block_name_to_quantize
)
layer_name
.
startswith
(
name
)
for
name
in
self
.
block_name_to_quantize
)
elif
isinstance
(
layer
,
ParallelLMHead
):
elif
isinstance
(
layer
,
ParallelLMHead
):
quantized
=
False
quantized
=
False
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment