OpenDAS / vllm_cscc
Commit 6f1beee6, authored Mar 25, 2026 by zhuwenwen
Update modelopt.py
Parent: eefa41c1
Changes: 1 changed file with 1 addition and 1 deletion

vllm/model_executor/layers/quantization/modelopt.py (+1 -1)
@@ -2209,7 +2209,7 @@ class ModelOptMixedPrecisionConfig(ModelOptQuantConfigBase):
                 return ModelOptFp8LinearMethod(self.fp8_config)
             if quant_algo == "NVFP4":
                 return ModelOptNvFp4LinearMethod(self.nvfp4_config)
-            # Layer not in quantized_layers 鈥? leave unquantized
+            # Layer not in quantized_layers — leave unquantized
             return UnquantizedLinearMethod()
 
         if isinstance(layer, FusedMoE):
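The only change in this hunk is the punctuation inside the comment. The old characters look like the usual result of UTF-8 bytes being read with a GBK codec. The sketch below reproduces that effect for the comment line. It assumes, without confirmation from the commit, that this is how the parent revision was corrupted. The helper name `misdecode_as_gbk` is hypothetical and is not part of vLLM.

```python
# Minimal sketch: reproduce the garbling of a UTF-8 dash when the bytes
# are decoded as GBK. That this is what happened in the parent commit is
# an assumption, not something the commit states.

EM_DASH = "\u2014"  # the character restored by this commit


def misdecode_as_gbk(text: str) -> str:
    """Encode text as UTF-8, then decode those bytes as GBK.

    Bytes that are invalid in GBK are replaced rather than raising.
    (Hypothetical helper, for illustration only.)
    """
    return text.encode("utf-8").decode("gbk", errors="replace")


original = f"# Layer not in quantized_layers {EM_DASH} leave unquantized"
garbled = misdecode_as_gbk(original)

print(original)
print(garbled)
```

The dash is three bytes in UTF-8 (`E2 80 94`). The first two form a valid GBK pair and decode to `鈥`. The third has no valid partner and is replaced, which is consistent with the `?` seen in the removed line.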