OpenDAS / vllm_cscc
Commit 20852c8f
Authored Nov 19, 2025 by Li, Jiang; committed by GitHub on Nov 19, 2025

[CPU] Refactor CPU WNA16 (#28826)

Signed-off-by: jiang1.li <jiang1.li@intel.com>

Parent: 40b6b38f
Showing 2 changed files with 626 additions and 1 deletion (+626 -1)
vllm/model_executor/layers/quantization/cpu_wna16.py (+625 -0)
vllm/model_executor/layers/quantization/ipex_quant.py (+1 -1)

vllm/model_executor/layers/quantization/cpu_wna16.py (new file, mode 0 → 100644)
This diff is collapsed.

vllm/model_executor/layers/quantization/ipex_quant.py
...
@@ -134,7 +134,7 @@ class IPEXConfig(QuantizationConfig):
     def override_quantization_method(
         cls, hf_quant_cfg, user_quant
     ) -> QuantizationMethods | None:
-        if not current_platform.is_cpu() and not current_platform.is_xpu():
+        if not current_platform.is_xpu():
             return None
 
         quant_method = hf_quant_cfg.get("quant_method", "").lower()
...
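
The one-line change in ipex_quant.py narrows the early-return guard in `override_quantization_method`: the old condition let both CPU and XPU platforms continue past the guard, while the new one lets only XPU through. A minimal sketch of that difference follows. It assumes a hypothetical `FakePlatform` stub in place of vLLM's `current_platform`, and the helper names `passes_old_guard` and `passes_new_guard` are illustrative only. The rest of the method is elided in the diff and is not reproduced here.

```python
from dataclasses import dataclass


@dataclass
class FakePlatform:
    """Hypothetical stand-in for vLLM's current_platform (illustration only)."""

    cpu: bool = False
    xpu: bool = False

    def is_cpu(self) -> bool:
        return self.cpu

    def is_xpu(self) -> bool:
        return self.xpu


def passes_old_guard(platform: FakePlatform) -> bool:
    # Old condition: return early only when the platform is neither CPU nor XPU.
    if not platform.is_cpu() and not platform.is_xpu():
        return False
    return True


def passes_new_guard(platform: FakePlatform) -> bool:
    # New condition: return early on every platform except XPU.
    if not platform.is_xpu():
        return False
    return True


if __name__ == "__main__":
    for name, platform in [
        ("cpu", FakePlatform(cpu=True)),
        ("xpu", FakePlatform(xpu=True)),
        ("other", FakePlatform()),
    ]:
        print(name, passes_old_guard(platform), passes_new_guard(platform))
```

Under this sketch, the only platform whose outcome changes is CPU: it passed the old guard and is rejected by the new one. That is consistent with the commit adding a separate cpu_wna16.py, although that file's diff is collapsed on this page, so how CPU is handled there cannot be confirmed from what is shown.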