[ROCm][quantization] improve OCP weight quant parser robust (#34431)

Signed-off-by: xuebwang-amd <xuebwang@amd.com> Co-authored-by: TJian <tunjian.tan@embeddedllm.com>

[ROCm][quantization] improve OCP weight quant parser robust (#34431)
Signed-off-by: xuebwang-amd <xuebwang@amd.com> Co-authored-by: TJian <tunjian.tan@embeddedllm.com>
766e1678 · xuebwang-amd · GitHub · becbe248 · 766e1678
Unverified Commit 766e1678 authored Feb 13, 2026 by xuebwang-amd Committed by GitHub Feb 12, 2026
Show whitespace changes
Inline Side-by-side

Showing with 7 additions and 0 deletions

vllm/model_executor/layers/quantization/quark/quark.py vllm/model_executor/layers/quantization/quark/quark.py +7 -0

No files found.
--- a/vllm/model_executor/layers/quantization/quark/quark.py
+++ b/vllm/model_executor/layers/quantization/quark/quark.py
@@ -337,6 +337,13 @@ class QuarkConfig(QuantizationConfig):
            )
            return False
+        if isinstance(weight_quant, list):
+            logger.debug(
+                "Quark model's weight quantization is incompatible with OCP_MX format: "
+                "weight_quant is a list (e.g. fp8_w4a8), OCP_MX requires a single dict."
+            )
+            return False
        # Input and weight qscheme needs to be per group.
        if weight_quant.get("qscheme") != "per_group":
            logger.debug(