[BUGFIX] deepseek-v2-lite failed due to fused_qkv_a_proj name update (#21414)

Signed-off-by: Chendi.Xue <chendi.xue@intel.com>

[BUGFIX] deepseek-v2-lite failed due to fused_qkv_a_proj name update (#21414)
Signed-off-by: Chendi.Xue <chendi.xue@intel.com>
08d2bd78 · Chendi.Xue · GitHub · 4f76a05f · 08d2bd78
Unverified Commit 08d2bd78 authored Jul 22, 2025 by Chendi.Xue Committed by GitHub Jul 22, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 5 additions and 2 deletions

vllm/model_executor/models/deepseek_v2.py vllm/model_executor/models/deepseek_v2.py +5 -2

No files found.
--- a/vllm/model_executor/models/deepseek_v2.py
+++ b/vllm/model_executor/models/deepseek_v2.py
@@ -885,13 +885,16 @@ class DeepseekV2ForCausalLM(nn.Module, SupportsPP, MixtureOfExperts):
                # for mlp.experts[0].gate_gate_up_proj, which breaks load.
                if (("mlp.experts." in name) and name not in params_dict):
                    continue
-                name = name.replace(weight_name, param_name)
+                name_mapped = name.replace(weight_name, param_name)

                # QKV fusion is optional, fall back to normal
                # weight loading if it's not enabled
+                # if go with fusion option, then update name
                if ((param_name == "fused_qkv_a_proj")
-                        and name not in params_dict):
+                        and name_mapped not in params_dict):
                    continue
+                else:
+                    name = name_mapped
                # Skip loading extra bias for GPTQ models.
                if name.endswith(".bias") and name not in params_dict:
                    continue