[Bugfix] fix Qwen3VLMoe load when pp > 1 (#25838)

Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com> Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>

[Bugfix] fix Qwen3VLMoe load when pp > 1 (#25838)
Signed-off-by: liuye.hj <liuye.hj@alibaba-inc.com> Co-authored-by: liuye.hj <liuye.hj@alibaba-inc.com>
471997ad · JJJYmmm · GitHub · b1ded114 · 471997ad
Unverified Commit 471997ad authored Sep 29, 2025 by JJJYmmm Committed by GitHub Sep 28, 2025
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 2 deletions

vllm/model_executor/models/qwen3_vl_moe.py vllm/model_executor/models/qwen3_vl_moe.py +2 -2

No files found.
--- a/vllm/model_executor/models/qwen3_vl_moe.py
+++ b/vllm/model_executor/models/qwen3_vl_moe.py
@@ -212,6 +212,8 @@ class Qwen3MoeLLMModel(Qwen3MoeModel):
                    # attempted to load as other weights later
                    is_expert_weight = True
                    name_mapped = name.replace(weight_name, param_name)
+                    if is_pp_missing_parameter(name_mapped, self):
+                        continue
                    if is_fused_expert:
                        loaded_weight = loaded_weight.transpose(-1,
                                                                -2)  # no bias
@@ -230,8 +232,6 @@ class Qwen3MoeLLMModel(Qwen3MoeModel):
                                name_mapped, params_dict, loaded_weight,
                                shard_id, num_experts)
                    else:
-                        if is_pp_missing_parameter(name_mapped, self):
-                            continue
                        # Skip loading extra parameters for GPTQ/modelopt models
                        if name_mapped.endswith(
                                ignore_suffixes