[model] make llama4 compatible with pure dense layers (#17315)

Signed-off-by: Lucia Fang <fanglu@fb.com>

[model] make llama4 compatible with pure dense layers (#17315)
Signed-off-by: Lucia Fang <fanglu@fb.com>
b4ac4fa0 · Lucia Fang · GitHub · e1360005 · b4ac4fa0
Unverified Commit b4ac4fa0 authored Apr 28, 2025 by Lucia Fang Committed by GitHub Apr 29, 2025
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 2 deletions

vllm/model_executor/models/llama4.py vllm/model_executor/models/llama4.py +2 -2

No files found.
--- a/vllm/model_executor/models/llama4.py
+++ b/vllm/model_executor/models/llama4.py
@@ -273,8 +273,8 @@ class Llama4DecoderLayer(nn.Module):
            cache_config=cache_config,
            prefix=f"{prefix}.self_attn",
        )
-        is_moe_layer = (self.layer_idx +
+        is_moe_layer = config.interleave_moe_layer_step > 0 and (
-                        1) % config.interleave_moe_layer_step == 0
+            self.layer_idx + 1) % config.interleave_moe_layer_step == 0
        if is_moe_layer:
            self.feed_forward = Llama4MoE(
                config=config,