[Llama4] Update `attn_temperature_tuning` (#19997)

Signed-off-by: Brayden Zhong <b8zhong@uwaterloo.ca>

Commit 1afa9948 (unverified), authored Jun 24, 2025 by Brayden Zhong; committed by GitHub Jun 24, 2025

Showing with 1 addition and 2 deletions

vllm/model_executor/models/llama4.py +1 -2

--- a/vllm/model_executor/models/llama4.py
+++ b/vllm/model_executor/models/llama4.py
@@ -148,9 +148,8 @@ class Llama4Attention(nn.Module):
        self.q_size = self.num_heads * self.head_dim
        self.kv_size = self.num_kv_heads * self.head_dim
        self.scaling = self.head_dim**-0.5
-        # TODO: attn_temperature_tuning should be a bool in huggingface
        self.attn_temperature_tuning = self.nope and \
-            config.attn_temperature_tuning > 0
+            config.attn_temperature_tuning
        self.floor_scale = getattr(config, "floor_scale", 8192.0)
        self.attn_scale = getattr(config, "attn_scale", 0.1)
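
The following is a minimal sketch, not part of the commit, comparing the removed expression with the new one. The helper names (`old_enabled`, `new_enabled`) and the `SimpleNamespace` objects are hypothetical stand-ins for the model config; only the attribute name `attn_temperature_tuning` comes from the diff. It assumes the config value is either a bool or a non-negative int.

```python
from types import SimpleNamespace


def old_enabled(nope: bool, config) -> bool:
    # Removed expression: `>` binds tighter than `and`,
    # so this is `nope and (value > 0)`.
    return nope and config.attn_temperature_tuning > 0


def new_enabled(nope: bool, config):
    # New expression: relies on the truthiness of the config value.
    # Note that `and` returns its last evaluated operand, so an int
    # config value is passed through unchanged rather than cast to bool.
    return nope and config.attn_temperature_tuning


# Hypothetical configs: a bool-valued flag and a legacy int-valued flag.
bool_cfg = SimpleNamespace(attn_temperature_tuning=True)
int_cfg = SimpleNamespace(attn_temperature_tuning=4)
off_cfg = SimpleNamespace(attn_temperature_tuning=False)

for cfg in (bool_cfg, int_cfg, off_cfg):
    for nope in (True, False):
        # Both expressions agree on truthiness for these inputs.
        assert bool(old_enabled(nope, cfg)) == bool(new_enabled(nope, cfg))
```

For these inputs the two expressions agree whenever the result is used in a boolean context such as an `if` test. Wrapping the new expression in `bool(...)` would be one way to guarantee a strict bool if a caller needed it; the commit itself does not do this.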