[Misc] Add some comments in qwen3-next (#28267)

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

[Misc] Add some comments in qwen3-next (#28267)
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
7ae5a5fb · Jiangyun Zhu · GitHub · de2b7830 · 7ae5a5fb
Unverified Commit 7ae5a5fb authored Nov 09, 2025 by Jiangyun Zhu Committed by GitHub Nov 08, 2025
Show whitespace changes
Inline Side-by-side

Showing with 2 additions and 0 deletions

vllm/model_executor/models/qwen3_next.py vllm/model_executor/models/qwen3_next.py +2 -0

No files found.
--- a/vllm/model_executor/models/qwen3_next.py
+++ b/vllm/model_executor/models/qwen3_next.py
@@ -462,6 +462,8 @@ class Qwen3NextGatedDeltaNet(nn.Module, MambaBase):
        # ============================================================
        # Part 2: Core Attention (Custom Op)
        # ============================================================
+        # Note: we should not use torch.empty here like other attention backends,
+        # see discussions in https://github.com/vllm-project/vllm/pull/28182
        core_attn_out = torch.zeros(
            (num_tokens, self.num_v_heads // self.tp_size, self.head_v_dim),
            dtype=hidden_states.dtype,