Unverified commit 7ae5a5fb, authored by Jiangyun Zhu, committed by GitHub

[Misc] Add some comments in qwen3-next (#28267)


Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
parent de2b7830
@@ -462,6 +462,8 @@ class Qwen3NextGatedDeltaNet(nn.Module, MambaBase):
# ============================================================
# Part 2: Core Attention (Custom Op)
# ============================================================
# Note: we should not use torch.empty here like other attention backends,
# see discussions in https://github.com/vllm-project/vllm/pull/28182
core_attn_out = torch.zeros(
(num_tokens, self.num_v_heads // self.tp_size, self.head_v_dim),
dtype=hidden_states.dtype,
......
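
The comment added in this hunk does not state why `torch.zeros` is required; the linked pull request carries that discussion. One plausible reading, offered here as an assumption and not as the authors' stated reason, is that the custom op may write only some rows of the output buffer (for example, skipping padded tokens), so any row it skips keeps whatever the allocation contained. The sketch below illustrates that general hazard. `fake_core_attention` and `num_actual_tokens` are hypothetical names, and a NaN-filled tensor stands in for the unspecified contents of `torch.empty` so that the result is deterministic.

```python
import torch


def fake_core_attention(out: torch.Tensor, num_actual_tokens: int) -> None:
    """Hypothetical stand-in for a custom op that fills only the leading rows of `out`."""
    out[:num_actual_tokens] = 1.0


# Illustrative sizes only; they are not taken from the model.
num_tokens, num_actual_tokens, num_heads, head_dim = 8, 5, 2, 4
shape = (num_tokens, num_heads, head_dim)

# Zero-initialized buffer: rows the op skips stay at 0.
zero_buf = torch.zeros(shape)
fake_core_attention(zero_buf, num_actual_tokens)

# torch.empty returns unspecified values. NaN is used here to simulate
# a worst-case leftover value in a reproducible way.
stale_buf = torch.full(shape, float("nan"))
fake_core_attention(stale_buf, num_actual_tokens)

zero_tail_is_clean = bool((zero_buf[num_actual_tokens:] == 0).all())
stale_tail_has_nan = bool(torch.isnan(stale_buf[num_actual_tokens:]).all())

# Any later reduction over the whole buffer is poisoned by the stale rows.
zero_sum_is_finite = bool(torch.isfinite(zero_buf.sum()))
stale_sum_is_finite = bool(torch.isfinite(stale_buf.sum()))
```

With the zero-initialized buffer, the unwritten rows contribute nothing to later computation. With the simulated stale buffer, a single reduction over the full tensor is no longer finite.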