Fix tensor device and dtype placement in Qwen2VL model (#26219)

Signed-off-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
Co-authored-by: Yuanfeng Li <yuanfengli@meta.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>

Commit 86ee9491, authored Oct 04, 2025 by yuafng, committed by GitHub Oct 04, 2025

Showing with 1 addition and 1 deletion

vllm/model_executor/models/qwen2_vl.py +1 -1

--- a/vllm/model_executor/models/qwen2_vl.py
+++ b/vllm/model_executor/models/qwen2_vl.py
@@ -720,7 +720,7 @@ class Qwen2VisionTransformer(nn.Module):
        rotary_pos_emb = self.rot_pos_emb(grid_thw)
        # compute cu_seqlens
-        grid_thw_ = torch.tensor(grid_thw)
+        grid_thw_ = torch.tensor(grid_thw, device=x.device, dtype=torch.long)
        cu_seqlens = torch.repeat_interleave(grid_thw_[:, 1] * grid_thw_[:, 2],
                                             grid_thw_[:, 0]).cumsum(
                                                 dim=0, dtype=torch.int32)
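
For reviewers who want to check the arithmetic that follows the changed line, here is a minimal plain-Python sketch of what the `repeat_interleave(...).cumsum(...)` expression computes. It assumes `grid_thw` is a nested list of `[t, h, w]` triples (one per image or video); the helper name `cu_seqlens_from_grid` is hypothetical and not part of vLLM. It deliberately avoids torch, so it says nothing about device or dtype placement, which is what the one-line change itself addresses.

```python
from itertools import accumulate


def cu_seqlens_from_grid(grid_thw):
    """Hypothetical helper: mirror the cu_seqlens arithmetic in plain Python.

    grid_thw is assumed to be a list of [t, h, w] triples.
    """
    per_frame_lengths = []
    for t, h, w in grid_thw:
        # Each of the t frames contributes h * w patches, matching
        # repeat_interleave(h * w, t) in the tensor version.
        per_frame_lengths.extend([h * w] * t)
    # Running total over frames, matching cumsum(dim=0).
    return list(accumulate(per_frame_lengths))


# One image with a 4x6 grid, then a 2-frame video with a 2x2 grid.
print(cu_seqlens_from_grid([[1, 4, 6], [2, 2, 2]]))  # [24, 28, 32]
```

In the tensor version, passing `device=x.device` and `dtype=torch.long` to `torch.tensor` creates `grid_thw_` directly on the same device as the input `x` with an explicit integer dtype, rather than relying on the defaults.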