[BugFix][AMD][Deepseek] fix a dtype mismatch error for deepseek running on AMD (#23864)

Signed-off-by: Jinghui Zhang <jinghuizhang0804@gmail.com>

[BugFix][AMD][Deepseek] fix a dtype mismatch error for deepseek running on AMD (#23864)
Signed-off-by: Jinghui Zhang <jinghuizhang0804@gmail.com>
5264015d · Jinghui Zhang · GitHub · 98ac0cb3 · 5264015d
Unverified Commit 5264015d authored Aug 28, 2025 by Jinghui Zhang Committed by GitHub Aug 28, 2025
Hide whitespace changes
Inline Side-by-side

Showing with 4 additions and 4 deletions

vllm/model_executor/layers/fused_moe/rocm_aiter_fused_moe.py vllm/model_executor/layers/fused_moe/rocm_aiter_fused_moe.py +4 -4

No files found.
--- a/vllm/model_executor/layers/fused_moe/rocm_aiter_fused_moe.py
+++ b/vllm/model_executor/layers/fused_moe/rocm_aiter_fused_moe.py
@@ -279,7 +279,7 @@ def rocm_aiter_grouped_topk(
    if e_score_correction_bias is not None:
        torch.ops.vllm.rocm_aiter_biased_grouped_topk(
            gating_output,
-            e_score_correction_bias,
+            e_score_correction_bias.to(gating_output.dtype),
            topk_weights,
            topk_ids,
            num_expert_group,
@@ -409,15 +409,15 @@ def shuffle_weights(
    *tensors: torch.Tensor, layout: tuple[int, int] = (16, 16)
 ) -> tuple[torch.Tensor, ...]:
    """
-    Applies shuffle_weight function from AITER to each 
+    Applies shuffle_weight function from AITER to each
    input tensor and returns them.
    Rearranges (shuffles) the input tensor/s
    into a specified block layout for optimized computation.
    Args:
        *tensors: Variable number of torch.Tensor objects.
-        layout: A pair of integers specifying the 
+        layout: A pair of integers specifying the
        block sizes used to divide the tensors during shuffling.
        Default is (16, 16).