Fix NaN from stale FP4 scale padding in create_fp4_scale_tensor (#38148)

Signed-off-by: Elvir Crncevic <elvircrn@gmail.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>

Unverified Commit 0fab52f0 authored Apr 01, 2026 by Elvir Crnčević Committed by GitHub Mar 31, 2026

Showing with 2 additions and 2 deletions

vllm/_custom_ops.py +2 -2

--- a/vllm/_custom_ops.py
+++ b/vllm/_custom_ops.py
@@ -56,11 +56,11 @@ def create_fp4_scale_tensor(
         rounded_m = round_up(m, 128)
         scale_n = n // block_size
         rounded_n = round_up(scale_n, 4)
-        return torch.empty(
+        return torch.zeros(
             (rounded_m, rounded_n // 4), device=device, dtype=torch.int32
         )
     else:
-        return torch.empty((m, n // block_size), device=device, dtype=torch.uint8)
+        return torch.zeros((m, n // block_size), device=device, dtype=torch.uint8)
 
 
 def create_fp4_output_tensors(
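
Note on why zero-filling matters: the swizzled branch rounds the row count up to a multiple of 128 and the scale column count up to a multiple of 4, so the allocation can be much larger than the region that actually receives scales. With `torch.empty` the extra slots keep whatever bytes were already in memory. The sketch below is a rough, torch-free illustration of that arithmetic, not the vLLM implementation. It assumes `block_size = 16` and that each scale is one FP8 E4M3 ("fn" variant) byte packed four per `int32`; neither is stated in this diff. `round_up` here is a local stand-in, and `swizzled_scale_shape`, `padding_slots`, and `decode_e4m3fn` are hypothetical helper names.

```python
import math


def round_up(x: int, multiple: int) -> int:
    """Local stand-in for the round_up helper referenced in the diff."""
    return ((x + multiple - 1) // multiple) * multiple


def swizzled_scale_shape(m: int, n: int, block_size: int = 16) -> tuple:
    """Shape of the int32 scale tensor, mirroring the swizzled branch."""
    rounded_m = round_up(m, 128)
    rounded_n = round_up(n // block_size, 4)
    return rounded_m, rounded_n // 4


def padding_slots(m: int, n: int, block_size: int = 16) -> int:
    """Count 1-byte scale slots that exist only because of padding."""
    rows, cols = swizzled_scale_shape(m, n, block_size)
    total = rows * cols * 4  # assumed: four 1-byte scales per int32
    return total - m * (n // block_size)


def decode_e4m3fn(byte: int) -> float:
    """Decode one FP8 E4M3 (fn) byte: 1 sign, 4 exponent, 3 mantissa bits."""
    sign = -1.0 if byte & 0x80 else 1.0
    exp = (byte >> 3) & 0xF
    mant = byte & 0x7
    if exp == 0xF and mant == 0x7:
        return math.nan  # the only NaN encodings are 0x7F and 0xFF
    if exp == 0:
        return sign * (mant / 8) * 2.0 ** -6  # subnormal range
    return sign * (1 + mant / 8) * 2.0 ** (exp - 7)


if __name__ == "__main__":
    print(swizzled_scale_shape(1, 64))  # one row, four real scales
    print(padding_slots(1, 64))         # slots never written by real data
    print(decode_e4m3fn(0x00), decode_e4m3fn(0x7F))
```

Under these assumptions, a single-row input with `n = 64` gets a `(128, 1)` allocation in which 508 of the 512 scale slots are padding. A zero byte decodes to 0.0, while a stale 0x7F or 0xFF byte decodes to NaN, which is consistent with the NaN described in the commit title if a consumer reads the padded region.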