Unverified Commit 0fab52f0 authored by Elvir Crnčević's avatar Elvir Crnčević Committed by GitHub
Browse files

Fix NaN from stale FP4 scale padding in create_fp4_scale_tensor (#38148)


Signed-off-by: default avatarElvir Crncevic <elvircrn@gmail.com>
Co-authored-by: default avatarTyler Michael Smith <tyler@neuralmagic.com>
parent 91e4521f
...@@ -56,11 +56,11 @@ def create_fp4_scale_tensor( ...@@ -56,11 +56,11 @@ def create_fp4_scale_tensor(
rounded_m = round_up(m, 128) rounded_m = round_up(m, 128)
scale_n = n // block_size scale_n = n // block_size
rounded_n = round_up(scale_n, 4) rounded_n = round_up(scale_n, 4)
return torch.empty( return torch.zeros(
(rounded_m, rounded_n // 4), device=device, dtype=torch.int32 (rounded_m, rounded_n // 4), device=device, dtype=torch.int32
) )
else: else:
return torch.empty((m, n // block_size), device=device, dtype=torch.uint8) return torch.zeros((m, n // block_size), device=device, dtype=torch.uint8)
def create_fp4_output_tensors( def create_fp4_output_tensors(
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment