Commit 60bcbf2a authored by Xiaoyu Zhang, committed by GitHub

remove moe_align_block_size torch.zeros in small batch/expert mode (#5298)

parent a0a9f6d6
......@@ -707,7 +707,7 @@ def moe_align_block_size(
dtype=torch.int32,
device=topk_ids.device,
)
-cumsum_buffer = torch.zeros(
+cumsum_buffer = torch.empty(
num_experts + 1, dtype=torch.int32, device=topk_ids.device
)
......
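
The commit does not state why the zero-fill can be dropped. One plausible reading, and this is an assumption rather than something the diff confirms, is that on this code path the kernel writes every entry of cumsum_buffer before anything reads it, so the initial contents never matter. The pure-Python sketch below illustrates only that property: the helper name and the per-expert counts are hypothetical, and a list of random integers stands in for the uninitialized memory that torch.empty returns.

```python
import random


def exclusive_cumsum_into(counts, out):
    """Hypothetical stand-in for a kernel that overwrites every slot of `out`.

    Writes the exclusive prefix sum of `counts` into `out`, which must have
    len(counts) + 1 slots. No slot is read before it has been written.
    """
    out[0] = 0
    for i, count in enumerate(counts):
        out[i + 1] = out[i] + count
    return out


num_experts = 4
tokens_per_expert = [3, 0, 2, 5]  # hypothetical per-expert token counts

# Analogue of torch.zeros(num_experts + 1): pays for a fill pass up front.
zeroed = [0] * (num_experts + 1)

# Analogue of torch.empty(num_experts + 1): arbitrary leftover contents.
garbage = [random.randint(-10**6, 10**6) for _ in range(num_experts + 1)]

result_zeroed = exclusive_cumsum_into(tokens_per_expert, zeroed)
result_garbage = exclusive_cumsum_into(tokens_per_expert, garbage)

# Both buffers end up identical because every slot was overwritten.
print(result_zeroed == result_garbage)
```

If any slot were instead accumulated into (for example with `+=`) before being assigned, the uninitialized variant would produce garbage, which is why a change like this is only safe on paths where the full overwrite holds.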