Unverified Commit 5a7e10fe authored by Cheng Wan's avatar Cheng Wan Committed by GitHub
Browse files

[MoE] fix: incorrect weight initialization for cutlass_fused_experts_fp8 (#10144)

parent 33467c05
......@@ -656,7 +656,7 @@ class Fp8MoEMethod(FusedMoEMethodBase):
)
self.c_strides2 = torch.full(
(num_experts,),
intermediate_size_per_partition,
hidden_size,
device=w2_weight.device,
dtype=torch.int64,
)
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment