Unverified Commit 5a7e10fe authored by Cheng Wan's avatar Cheng Wan Committed by GitHub
Browse files

[MoE] fix: incorrect weight initialization for cutlass_fused_experts_fp8 (#10144)

parent 33467c05
...@@ -656,7 +656,7 @@ class Fp8MoEMethod(FusedMoEMethodBase): ...@@ -656,7 +656,7 @@ class Fp8MoEMethod(FusedMoEMethodBase):
) )
self.c_strides2 = torch.full( self.c_strides2 = torch.full(
(num_experts,), (num_experts,),
intermediate_size_per_partition, hidden_size,
device=w2_weight.device, device=w2_weight.device,
dtype=torch.int64, dtype=torch.int64,
) )
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment