Unverified Commit ee9d6ca6 authored by Yangcheng Li's avatar Yangcheng Li Committed by GitHub
Browse files

[fix/misc] remove duplicate row in deepseek v2 model (#5279)

parent 2dd64894
......@@ -180,7 +180,6 @@ class DeepseekV2MoE(nn.Module):
else 0
)
self.routed_scaling_factor = config.routed_scaling_factor
if self.tp_size > config.n_routed_experts:
raise ValueError(
f"Tensor parallel size {self.tp_size} is greater than "
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment