"vscode:/vscode.git/clone" did not exist on "41d3071918bd6dcb6259bac8a617cfed45d42707"
Commit c3dfb0f6 authored by Jee Jee Li's avatar Jee Jee Li Committed by simon-mo
Browse files

[Bench] Add DeepSeekV32 to MoE benchmark (#25962)


Signed-off-by: default avatarJee Jee Li <pandaleefree@gmail.com>
Signed-off-by: default avatarsimon-mo <simon.mo@hey.com>
parent 83f3c9be
......@@ -584,8 +584,9 @@ def main(args: argparse.Namespace):
topk = config.num_experts_per_tok
intermediate_size = config.intermediate_size
elif config.architectures[0] in (
"DeepseekV3ForCausalLM",
"DeepseekV2ForCausalLM",
"DeepseekV3ForCausalLM",
"DeepseekV32ForCausalLM",
"Glm4MoeForCausalLM",
):
E = config.n_routed_experts
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment