Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
evt_fugx1
dcu_megatron
Commits
ee3ff5df
Commit
ee3ff5df
authored
May 07, 2025
by
silencealiang
Browse files
Update deepseekv3 parameters to avoid expert select fault
parent
54740897
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
2 deletions
+2
-2
examples/deepseek_v3/train_deepseekv3_671B_1nodes.sh
examples/deepseek_v3/train_deepseekv3_671B_1nodes.sh
+2
-2
No files found.
examples/deepseek_v3/train_deepseekv3_671B_1nodes.sh
View file @
ee3ff5df
...
@@ -114,8 +114,8 @@ moe_options=" \
...
@@ -114,8 +114,8 @@ moe_options=" \
--moe-pad-expert-input-to-capacity
\
--moe-pad-expert-input-to-capacity
\
--moe-token-dispatcher-type alltoall
\
--moe-token-dispatcher-type alltoall
\
--moe-router-topk
${
ROUTER_TOPK
}
\
--moe-router-topk
${
ROUTER_TOPK
}
\
--moe-router-group-topk
2
\
--moe-router-group-topk
1
\
--moe-router-num-groups
4
\
--moe-router-num-groups
1
\
--num-experts
${
NUM_EXPERTS
}
\
--num-experts
${
NUM_EXPERTS
}
\
--expert-model-parallel-size
${
EP
}
\
--expert-model-parallel-size
${
EP
}
\
--expert-tensor-parallel-size
${
ETP
}
\
--expert-tensor-parallel-size
${
ETP
}
\
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment