Commit b85974a6 authored by silencealiang

Update train_deepseekv3_671B_multinodes.sh

parent 84af95b0
......@@ -36,7 +36,7 @@ export NCCL_TOPO_FILE="./topo-input.xml"
# enable BatchLinear
export GROUPED_GEMM_BatchLinear=1
-export MP_PP0_LAYERS=2 # whether to enable this depends on your actual setup
+export MP_PP0_LAYERS=5 # whether to enable this depends on your actual setup
### BASE CONFIG ###
MODEL_SIZE=A37B
......