Commit 340ddce9 authored by wxj's avatar wxj
Browse files

Update README.md

parent ed2d07ae
Pipeline #2095 failed with stages
in 0 seconds
......@@ -113,7 +113,8 @@ DATA_PATH="./dataset/oscar-1GB-gpt_text_document"
3. 如果需要免密,docker run容器时,docker -v /root/.ssh 挂载.ssh目录
4. 容器间mpirun执行: `mpirun -np .. --hostfile hosts -mca plm_rsh_args "-p 12345" ./xx.sh master_ip`
设有节点192.168.1.1和192.168.1.2两个节点, 每个节点上8张卡, 192.168.1.1作为master节点
**案例**: 设有节点192.168.1.1和192.168.1.2两个节点, 每个节点上8张卡, 192.168.1.1作为master节点
hosts文件:
```txt
......@@ -171,7 +172,7 @@ DATA_PATH="/datasets/oscar-1GB-llama_text_document"
- 多机多卡训练
设有节点192.168.1.1和192.168.1.2两个节点, 每个节点上8张卡, 192.168.1.1作为master节点
**案例**: 设有节点192.168.1.1和192.168.1.2两个节点, 每个节点上8张卡, 192.168.1.1作为master节点
hosts配置如上文GTP所示
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment