"torchvision/vscode:/vscode.git/clone" did not exist on "22bc44ed76ec314e8c27df1575757d82f2eda0a3"
Commit 59851047 authored by dcuai's avatar dcuai
Browse files

Update README.md

parent ca1c52dd
......@@ -24,7 +24,7 @@ ResNet50使用了多个具有残差连接的残差块来解决梯度消失或梯
```
拉取镜像:
docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:1.10.0-centos7.6-dtk-22.10.1-py37-latest
docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-ubuntu20.04-dtk24.04.1-py3.10
创建并启动容器:
docker run --shm-size 16g --network=host --name=resnet50_pytorch --privileged --device=/dev/kfd --device=/dev/dri --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -v $PWD/resnet50-pytorch:/home/resnet50_pytorch -it <Your Image ID> bash
安装依赖:
......@@ -45,11 +45,11 @@ docker run --rm --shm-size 16g --network=host --name=resnet50_pytorch --privileg
https://developer.hpccube.com/tool/
```
DTK驱动:dtk22.10.1
python:python3.7
torch:1.10.0
torchvision:0.10.0
apex:0.1
DTK驱动:dtk24.04.1
python:python3.10
torch:2.1.0
torchvision:0.16.0
apex:1.1
```
`Tips:以上DTK、python、torch等DCU相关工具包,版本需要严格一一对应`
......@@ -100,13 +100,17 @@ python3 train.py --batch-size=64 --arch=resnet50 -j 6 --epochs=90 --amp --opt-le
### 单机四卡训练(单精度)
```
mpirun --allow-run-as-root --bind-to none -np 4 scrips/single_process.sh localhost resnet50 64
cd scrips
chmod +x single_process.sh
mpirun --allow-run-as-root --bind-to none -np 4 single_process.sh localhost resnet50 64
```
### 单机四卡训练(混合精度)
```
mpirun --allow-run-as-root --bind-to none -np 4 scrips/single_process_amp.sh localhost resnet50 64
cd scrips
chmod +x single_process_amp.sh
mpirun --allow-run-as-root --bind-to none -np 4 single_process_amp.sh localhost resnet50 64
```
## result
......@@ -140,6 +144,6 @@ mpirun --allow-run-as-root --bind-to none -np 4 scrips/single_process_amp.sh loc
https://developer.hpccube.com/codes/modelzoo/resnet50-pytorch
# 参考
# 参考资料
https://github.com/pytorch/examples/tree/master/imagenet
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment