Commit a85d578d authored by zhougaofeng's avatar zhougaofeng
Browse files

Update README.md

parent d42063c1
......@@ -28,7 +28,7 @@ DeepSeekMath在DeepSeek-Coder-Base-v1.5-7B的基础上进行预训练,使用了
此处提供[光源](https://www.sourcefind.cn/#/service-details)拉取 docker 镜像的地址与使用步骤
```
docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-centos7.6-dtk24.04-py310
docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-centos7.6-dtk23.10.1-py310
docker run -it --shm-size=1024G -v <Host Path>:<Container Path> -v /opt/hyhal:/opt/hyhal --privileged=true --device=/dev/kfd --device=/dev/dri/ --group-add video --name deepseek-math <your IMAGE ID> bash # <your IMAGE ID>为以上拉取的docker的镜像ID替换,本镜像为:c85ed27005f2
cd /home/deepseek-math-pytorch
pip install -r requirements.txt -i https://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com
......@@ -114,7 +114,6 @@ python 7B_single_node.py
使用的加速卡:2张 DCU-K100-64G
<div align=center>
<img src="./doc/inference.png"/>
<img src="./doc/result.png"/>
</div>
......@@ -124,7 +123,7 @@ python 7B_single_node.py
根据测试结果情况填写表格:
| device | train_loss |
| :------: | :------: |
| DCU-K100 | 1.0415 |
| DCU-K100 | 1.0417 |
| GPU-A800 | 1.0419 |
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment