Commit cf6398fb authored by zhougaofeng

Update README.md

parent fc2244aa
......@@ -8,7 +8,7 @@
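## Model Architecture
DeepSeekMath is further pre-trained from DeepSeek-Coder-Base-v1.5-7B on 120B math-related tokens from Common Crawl, together with natural-language and code data. Its accuracy on the competition-level MATH benchmark exceeds that of larger models such as qwen-72B and Llemma-34B.
DeepSeekMath is further pre-trained from DeepSeek-Coder-Base-v1.5-7B on 120B math-related tokens from Common Crawl, together with natural-language and code data. Without relying on external toolkits or voting techniques, DeepSeekMath 7B achieves an impressive 51.7% on the competition-level MATH benchmark, approaching the performance of Gemini-Ultra and GPT-4.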
<div align=center>
<img src="./doc/model_accuracy.png"/>
</div>
......@@ -32,7 +32,7 @@ docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-centos7.6-dtk2
docker run -it --shm-size=1024G -v <Host Path>:<Container Path> -v /opt/hyhal:/opt/hyhal --privileged=true --device=/dev/kfd --device=/dev/dri/ --group-add video --name deepseek-math <your IMAGE ID> bash # replace <your IMAGE ID> with the ID of the docker image pulled above; for this image it is c85ed27005f2
cd /home/deepseek-math-pytorch
pip install -r requirements.txt -i https://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com
# deepspeed and bitsandbytes can be installed from the wheels in whl.zip:
# bitsandbytes can be installed from the wheel in whl.zip:
pip install bitsandbytes-0.42.0-py3-none-any.whl
pip uninstall vllm
......@@ -47,7 +47,7 @@ pip uninstall vllm
docker build -t deepseek-math-df:latest .
docker run -it --shm-size=1024G -v <Host Path>:<Container Path> -v /opt/hyhal:/opt/hyhal --privileged=true --device=/dev/kfd --device=/dev/dri/ --group-add video --name deepseek-math deepseek-math-df bash
pip install -r requirements.txt -i https://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com
# deepspeed and bitsandbytes can be installed from the wheels in whl.zip:
# bitsandbytes can be installed from the wheel in whl.zip:
pip install bitsandbytes-0.42.0-py3-none-any.whl
pip uninstall vllm
......@@ -127,7 +127,7 @@ sh 7B_single_node.sh
| DCU-K100 | 1.0415 |
| GPU-A800 | 1.0419 |
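
To put the two figures in the table above in perspective, the relative gap between them can be computed directly. The following is only a minimal sketch: it assumes both values are the same metric from comparable runs (the table header is not shown in this hunk), and the variable names and the choice of the A800 value as the reference are illustrative, not part of the project.

```shell
# Values copied from the table above; the metric name is not visible in this hunk.
dcu_k100=1.0415
gpu_a800=1.0419

# Relative gap in percent, using the A800 value as the reference (an arbitrary choice).
rel_gap_pct=$(awk -v a="$dcu_k100" -v b="$gpu_a800" \
  'BEGIN { d = a - b; if (d < 0) d = -d; printf "%.4f", 100 * d / b }')

echo "relative gap: ${rel_gap_pct}%"
```

Under these assumptions the gap is about 0.04%, so the two devices report nearly the same value.
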
## Application Scenarios
### Algorithm Category
......