Commit fe37e5a1 authored by zhougaofeng's avatar zhougaofeng
Browse files

Update README.md

parent 139b3988
...@@ -21,11 +21,10 @@ InternLM-Math是基于InternLM2-Base模型进行数学预训练得到的大型 ...@@ -21,11 +21,10 @@ InternLM-Math是基于InternLM2-Base模型进行数学预训练得到的大型
``` ```
docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-centos7.6-dtk24.04-py310 docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-centos7.6-dtk24.04-py310
docker run -it --shm-size=1024G -v /parastor/home/internlm-math-pytorch:/home/internlm-math-pytorch -v /opt/hyhal:/opt/hyhal --privileged=true --device=/dev/kfd --device=/dev/dri/ --group-add video --name internlm-math <your IMAGE ID> bash # <your IMAGE ID>为以上拉取的docker的镜像ID替换,本镜像为:c85ed27005f2 docker run -it --shm-size=1024G -v <Host Path>:<Container Path> -v /opt/hyhal:/opt/hyhal --privileged=true --device=/dev/kfd --device=/dev/dri/ --group-add video --name internlm-math <your IMAGE ID> bash # <your IMAGE ID>为以上拉取的docker的镜像ID替换,本镜像为:c85ed27005f2
cd /home/internlm-math-pytorch cd /home/internlm-math-pytorch
pip install -r requirement.txt -i https://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com pip install -r requirements.txt -i https://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com
# deepspeed、bitsandbytes可从whl.zip文件里获取安装: # bitsandbytes可从whl.zip文件里获取安装:
pip install deepspeed-0.12.3+das1.0+gita724046.abi0.dtk2404.torch2.1.0-cp310-cp310-manylinux2014_x86_64.whl
pip install bitsandbytes-0.42.0-py3-none-any.whl pip install bitsandbytes-0.42.0-py3-none-any.whl
``` ```
...@@ -37,9 +36,8 @@ pip install bitsandbytes-0.42.0-py3-none-any.whl ...@@ -37,9 +36,8 @@ pip install bitsandbytes-0.42.0-py3-none-any.whl
``` ```
docker build -t internlm-math-df:latest . docker build -t internlm-math-df:latest .
docker run -it --shm-size=1024G -v /parastor/home/internlm-math-pytorch:/home/internlm-math-pytorch -v /opt/hyhal:/opt/hyhal --privileged=true --device=/dev/kfd --device=/dev/dri/ --group-add video --name internlm-math internlm-math-df bash docker run -it --shm-size=1024G -v /parastor/home/internlm-math-pytorch:/home/internlm-math-pytorch -v /opt/hyhal:/opt/hyhal --privileged=true --device=/dev/kfd --device=/dev/dri/ --group-add video --name internlm-math internlm-math-df bash
pip install -r requirement.txt -i https://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com pip install -r requirements.txt -i https://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com
# deepspeed、bitsandbytes可从whl.zip文件里获取安装: # bitsandbytes可从whl.zip文件里获取安装:
pip install deepspeed-0.12.3+das1.0+gita724046.abi0.dtk2404.torch2.1.0-cp310-cp310-manylinux2014_x86_64.whl
pip install bitsandbytes-0.42.0-py3-none-any.whl pip install bitsandbytes-0.42.0-py3-none-any.whl
``` ```
...@@ -80,7 +78,7 @@ pip install -r requirements.txt ...@@ -80,7 +78,7 @@ pip install -r requirements.txt
## 训练 ## 训练
一般情况下,ModelZoo 上的项目提供单机训练的启动方法即可,单机单卡、单机多卡至少提供其一训练方法。 根据实际路径修改模型路径和数据集路径
### 单机单卡 ### 单机单卡
...@@ -98,7 +96,7 @@ sh multi_node.sh ...@@ -98,7 +96,7 @@ sh multi_node.sh
## 推理 ## 推理
``` ```
cd examples/inference cd inference
sh single_node.sh sh single_node.sh
``` ```
...@@ -107,7 +105,7 @@ sh single_node.sh ...@@ -107,7 +105,7 @@ sh single_node.sh
使用的加速卡:2张 DCU-K100-64G 使用的加速卡:2张 DCU-K100-64G
<div align=center> <div align=center>
<img src="./doc/training_loss.png"/> <img src="./doc/inf_result.png"/>
</div> </div>
### 精度 ### 精度
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment