Commit 707a54ef authored by dcuai's avatar dcuai
Browse files

更换dtk24.04.1镜像

parent 3c136bf1
......@@ -24,7 +24,7 @@ mv tinyllama_pytorch TinyLlama # 去框架名后缀
### Docker(方法一)
```
docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-centos7.6-dtk23.10-py38
docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-ubuntu20.04-dtk24.04.1-py3.10
# <your IMAGE ID>为以上拉取的docker的镜像ID替换,本镜像为:ffa1f63239fc
docker run -it --shm-size=32G -v $PWD/TinyLlama:/home/TinyLlama -v /opt/hyhal:/opt/hyhal --privileged=true --device=/dev/kfd --device=/dev/dri/ --group-add video --name tinyllama <your IMAGE ID> bash
cd /home/TinyLlama
......@@ -41,12 +41,17 @@ docker run --shm-size=32G --name tinyllama -v /opt/hyhal:/opt/hyhal --privileged
1、关于本项目DCU显卡所需的特殊深度学习库可从光合开发者社区下载安装:
- https://developer.hpccube.com/tool/
```
DTK驱动:dtk23.10
python:python3.8
DTK驱动:dtk24.04.1
python:python3.10
torch:2.1.0
torchvision:0.16.0
triton:2.1.0
apex:0.1
apex:1.1.0
flash_attn:2.0.4
xformers:0.0.25
rotary-emb:0.1
dropout-layer-norm:0.1
xentropy-cuda-lib:0.1
```
`Tips:以上dtk驱动、python、torch等DCU相关工具版本需要严格一一对应。`
......@@ -65,23 +70,7 @@ rm -rf /lib64/libstdc++.so.6
ln -s /lib64/libstdc++.so.6.0.26 /lib64/libstdc++.so.6
```
上述环境配置适于finetune及其推理,若希望从头训练,参照环境[`PRETRAIN.md`](./PRETRAIN.md),还需要安装以下几个库:
```
# 以下安装包可从whl.zip文件里获取
# flash_attn-2
pip install flash_attn-2.0.4_torch2.1_dtk2310-cp38-cp38-linux_x86_64.whl
# xformers
tar -xvf xformers.tar -C .
cd xformers
pip install xformers==0.0.23 --no-deps
bash patch_xformers.rocm.sh
# rotary
pip install rotary_emb-0.1_torch2.1_dtk2310-cp38-cp38-linux_x86_64.whl
# layer_norm
pip install dropout_layer_norm-0.1_torch2.1_dtk23.10-cp38-cp38-linux_x86_64.whl
# xentropy
pip install xentropy_cuda_lib-0.1_torch2.1_dtk2310-cp38-cp38-linux_x86_64.whl
```
## 数据集
`openassistant-guanaco`
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment