Commit 605b67a3 authored by dcuai

Update README.md

parent 5b036548
@@ -76,8 +76,8 @@ squad-v1.1 eval script: [evaluate-v1.1.py](https://github.com/allenai/bi-att-flo
 ### Docker (Option 1)
 ```
-docker pull image.sourcefind.cn:5000/dcu/admin/base/tensorflow:2.7.0-centos7.6-dtk-22.10.1-py37-latest
-docker run -dit --network=host --name=bert_tensorflow --privileged --device=/dev/kfd --device=/dev/dri --ipc=host --shm-size=16G --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -u root --ulimit stack=-1:-1 --ulimit memlock=-1:-1 image.sourcefind.cn:5000/dcu/admin/base/tensorflow:2.7.0-centos7.6-dtk-22.10.1-py37-latest
+docker pull image.sourcefind.cn:5000/dcu/admin/base/tensorflow:2.13.1-ubuntu20.04-dtk24.04.1-py3.10
+docker run -dit --network=host --name=bert_tensorflow --privileged --device=/dev/kfd --device=/dev/dri --ipc=host --shm-size=16G --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -u root --ulimit stack=-1:-1 --ulimit memlock=-1:-1 image.sourcefind.cn:5000/dcu/admin/base/tensorflow:2.13.1-ubuntu20.04-dtk24.04.1-py3.10
 docker exec -it bert_tensorflow /bin/bash
 pip install -r requirements.txt
 ```
@@ -94,15 +94,15 @@ pip install -r requirements.txt
 ### Conda (Option 3)
 ```
-conda create -n bert_tensorflow python=3.7
+conda create -n bert_tensorflow python=3.10
 pip install -r requirements.txt
 ```
-The installation may overwrite the DCU build of tensorflow; if it does, download the matching DCU packages from the [developer community](https://cancon.hpccube.com:65024/4/main/tensorflow/dtk22.10)
-[tensorflow2.7](https://cancon.hpccube.com:65024/directlink/4/tensorflow/dtk22.10/tensorflow-2.7.0+git67f0ade9.dtk2210-cp37-cp37m-manylinux2014_x86_64.whl)
-[DTK22.10.1](https://cancon.hpccube.com:65024/directlink/1/DTK-22.10.1/CentOS7.6/DTK-22.10.1-CentOS7.6-x86_64.tar.gz)
+The installation may overwrite the DCU build of tensorflow; if it does, download the matching DCU packages from the [developer community](https://developer.hpccube.com/tool/)
+[tensorflow2.13.1](https://cancon.hpccube.com:65024/directlink/4/tensorflow/DAS1.1/tensorflow-2.13.1+das1.1.git56b06c8.abi1.dtk2404-cp311-cp311-manylinux_2_31_x86_64.whl)
+[DTK24.04.1](https://cancon.hpccube.com:65024/directlink/1/DTK-24.04.1/Ubuntu20.04.1/DTK-24.04.1-Ubuntu20.04.1-x86_64.tar.gz)
 ### Python version compatibility
@@ -161,6 +161,8 @@ python3 tf2_encoder_checkpoint_converter.py \
 After conversion, rename bert_model.ckpt-1.data-00000-of-00001 to bert_model.ckpt.data-00000-of-00001
 and bert_model.ckpt-1.index to bert_model.ckpt.index
+If the error 'no attribute experimental' is raised, delete experimental from the line that raised it
 ```
 ### Single-card run - MNLI
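The rename step described in this hunk can be scripted. The following is a minimal sketch, assuming the converter wrote its files with the `bert_model.ckpt-1` prefix into a single directory. The helper name `strip_step_suffix` and its arguments are illustrative and not part of the repository.

```python
from pathlib import Path


def strip_step_suffix(ckpt_dir, old_prefix="bert_model.ckpt-1", new_prefix="bert_model.ckpt"):
    """Rename files such as bert_model.ckpt-1.index to bert_model.ckpt.index.

    Hypothetical helper: the two prefixes come from the README text,
    the directory layout is an assumption.
    """
    renamed = []
    # sorted() materialises the listing first, so renaming while looping is safe.
    for path in sorted(Path(ckpt_dir).iterdir()):
        # Match "<old_prefix>." only, so files like bert_model.ckpt-10.* are left alone.
        if path.is_file() and path.name.startswith(old_prefix + "."):
            target = path.with_name(new_prefix + path.name[len(old_prefix):])
            path.rename(target)
            renamed.append(target.name)
    return renamed
```

Calling `strip_step_suffix("converted_ckpt")` (a placeholder directory name) returns the list of new file names, which makes it easy to confirm that both the `.data-00000-of-00001` and `.index` files were renamed.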
@@ -182,6 +184,13 @@ sh bert_class.sh
 --model_dir directory where the model is saved
 --distribution_strategy distribution strategy
 --num_gpus number of GPUs to use
+Change the library versions to:
+absl==1.4.0
+tf-models-official==2.6.0
 ```
 ### Multi-card run - MNLI
......
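The version pins added in the last hunk can be checked from inside the environment before launching `bert_class.sh`. The following is a minimal sketch using the standard-library `importlib.metadata`. The helper name `check_pins` is hypothetical, and the distribution name `absl-py` is an assumption: the README writes `absl`, while the package is published on PyPI as `absl-py`.

```python
from importlib import metadata

# Pins taken from the README hunk; "absl-py" is assumed to be the
# distribution that the README refers to as "absl".
PINS = {"absl-py": "1.4.0", "tf-models-official": "2.6.0"}


def check_pins(pins=PINS, lookup=metadata.version):
    """Return {name: (wanted, installed)} for every pin that is not satisfied.

    installed is None when the distribution is not installed at all.
    """
    mismatches = {}
    for name, wanted in pins.items():
        try:
            installed = lookup(name)
        except metadata.PackageNotFoundError:
            installed = None
        if installed != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches
```

An empty dictionary from `check_pins()` means both pins are satisfied. Any entry names a package to reinstall at the pinned version. The `lookup` parameter exists only so the check can be exercised without touching the real environment.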