Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
BERT_tensorflow
Commits
605b67a3
Commit
605b67a3
authored
Sep 07, 2024
by
dcuai
Browse files
Update README.md
parent
5b036548
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
15 additions
and
6 deletions
+15
-6
README.md
README.md
+15
-6
No files found.
README.md
View file @
605b67a3
...
...
@@ -76,8 +76,8 @@ squad-v1.1 eval脚本:[evaluate-v1.1.py](https://github.com/allenai/bi-att-flo
### Docker(方式一)
```
docker pull image.sourcefind.cn:5000/dcu/admin/base/tensorflow:2.
7.0-centos7.6-dtk-22.10.1-py37-latest
docker run -dit --network=host --name=bert_tensorflow --privileged --device=/dev/kfd --device=/dev/dri --ipc=host --shm-size=16G --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -u root --ulimit stack=-1:-1 --ulimit memlock=-1:-1 image.sourcefind.cn:5000/dcu/admin/base/tensorflow:2.
7.0-centos7.6-dtk-22.10.1-py37-latest
docker pull image.sourcefind.cn:5000/dcu/admin/base/tensorflow:2.
13.1-ubuntu20.04-dtk24.04.1-py3.10
docker run -dit --network=host --name=bert_tensorflow --privileged --device=/dev/kfd --device=/dev/dri --ipc=host --shm-size=16G --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -u root --ulimit stack=-1:-1 --ulimit memlock=-1:-1 image.sourcefind.cn:5000/dcu/admin/base/tensorflow:2.
13.1-ubuntu20.04-dtk24.04.1-py3.10
docker exec -it bert_tensorflow /bin/bash
pip install -r requirements.txt
```
...
...
@@ -94,15 +94,15 @@ pip install -r requirements.txt
### Conda(方式三)
```
conda create -n bert_tensorflow python=3.
7
conda create -n bert_tensorflow python=3.
10
pip install -r requirements.txt
```
安装过程可能顶掉DCU版本的tensorflow,可以到
[
开发者社区
](
https://
cancon
.hpccube.com
:65024/4/main/tensorflow/dtk22.10
)
下载DCU版本对应包
安装过程可能顶掉DCU版本的tensorflow,可以到
[
开发者社区
](
https://
developer
.hpccube.com
/tool/
)
下载DCU版本对应包
[
tensorflow2.
7
](
https://cancon.hpccube.com:65024/directlink/4/tensorflow/
dtk22
.1
0
/tensorflow-2.
7.0+git67f0ade9
.dtk2
210
-cp3
7
-cp3
7m
-manylinux
2014
_x86_64.whl
)
[
tensorflow2.
13.1
](
https://cancon.hpccube.com:65024/directlink/4/tensorflow/
DAS1
.1/tensorflow-2.
13.1+das1.1.git56b06c8.abi1
.dtk2
404
-cp3
11
-cp3
11
-manylinux
_2_31
_x86_64.whl
)
[
DTK2
2.10
.1
](
https://cancon.hpccube.com:65024/directlink/1/DTK-2
2.10.1/CentOS7.6/DTK-22.10.1-CentOS7.6
-x86_64.tar.gz
)
[
DTK2
4.04
.1
](
https://cancon.hpccube.com:65024/directlink/1/DTK-2
4.04.1/Ubuntu20.04.1/DTK-24.04.1-Ubuntu20.04.1
-x86_64.tar.gz
)
### python版本兼容
...
...
@@ -161,6 +161,8 @@ python3 tf2_encoder_checkpoint_converter.py \
将转换完后的bert_model.ckpt-1.data-00000-of-00001 改为bert_model.ckpt.data-00000-of-00001
bert_model.ckpt-1.index改为 bert_model.ckpt.index
如果报错 'no attriute experimental',则删除报错行中的experimental
```
### 单卡运行-MNLI
...
...
@@ -182,6 +184,13 @@ sh bert_class.sh
--model_dir 模型保存文件夹
--distribution_strategy 分布式策略
--num_gpus 使用gpu数量
修改库版本:
absl==1.4.0
tf-models-official==2.6.0
```
### 多卡运行-MNLI
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment