README.md 2.14 KB
Newer Older
chenxj's avatar
chenxj committed
1
# bert_large_squad_onnxruntime
chenxj's avatar
chenxj committed
2
## 论文
chenxj's avatar
chenxj committed
3
[BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding](https://arxiv.org/pdf/1810.04805.pdf)
sugon_cxj's avatar
sugon_cxj committed
4
## 模型结构
chenxj's avatar
chenxj committed
5
6
7
8
9
10
11
12
13
14
bert_large_squad核心是transformer,transformer结构如下:

![image](https://developer.hpccube.com/codes/modelzoo/bert_large_squad_onnx/-/raw/main/resources/transformer.png)
## 算法原理
bert_large_squad模型的主要参数为:24个transformer层、1024个hidden size、16个self-attention heads,简要原理可用下图表示:

![image](https://developer.hpccube.com/codes/modelzoo/bert_large_squad_onnx/-/raw/main/resources/squad.png)
## 数据集
暂无合适中文数据集
## 环境配置
sugon_cxj's avatar
sugon_cxj committed
15
16
17
[光源](https://www.sourcefind.cn/#/service-details)可拉取推理的docker镜像,在[光合开发者社区](https://cancon.hpccube.com:65024/4/main/)可下载onnxruntime安装包。bert_large_squad_onnx推荐的镜像如下:
```
docker pull image.sourcefind.cn:5000/dcu/admin/base/custom:ort1.14.0_migraphx3.0.0-dtk22.10.1
chenxj's avatar
chenxj committed
18
docker run -d -t --privileged --device=/dev/kfd --device=/dev/dri/ --network=host --group-add video --name bert-test image.sourcefind.cn:5000/dcu/admin/base/custom:ort1.14.0_migraphx3.0.0-dtk22.10.1
chenxj's avatar
chenxj committed
19
docker exec -it bert-test bash
sugon_cxj's avatar
sugon_cxj committed
20
21
```
[huggingface](https://huggingface.co/ctuning/mlperf-inference-bert-onnx-fp32-squad-v1.1)下载模型model.onnx到当前目录
sugon_cxj's avatar
sugon_cxj committed
22

chenzk's avatar
chenzk committed
23
24
25
预训练权重快速下载中心:[SCNet AIModels](http://113.200.138.88:18080/aimodels) ,项目中的预训练权重可从快速下载通道下载:
[mlperf-inference-bert-onnx-fp32-squad-v1.1](http://113.200.138.88:18080/aimodels/mlperf-inference-bert-onnx-fp32-squad-v1.1)

sugon_cxj's avatar
sugon_cxj committed
26
27
执行fp16转换
```
chenxj's avatar
chenxj committed
28
pip3 install -r requirements.txt
sugon_cxj's avatar
sugon_cxj committed
29
30
python3 fp16-convert.py
```
chenxj's avatar
chenxj committed
31
## 推理
sugon_cxj's avatar
sugon_cxj committed
32
33
34
```
python3 main.py
```
chenxj's avatar
chenxj committed
35
36
## result
![image](https://developer.hpccube.com/codes/modelzoo/bert_large_squad_onnx/-/raw/main/resources/bert_result.png)
chenxj's avatar
chenxj committed
37
### 精度
chenxj's avatar
chenxj committed
38
暂无
chenxj's avatar
chenxj committed
39
40
## 应用场景
### 算法类别
chenxj's avatar
chenxj committed
41
对话问答
chenxj's avatar
chenxj committed
42
### 热点应用行业
chenxj's avatar
chenxj committed
43
医疗,科研,金融,教育
sugon_cxj's avatar
sugon_cxj committed
44
## 源码仓库及问题反馈
chenxj's avatar
chenxj committed
45
https://developer.hpccube.com/codes/modelzoo/bert_large_squad_onnxruntime
chenxj's avatar
chenxj committed
46
## 参考资料
sugon_cxj's avatar
sugon_cxj committed
47
https://github.com/google-research/bert
sugon_cxj's avatar
sugon_cxj committed
48