README.md 3.69 KB
Newer Older
liucong's avatar
liucong committed
1
# GPT2
2

liucong's avatar
liucong committed
3
4
5
6
## 论文
Language Models are Unsupervised Multitask Learners

- https://d4mucfpksywv.cloudfront.net/better-language-models/language-models.pdf
7
8

## 模型结构
liucong's avatar
liucong committed
9
第二代生成式预训练模型(Generative Pre-Training2),GPT2主要使用Transformer的Decoder模块为特征提取器,并对Transformer Decoder进行了一些改动,原本的Decoder包含了两个Multi-Head Attention结构,而GPT2只保留了Mask Multi-Head Attention。
10

liucong's avatar
liucong committed
11
<img src="./Doc/Images/GPT_03.png" style="zoom:55%;" align=middle>
liucong's avatar
liucong committed
12

liucong's avatar
liucong committed
13
## 算法原理
liucong's avatar
liucong committed
14

liucong's avatar
liucong committed
15
GPT-2中使用了掩模自注意力(masked self-attention),通过屏蔽当前位置的右边token,使模型可以更好的预测下一个token。
16

liucong's avatar
liucong committed
17
<img src="./Doc/Images/GPT_04.png" style="zoom:70%;" align=middle>
18

liucong's avatar
liucong committed
19
20
## 环境配置

liucong's avatar
liucong committed
21
### Docker(方法一)
liucong's avatar
liucong committed
22
23
24
25

拉取镜像:

```
liucong's avatar
liucong committed
26
docker pull image.sourcefind.cn:5000/dcu/admin/base/migraphx:4.3.0-ubuntu20.04-dtk24.04.1-py3.10
27
28
```

liucong's avatar
liucong committed
29
创建并启动容器,安装相关依赖:
liucong's avatar
liucong committed
30
31

```
liucong's avatar
liucong committed
32
docker run --shm-size 16g --network=host --name=gpt2_migraphx --privileged --device=/dev/kfd --device=/dev/dri --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -v $PWD/gpt2_migraphx:/home/gpt2_migraphx -v /opt/hyhal:/opt/hyhal:ro -it <Your Image ID> /bin/bash
liucong's avatar
liucong committed
33
34
35
36
37
38
39
40
41
42
43

# 激活dtk
source /opt/dtk/env.sh
```

### Dockerfile(方法二)

```
cd ./docker
docker build --no-cache -t gpt2_migraphx:2.0 .

liucong's avatar
liucong committed
44
docker run --shm-size 16g --network=host --name=gpt2_migraphx --privileged --device=/dev/kfd --device=/dev/dri --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -v $PWD/gpt2_migraphx:/home/gpt2_migraphx -v /opt/hyhal:/opt/hyhal:ro -it <Your Image ID> /bin/bash
liucong's avatar
liucong committed
45

liucong's avatar
liucong committed
46
47
# 激活dtk
source /opt/dtk/env.sh
liucong's avatar
liucong committed
48
49
```

liucong's avatar
liucong committed
50
51
52
53
## 数据集

采用交互式界面,通过输入开头诗词,GPT2模型可以推理出后续的诗句。

liucong's avatar
liucong committed
54
55
56
57
## 推理

### Python版本推理

chenzk's avatar
chenzk committed
58
本次采用GPT-2模型进行诗词生成任务,模型文件下载链接:https://pan.baidu.com/s/1KWeoUuakCZ5dualK69qCcw , 提取码:4pmh ,并将GPT2_shici.onnx模型文件保存在Resource/文件夹下。下面介绍如何运行python代码示例,Python示例的详细说明见Doc目录下的Tutorial_Python.md。
liucong's avatar
liucong committed
59
60

#### 设置环境变量
liucong's avatar
liucong committed
61

liucong's avatar
liucong committed
62
63
64
```
export PYTHONPATH=/opt/dtk/lib:$PYTHONPATH
```
liucong's avatar
liucong committed
65

liucong's avatar
liucong committed
66
#### 运行示例
liucong's avatar
liucong committed
67

liucong's avatar
liucong committed
68
```
liucong's avatar
liucong committed
69
70
# 进入gpt2 migraphx工程根目录
cd <path_to_gpt2_migraphx> 
liucong's avatar
liucong committed
71
72

# 进入示例程序目录
liucong's avatar
liucong committed
73
cd Python/
liucong's avatar
liucong committed
74
75

# 安装依赖
liucong's avatar
liucong committed
76
pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple
liucong's avatar
liucong committed
77

liucong's avatar
liucong committed
78
# 运行示例
liucong's avatar
liucong committed
79
python gpt2.py
80
81
```

liucong's avatar
liucong committed
82
### C++版本推理
liucong's avatar
liucong committed
83

chenzk's avatar
chenzk committed
84
本次采用GPT-2模型进行诗词生成任务,模型文件下载链接:https://pan.baidu.com/s/1KWeoUuakCZ5dualK69qCcw , 提取码:4pmh ,并将GPT2_shici.onnx模型文件保存在Resource/文件夹下。下面介绍如何运行C++代码示例,C++示例的详细说明见Doc目录下的Tutorial_Cpp.md。
85

liucong's avatar
liucong committed
86
#### 构建工程
87
88
89
90
91

```
rbuild build -d depend
```

liucong's avatar
liucong committed
92
#### 设置环境变量
93
94
95
96

将依赖库依赖加入环境变量LD_LIBRARY_PATH,在~/.bashrc中添加如下语句:

```
liucong's avatar
liucong committed
97
export LD_LIBRARY_PATH=<path_to_gpt2_migraphx>/depend/lib/:$LD_LIBRARY_PATH
98
99
100
101
102
103
```

然后执行:

```
source ~/.bashrc
104
105
```

liucong's avatar
liucong committed
106
#### 运行示例
107
108

```python
liucong's avatar
liucong committed
109
110
# 进入gpt2 migraphx工程根目录
cd <path_to_gpt2_migraphx> 
111

liucong's avatar
liucong committed
112
# 进入build目录
liucong's avatar
liucong committed
113
cd build/
114

liucong's avatar
liucong committed
115
116
# 执行示例程序
./GPT2
117
118
```

liucong's avatar
liucong committed
119
120
121
122
## result

### python版本

liucong's avatar
liucong committed
123
<img src="./Doc/Images/result_Python.png" style="zoom:70%;" align=middle>
liucong's avatar
liucong committed
124
125

### C++版本
126

liucong's avatar
liucong committed
127
<img src="./Doc/Images/result_C++.png" style="zoom:70%;" align=middle>
128

liucong's avatar
liucong committed
129
130
131
132
## 应用场景

### 算法类别

liucong's avatar
liucong committed
133
`对话问答`
liucong's avatar
liucong committed
134
135
136

### 热点应用行业

liucong's avatar
liucong committed
137
`政府`,`零售`,`教育`,`科研`
liucong's avatar
liucong committed
138

liucong's avatar
liucong committed
139
## 源码仓库及问题反馈
140

chenzk's avatar
chenzk committed
141
https://developer.sourcefind.cn/codes/modelzoo/gpt2_migraphx
142

liucong's avatar
liucong committed
143
## 参考资料
144

chenzk's avatar
chenzk committed
145
https://github.com/Morizeyao/GPT2-Chinese