README.md 3.88 KB
Newer Older
dcuai's avatar
dcuai committed
1
2
3
# XrayGLM
## 论文
- 此处填写算法论文的在线pdf地址
wangsen's avatar
wangsen committed
4

dcuai's avatar
dcuai committed
5
6
## 模型结构
通用领域的大语言模型 (LLM),例如 ChatGPT,在遵循指令和产生类似人类响应方面取得了显著的成功,这种成功间接促进了多模态大模型的研究和发展,如通用领域的多模态大模型MiniGPT-4、mPLUG-Owl、Multimodal-GPT和LLaVA ,然而,此类多模态大模型却很少出现在医学领域的研究中,阻碍了相关研究发展。visual-med-alpaca虽然在医学多模态大模型方面做出了一些很有成效的工作,然而其数据为英文诊断报告,不利于促进中文领域医学多模态大模型的研究发展。为此,我们开发了XrayGLM以解决上述问题。XrayGLM在医学影像诊断和多轮交互对话上显示出了非凡的潜力。借助ChatGPT以及公开的数据集,构造了一个X光影像-诊断报告对的医学多模态数据集;把数据集分为TP和FN。
wangsen's avatar
wangsen committed
7
8
9

![Alt text](./images/image2.png)

dcuai's avatar
dcuai committed
10
## 算法原理
wangsen's avatar
wangsen committed
11
12
13
14
诊断数据集在VisualGLM-6B进行微调训练, 最后输出对胸片的询问结果

![Alt text](./images/image1.png)

dcuai's avatar
dcuai committed
15
16
## 环境配置
### Docker(方式一)
wangsen's avatar
wangsen committed
17
18
19
20
21
22
23
24
25
26
27
28
推荐使用docker方式运行,提供拉取的docker镜像:

```
docker pull image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-ubuntu20.04-dtk24.04.1-py3.10
docker run -dit --shm-size 80g --network=host --name=xrayglm --privileged --device=/dev/kfd --device=/dev/dri --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -u root -v /opt/hyhal/:/opt/hyhal/:ro image.sourcefind.cn:5000/dcu/admin/base/pytorch:2.1.0-ubuntu20.04-dtk24.04.1-py3.10 /bin/bash
docker exec -it xrayglm  /bin/bash
```

安装docker中没有的依赖:

```
pip install -r requirements.txt  -i http://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com
MPU王荣胜's avatar
MPU王荣胜 committed
29
30
```

dcuai's avatar
dcuai committed
31
### Dockerfile(方式二)
MPU王荣胜's avatar
MPU王荣胜 committed
32

wangsen's avatar
wangsen committed
33
34
```
docker build -t xrayglm:latest .
wangsen's avatar
wangsen committed
35
docker run -dit --shm-size 80g --network=host --name=xrayglm --privileged --device=/dev/kfd --device=/dev/dri --group-add video --cap-add=SYS_PTRACE --security-opt seccomp=unconfined -u root -v /opt/hyhal/:/opt/hyhal/:ro   xrayglm:latest /bin/bash
wangsen's avatar
wangsen committed
36
docker exec -it xrayglm   /bin/bash
MPU王荣胜's avatar
MPU王荣胜 committed
37
38
```

MPU王荣胜's avatar
MPU王荣胜 committed
39

dcuai's avatar
dcuai committed
40
### Conda(方式三)
MPU王荣胜's avatar
MPU王荣胜 committed
41

wangsen's avatar
wangsen committed
42
43
44
45
46
1.创建conda虚拟环境:

```
conda create -n xrayglm python=3.10
conda activate xrayglm 
MPU王荣胜's avatar
MPU王荣胜 committed
47
48
```

wangsen's avatar
wangsen committed
49
50
51
52
2.关于本项目DCU显卡所需的工具包、深度学习库等均可从光合开发者社区下载安装。
- [DTK 24.04.1](https://cancon.hpccube.com:65024/directlink/1/DTK-24.04.1/Ubuntu20.04.1/DTK-24.04.1-Ubuntu20.04.1-x86_64.tar.gz)
- [Pytorch 2.1](https://cancon.hpccube.com:65024/directlink/4/pytorch/DAS1.2/torch-2.1.0+das.opt1.dtk24042-cp310-cp310-manylinux_2_28_x86_64.whl)
- [DeepSpeed 0.12.3](https://download.sourcefind.cn:65024/directlink/4/deepspeed/DAS1.1/deepspeed-0.12.3+gita724046.abi1.dtk2404.torch2.1.0-cp310-cp310-manylinux_2_31_x86_64.whl)
MPU王荣胜's avatar
MPU王荣胜 committed
53
54


wangsen's avatar
wangsen committed
55
Tips:以上dtk驱动、torch等工具版本需要严格一一对应。
MPU王荣胜's avatar
MPU王荣胜 committed
56
57


wangsen's avatar
wangsen committed
58
3. 其它依赖库参照requirements.txt安装:
MPU王荣胜's avatar
MPU王荣胜 committed
59
60

```
wangsen's avatar
wangsen committed
61
62
63
python setup.py install
pip install -r requirements.txt -i http://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com
```
MPU王荣胜's avatar
MPU王荣胜 committed
64

dcuai's avatar
dcuai committed
65
## 数据集
MPU王荣胜's avatar
MPU王荣胜 committed
66

wangsen's avatar
wangsen committed
67
暂无
MPU王荣胜's avatar
MPU王荣胜 committed
68

dcuai's avatar
dcuai committed
69
## 训练
MPU王荣胜's avatar
MPU王荣胜 committed
70

wangsen's avatar
wangsen committed
71
暂无
MPU王荣胜's avatar
MPU王荣胜 committed
72

dcuai's avatar
dcuai committed
73
## 推理
MPU王荣胜's avatar
MPU王荣胜 committed
74

dcuai's avatar
dcuai committed
75
cli推理
wangsen's avatar
wangsen committed
76

wangsen's avatar
wangsen committed
77
根据需要修改预训练模型以及cli_demo.py中48行visualglm-6b模型的位置
MPU王荣胜's avatar
MPU王荣胜 committed
78

wangsen's avatar
wangsen committed
79
80
```
python cli_demo.py --from_pretrained checkpoints/checkpoints-XrayGLM-300 --prompt_zh '详细描述这张胸部X光片的诊断结果'
MPU王荣胜's avatar
MPU王荣胜 committed
81
82
```

dcuai's avatar
dcuai committed
83
84
## result

wangsen's avatar
wangsen committed
85
![Alt text](./images/image3.png)
MPU王荣胜's avatar
MPU王荣胜 committed
86

dcuai's avatar
dcuai committed
87
### 精度
MPU王荣胜's avatar
MPU王荣胜 committed
88

dcuai's avatar
dcuai committed
89

MPU王荣胜's avatar
MPU王荣胜 committed
90

dcuai's avatar
dcuai committed
91
## 应用场景
MPU王荣胜's avatar
MPU王荣胜 committed
92

dcuai's avatar
dcuai committed
93
94
95
### 算法类别
NLP
   
dcuai's avatar
dcuai committed
96
### 热点应用行业
MPU王荣胜's avatar
MPU王荣胜 committed
97

wangsen's avatar
wangsen committed
98
科研   医疗 教育
MPU王荣胜's avatar
MPU王荣胜 committed
99

dcuai's avatar
dcuai committed
100
101
102
103
104
105
106
107
## 预训练模型
需下载模型 xrayGLM-300和visualglm-6b模型

- [checkpoints-XrayGLM-300](https://huggingface.co/wangrongsheng/XrayGLM-300/tree/main)
- [visualglm-6b](http://113.200.138.88:18080/aimodels/visualglm-6b)


## 源码仓库及问题反馈
wangsen's avatar
wangsen committed
108
109


wangsen's avatar
wangsen committed
110
111
http://developer.sourcefind.cn/codes/modelzoo/xrayglm.git

wangsen's avatar
wangsen committed
112
113


dcuai's avatar
dcuai committed
114
## 参考资料
wangsen's avatar
wangsen committed
115

wangsen's avatar
wangsen committed
116

wangsen's avatar
wangsen committed
117
https://github.com/WangRongsheng/XrayGLM.git
MPU王荣胜's avatar
MPU王荣胜 committed
118

MPU王荣胜's avatar
MPU王荣胜 committed
119

MPU王荣胜's avatar
MPU王荣胜 committed
120

MPU王荣胜's avatar
MPU王荣胜 committed
121

MPU王荣胜's avatar
MPU王荣胜 committed
122