Commit d46fbe3c authored by dcuai's avatar dcuai
Browse files

Update README.md

parent 6f808229
Pipeline #1911 canceled with stages
...@@ -80,6 +80,23 @@ pip install -r requirements.txt -i http://mirrors.aliyun.com/pypi/simple ...@@ -80,6 +80,23 @@ pip install -r requirements.txt -i http://mirrors.aliyun.com/pypi/simple
``` ```
{"image": ["./imgs/DUE_Benchmark/DocVQA/pngs/xnbl0037_1.png"], "messages": [{"role": "user", "content": "<|image|>what is the date mentioned in this letter?"}, {"role": "assistant", "content": "1/8/93"}], "task_name": "qa_sft", "dataset_name": "DocVQA"} {"image": ["./imgs/DUE_Benchmark/DocVQA/pngs/xnbl0037_1.png"], "messages": [{"role": "user", "content": "<|image|>what is the date mentioned in this letter?"}, {"role": "assistant", "content": "1/8/93"}], "task_name": "qa_sft", "dataset_name": "DocVQA"}
``` ```
数据集目录结构如下:
```
── DocLocal4K
│   ├── imgs
│ │    ├── ChartQA
│ │ │
│ │   ├── DUE_Benchmark
│ │ │
│ │   ├── TextVQA
│ │ │
│ │ └── VisualMRC
│   ├── dataset_infos.json
│ ├── mini_imges.jsonl
│   ├── text_grounding.jsonl
│   ├── text_recognition.jsonl
│ └── README.md
```
## 训练 ## 训练
...@@ -87,7 +104,7 @@ pip install -r requirements.txt -i http://mirrors.aliyun.com/pypi/simple ...@@ -87,7 +104,7 @@ pip install -r requirements.txt -i http://mirrors.aliyun.com/pypi/simple
项目中的测试数据集可从快速下载通道下载 [DocLocal4K](http://113.200.138.88:18080/aidatasets/doclocal4k) 项目中的测试数据集可从快速下载通道下载 [DocLocal4K](http://113.200.138.88:18080/aidatasets/doclocal4k)
根据实际情况在脚本中修相关路径 根据实际情况在脚本中修相关路径,快速测试data_path可使用./DocLocal4K/mini_imges.jsonl,整体测试可选./DocLocal4K/text_recognition.jsonl、./DocLocal4K/text_grounding.jsonl
--deepspeed --deepspeed
...@@ -126,10 +143,16 @@ python docowl_infer.py ...@@ -126,10 +143,16 @@ python docowl_infer.py
### 指令问答 ### 指令问答
<div align=center> ```
<img src="./assets/result.jpg"/> image: './image/pic.jpg'
</div> query: '详细描述这张图片'
answer: 'christmas decorations in a store window <ocr> 1 </ocr>'
```
```
image: './image/car.jpg'
query: '详细描述这张图片'
answer: 'A blue license plate with the characters <ocr> A KC087 </ocr>'
```
### 精度 ### 精度
测试数据: [mini DocLocal4K](./DocLocal4K/mini_imges.jsonl),使用的加速卡:K100/A800。 测试数据: [mini DocLocal4K](./DocLocal4K/mini_imges.jsonl),使用的加速卡:K100/A800。
...@@ -142,7 +165,7 @@ python docowl_infer.py ...@@ -142,7 +165,7 @@ python docowl_infer.py
## 应用场景 ## 应用场景
### 算法类别 ### 算法类别
`OCR, 对话问答` `OCR`
### 热点应用行业 ### 热点应用行业
`金融,教育,政府,交通` `金融,教育,政府,交通`
...@@ -156,7 +179,7 @@ python docowl_infer.py ...@@ -156,7 +179,7 @@ python docowl_infer.py
项目中的预训练权重可从快速下载通道下载: [DocOwl1.5-Omni](http://113.200.138.88:18080/aimodels/mplug-doclcal_1.5) 项目中的预训练权重可从快速下载通道下载: [DocOwl1.5-Omni](http://113.200.138.88:18080/aimodels/mplug-doclcal_1.5)
## 源码仓库及问题反馈 ## 源码仓库及问题反馈
- http://developer.hpccube.com/codes/modelzoo/umt5.git - http://developer.sourcefind.cn/codes/modelzoo/mplug-docowl_pytorch.git
## 参考资料 ## 参考资料
- [iic/DocOwl1.5-Omni 魔搭](https://www.modelscope.cn/models/iic/DocOwl1.5-Omni/) - [iic/DocOwl1.5-Omni 魔搭](https://www.modelscope.cn/models/iic/DocOwl1.5-Omni/)
......
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment