update README

ee178109 · chenxj · c2d7f80d · ee178109 · ee178109 · ee178109
Commit ee178109 authored Sep 20, 2023 by chenxj
Hide whitespace changes
Inline Side-by-side

Showing with 64 additions and 13 deletions

README.md README.md +61 -12

inference_results/08.jpg inference_results/08.jpg +0 -0

model.properties model.properties +3 -1

No files found.
--- a/README.md
+++ b/README.md
 # PaddleOCR
-## 模型介绍
+## 论文
-基于百度PaddleOCR的字符识别模型。
+det、rec、cls三个模型的backbone基于mobilenetv3，可参考mobilenetv3的相关论文
+https://arxiv.org/pdf/1905.02244.pdf
 ## 模型结构
-整体包含检测和识别两个模型。
+det:
+Backbone:
+  name: MobileNetV3
+  scale: 0.5
+  model_name: large
+Neck:
+  name: DBFPN
+  out_channels: 256
+Head:
+  name: DBHead
+rec:
+Backbone:
+  name: MobileNetV1Enhance
+  scale: 0.5
+  last_conv_stride: [1, 2]
+  last_pool_type: avg
+Head:
+  name: MultiHead
+  head_list:
+    - CTCHead:
+        Neck:
+          name: svtr
+          dims: 64
+          depth: 2
+          hidden_dims: 120
+          use_guide: True
+        Head:
+          fc_decay: 0.00001
+    - SARHead:
+        enc_dim: 512
+        max_text_length: *max_text_length
+cls:
+Backbone:
+  name: MobileNetV3
+  scale: 0.35
+  model_name: small
+Head:
+  name: ClsHead
+  class_dim: 2
+## 算法原理
+det->cls->rec->text
 ## 数据集
 推荐使用icdar2015数据集[icdar2015](https://rrc.cvc.uab.es/?ch=4&com=downloads)。
@@ -32,14 +76,12 @@
            |- word_003.jpg
            | ...
 ```
-## 训练及推理
+## 环境配置
-### 环境配置
 在[光源](https://www.sourcefind.cn/#/service-details)可拉取训练以及推理的docker镜像，在[光合开发者社区](https://cancon.hpccube.com:65024/4/main/)可下载paddle安装包。PaddleOCR推荐的镜像如下：
 ```
 docker pull image.sourcefind.cn:5000/dcu/admin/base/paddlepaddle:2.3.2-centos7.6-dtk-22.10.1-py37-latest
 ```
+## 训练
-### 训练
 检测模型
 ```
@@ -49,7 +91,7 @@ python3 -m paddle.distributed.launch --gpus '0,1,2,3'  tools/train.py -c configs
 ```
 python3 -m paddle.distributed.launch --gpus '0,1,2,3'  tools/train.py -c configs/rec/PP-OCRv3/en_PP-OCRv3_rec.yml -o Global.pretrained_model=./pretrain_models/en_PP-OCRv3_rec_train/best_accuracy
 ```
-### 测试
+## 测试
 检测模型
 ```
 python3 -m paddle.distributed.launch --gpus '0,1,2,3'  tools/eval.py -c configs/det/det_mv3_db.yml -o Global.pretrained_model=./output/db_mv3/best_accuracy.pdparams
@@ -58,7 +100,7 @@ python3 -m paddle.distributed.launch --gpus '0,1,2,3'  tools/eval.py -c configs/
 ```
 python3 -m paddle.distributed.launch --gpus '0,1,2,3'  tools/eval.py -c configs/rec/PP-OCRv3/en_PP-OCRv3_rec.yml -o Global.pretrained_model=./output/v3_en_mobile/best_accuracy.pdparams
 ```
-### 测试(ort)
+## 测试(ort)
 检测模型
 ```
 python3 -m paddle.distributed.launch --gpus '0,1,2,3'  tools/eval.py -c configs/det/det_mv3_db.yml -o Global.pretrained_model=./ch_PP-OCRv3_det_infer/ch_PP-OCRv3_det.onnx --use_onnx=true
@@ -67,15 +109,17 @@ python3 -m paddle.distributed.launch --gpus '0,1,2,3'  tools/eval.py -c configs/
 ```
 python3 -m paddle.distributed.launch --gpus '0,1,2,3'  tools/eval.py -c configs/rec/PP-OCRv3/ch_PP-OCRv3_rec.yml -o Global.pretrained_model=./ch_PP-OCRv3_rec_infer/ch_PP-OCRv3_rec.onnx --use_onnx=true
 ```
-### 推理
+## 推理
 ```
 python3 tools/infer/predict_system.py --image_dir="./doc/imgs/" --det_model_dir="./ch_PP-OCRv3_det_infer/" --rec_model_dir="./ch_PP-OCRv3_rec_infer/" --use_angle_cls=false --rec_image_shape=3,48,320 --warmup=1
 ```
-### 推理(ort)
+## 推理(ort)
 ```
 python3 tools/infer/predict_system.py --image_dir="./doc/imgs/" --det_model_dir="./ch_PP-OCRv3_det_infer/ch_PP-OCRv3_det.onnx" --cls_model_dir="./ch_ppocr_mobile_v2.0_cls_infer/ch_ppocr_mobile_v2.0_cls_infer.onnx" --rec_model_dir="./ch_PP-OCRv3_rec_infer/ch_PP-OCRv3_rec.onnx" --use_onnx=true --use_angle_cls=true --rec_image_shape=3,48,320 --warmup=1
 ```
-## 性能和准确率数据
+## result
+![image](https://developer.hpccube.com/codes/modelzoo/paddleocr/-/raw/main/inference_results/08.jpg)
+### 性能和准确率数据
 检测模型测试
 | Model | Precision | Recall |
@@ -96,6 +140,11 @@ python3 tools/infer/predict_system.py --image_dir="./doc/imgs/" --det_model_dir=
 | Model | Acc | 
 | :------: | :------: |
 | rec | 0.6076 | 
+## 应用场景
+### 算法类别
+ocr
+### 热点应用行业
+工业制造、金融、交通、教育、医疗
 ## 源码仓库及问题反馈
 https://developer.hpccube.com/codes/modelzoo/paddleocr
 ## 参考

--- a/inference_results/08.jpg
+++ b/inference_results/08.jpg
--- a/model.properties
+++ b/model.properties
+# 模型唯一标识
+modelCode = 205
 # 模型名称
 modelName=PaddleOCR
 # 模型描述
 modelDescription=PaddleOCR是一个实现字符检测和识别的模型
 # 应用场景
-appScenario=推理,训练,OCR
+appScenario=推理,训练,OCR,工业制造,金融,交通,教育,医疗
 # 框架类型
 frameType=paddle