update doc

8c000977 · WenmuZhou · aad3093a · 8c000977 · 8c000977 · 8c000977
Commit 8c000977 authored Oct 13, 2020 by WenmuZhou
20 changed files
--- a/doc/doc_en/visualization_en.md
+++ b/doc/doc_en/visualization_en.md
 # Visualization

- [Chinese/English OCR Visualization (Space_support )](#Space_support)
- [Ultra-lightweight Chinese/English OCR Visualization](#Ultra-lightweight)
- [General Chinese/English OCR Visualization](#General)
+<a name="ppocr_server_1.1"></a>
+## ch_ppocr_server_1.1

-<a name="Space_support"></a>
-
-## Chinese/English OCR Visualization (Space_support )
-
-###  Ultra-lightweight Model
-<div align="center">
-    <img src="../imgs_results/img_11.jpg" width="800">
-</div>
-
-### General OCR Model
 <div align="center">
-    <img src="../imgs_results/chinese_db_crnn_server/en_paper.jpg" width="800">
+    <img src="../imgs_results/1101.jpg" width="800">
+    <img src="../imgs_results/1102.jpg" width="800">
+    <img src="../imgs_results/1103.jpg" width="800">
+    <img src="../imgs_results/1104.jpg" width="800">
+    <img src="../imgs_results/1105.jpg" width="800">
+    <img src="../imgs_results/1106.jpg" width="800">
 </div>

-<a name="Ultra-lightweight"></a>
-## Ultra-lightweight Chinese/English OCR Visualization

+<a name="en_ppocr_mobile_1.1"></a>
+## en_ppocr_mobile_1.1
 <div align="center">
-    <img src="../imgs_results/1.jpg" width="800">
+    <img src="../imgs_results/img_12.jpg" width="800">
 </div>

-<div align="center">
-    <img src="../imgs_results/7.jpg" width="800">
-</div>
-
-<div align="center">
-    <img src="../imgs_results/12.jpg" width="800">
-</div>

+<a name="multilingual"></a>
+## (multilingual)_ppocr_mobile_1.1
 <div align="center">
-    <img src="../imgs_results/4.jpg" width="800">
+    <img src="../imgs_results/1110.jpg" width="800">
+    <img src="../imgs_results/1112.jpg" width="800">
 </div>

-<div align="center">
-    <img src="../imgs_results/6.jpg" width="800">
-</div>

-<div align="center">
-    <img src="../imgs_results/9.jpg" width="800">
-</div>
+<a name="ppocr_mobile_1.0"></a>
+## ppocr_mobile_1.0

 <div align="center">
+    <img src="../imgs_results/1.jpg" width="800">
+    <img src="../imgs_results/7.jpg" width="800">
+    <img src="../imgs_results/6.jpg" width="800">
    <img src="../imgs_results/16.png" width="800">
 </div>

-<div align="center">
-    <img src="../imgs_results/22.jpg" width="800">
-</div>

-<a name="General"></a>
-## General Chinese/English OCR Visualization
+<a name="ppocr_server_1.0"></a>
+## ppocr_server_1.0

 <div align="center">
    <img src="../imgs_results/chinese_db_crnn_server/11.jpg" width="800">
-</div>
-
-<div align="center">
    <img src="../imgs_results/chinese_db_crnn_server/2.jpg" width="800">
-</div>
-
-<div align="center">
    <img src="../imgs_results/chinese_db_crnn_server/8.jpg" width="800">
 </div>
-
-
--- a/doc/doc_en/whl_en.md
+++ b/doc/doc_en/whl_en.md
@@ -9,15 +9,53 @@ pip install paddleocr

 build own whl package and install
 ```bash
-python setup.py bdist_wheel
-pip install dist/paddleocr-0.0.3-py3-none-any.whl
+python3 setup.py bdist_wheel
+pip3 install dist/paddleocr-x.x.x-py3-none-any.whl # x.x.x is the version of paddleocr
 ```
 ### 1. Use by code

+* detection classification and recognition
+```python
+from paddleocr import PaddleOCR,draw_ocr
+# Paddleocr supports Chinese, English, French, German, Korean and Japanese.
+# You can set the parameter `lang` as `ch`, `en`, `french`, `german`, `korean`, `japan`
+# to switch the language model in order.
+ocr = PaddleOCR(use_angle_cls=True, lang='en') # need to run only once to download and load model into memory
+img_path = 'PaddleOCR/doc/imgs_en/img_12.jpg'
+result = ocr.ocr(img_path, cls=True)
+for line in result:
+    print(line)
+
+
+# draw result
+from PIL import Image
+image = Image.open(img_path).convert('RGB')
+boxes = [line[0] for line in result]
+txts = [line[1][0] for line in result]
+scores = [line[1][1] for line in result]
+im_show = draw_ocr(image, boxes, txts, scores, font_path='/path/to/PaddleOCR/doc/simfang.ttf')
+im_show = Image.fromarray(im_show)
+im_show.save('result.jpg')
+```
+
+Output will be a list, each item contains bounding box, text and recognition confidence
+```bash
+[[[442.0, 173.0], [1169.0, 173.0], [1169.0, 225.0], [442.0, 225.0]], ['ACKNOWLEDGEMENTS', 0.99283075]]
+[[[393.0, 340.0], [1207.0, 342.0], [1207.0, 389.0], [393.0, 387.0]], ['We would like to thank all the designers and', 0.9357758]]
+[[[399.0, 398.0], [1204.0, 398.0], [1204.0, 433.0], [399.0, 433.0]], ['contributors whohave been involved in the', 0.9592447]]
+......
+```
+
+Visualization of results
+
+<div align="center">
+    <img src="../imgs_results/whl/12_det_rec.jpg" width="800">
+</div>
+
 * detection and recognition
 ```python
 from paddleocr import PaddleOCR,draw_ocr
-ocr = PaddleOCR() # need to run only once to download and load model into memory
+ocr = PaddleOCR(lang='en') # need to run only once to download and load model into memory
 img_path = 'PaddleOCR/doc/imgs_en/img_12.jpg'
 result = ocr.ocr(img_path)
 for line in result:
@@ -48,6 +86,21 @@ Visualization of results
    <img src="../imgs_results/whl/12_det_rec.jpg" width="800">
 </div>

+* classification and recognition
+```python
+from paddleocr import PaddleOCR
+ocr = PaddleOCR(use_angle_cls=True, lang='en') # need to run only once to load model into memory
+img_path = 'PaddleOCR/doc/imgs_words_en/word_10.png'
+result = ocr.ocr(img_path, det=False, cls=True)
+for line in result:
+    print(line)
+```
+
+Output will be a list, each item contains recognition text and confidence
+```bash
+['PAIN', 0.990372]
+```
+
 * only detection
 ```python
 from paddleocr import PaddleOCR,draw_ocr
@@ -83,18 +136,33 @@ Visualization of results
 * only recognition
 ```python
 from paddleocr import PaddleOCR
-ocr = PaddleOCR() # need to run only once to load model into memory
+ocr = PaddleOCR(lang='en') # need to run only once to load model into memory
 img_path = 'PaddleOCR/doc/imgs_words_en/word_10.png'
-result = ocr.ocr(img_path,det=False)
+result = ocr.ocr(img_path, det=False, cls=False)
 for line in result:
    print(line)
 ```

-Output will be a list, each item contains text and recognition confidence
+Output will be a list, each item contains recognition text and confidence
 ```bash
 ['PAIN', 0.990372]
 ```

+* only classification
+```python
+from paddleocr import PaddleOCR
+ocr = PaddleOCR(use_angle_cls=True) # need to run only once to load model into memory
+img_path = 'PaddleOCR/doc/imgs_words_en/word_10.png'
+result = ocr.ocr(img_path, det=False, rec=False, cls=True)
+for line in result:
+    print(line)
+```
+
+Output will be a list, each item contains classification result and confidence
+```bash
+['0', 0.99999964]
+```
+
 ### Use by command line

 show help information
@@ -102,9 +170,22 @@ show help information
 paddleocr -h
 ```

+* detection classification and recognition
+```bash
+paddleocr --image_dir PaddleOCR/doc/imgs_en/img_12.jpg --use_angle_cls true -cls true --lang en
+```
+
+Output will be a list, each item contains bounding box, text and recognition confidence
+```bash
+[[[442.0, 173.0], [1169.0, 173.0], [1169.0, 225.0], [442.0, 225.0]], ['ACKNOWLEDGEMENTS', 0.99283075]]
+[[[393.0, 340.0], [1207.0, 342.0], [1207.0, 389.0], [393.0, 387.0]], ['We would like to thank all the designers and', 0.9357758]]
+[[[399.0, 398.0], [1204.0, 398.0], [1204.0, 433.0], [399.0, 433.0]], ['contributors whohave been involved in the', 0.9592447]]
+......
+```
+
 * detection and recognition
 ```bash
-paddleocr --image_dir PaddleOCR/doc/imgs_en/img_12.jpg
+paddleocr --image_dir PaddleOCR/doc/imgs_en/img_12.jpg --lang en
 ```

 Output will be a list, each item contains bounding box, text and recognition confidence
@@ -115,6 +196,16 @@ Output will be a list, each item contains bounding box, text and recognition con
 ......
 ```

+* classification and recognition
+```bash
+paddleocr --image_dir PaddleOCR/doc/imgs_words_en/word_10.png --use_angle_cls true -cls true --det false --lang en
+```
+
+Output will be a list, each item contains text and recognition confidence
+```bash
+['PAIN', 0.990372]
+```
+
 * only detection
 ```bash
 paddleocr --image_dir PaddleOCR/doc/imgs_en/img_12.jpg --rec false
@@ -130,7 +221,7 @@ Output will be a list, each item only contains bounding box

 * only recognition
 ```bash
-paddleocr --image_dir PaddleOCR/doc/imgs_words_en/word_10.png --det false
+paddleocr --image_dir PaddleOCR/doc/imgs_words_en/word_10.png --det false --cls false --lang en
 ```

 Output will be a list, each item contains text and recognition confidence
@@ -138,6 +229,16 @@ Output will be a list, each item contains text and recognition confidence
 ['PAIN', 0.990372]
 ```

+* only classification
+```bash
+paddleocr --image_dir PaddleOCR/doc/imgs_words_en/word_10.png --use_angle_cls true -cls true --det false --rec false
+```
+
+Output will be a list, each item contains classification result and confidence
+```bash
+['0', 0.99999964]
+```
+
 ## Use custom model
 When the built-in model cannot meet the needs, you need to use your own trained model.
 First, refer to the first section of [inference_en.md](./inference_en.md) to convert your det and rec model to inference model, and then use it as follows
@@ -147,9 +248,9 @@ First, refer to the first section of [inference_en.md](./inference_en.md) to con
 ```python
 from paddleocr import PaddleOCR,draw_ocr
 # The path of detection and recognition model must contain model and params files
-ocr = PaddleOCR(det_model_dir='{your_det_model_dir}',rec_model_dir='{your_rec_model_dir}å')
+ocr = PaddleOCR(det_model_dir='{your_det_model_dir}', rec_model_dir='{your_rec_model_dir}', rec_char_dict_path='{your_rec_char_dict_path}', cls_model_dir='{your_cls_model_dir}', use_angle_cls=True)
 img_path = 'PaddleOCR/doc/imgs_en/img_12.jpg'
-result = ocr.ocr(img_path)
+result = ocr.ocr(img_path, cls=True)
 for line in result:
    print(line)

@@ -167,7 +268,7 @@ im_show.save('result.jpg')
 ### Use by command line

 ```bash
-paddleocr --image_dir PaddleOCR/doc/imgs/11.jpg --det_model_dir {your_det_model_dir} --rec_model_dir {your_rec_model_dir}
+paddleocr --image_dir PaddleOCR/doc/imgs/11.jpg --det_model_dir {your_det_model_dir} --rec_model_dir {your_rec_model_dir} --rec_char_dict_path {your_rec_char_dict_path} --cls_model_dir {your_cls_model_dir} --use_angle_cls true --cls true
 ```

 ## Parameter Description
@@ -194,6 +295,14 @@ paddleocr --image_dir PaddleOCR/doc/imgs/11.jpg --det_model_dir {your_det_model_
 | max_text_length         | The maximum text length that the recognition algorithm can recognize                                                                                                                                                                                         | 25                      |
 | rec_char_dict_path      | the alphabet path which needs to be modified to your own path when `rec_model_Name` use mode 2                                                                                                                                              | ./ppocr/utils/ppocr_keys_v1.txt                        |
 | use_space_char          | Whether to recognize spaces                                                                                                                                                                                                         | TRUE                    |
+| use_angle_cls          | Whether to load classification model                                                                                                                                                                                                       | FALSE                    |
+| cls_model_dir           | the classification inference model folder. There are two ways to transfer parameters, 1. None: Automatically download the built-in model to `~/.paddleocr/cls`; 2. The path of the inference model converted by yourself, the model and params files must be included in the model path | None |
+| cls_image_shape         | image shape of classification algorithm                                                                                                                                                                                            | "3,48,192"              |
+| label_list         | label list of classification algorithm                                                                                                                                                                                            | ['0','180']           |
+| cls_batch_num           | When performing classification, the batchsize of forward images                                                                                                                                                                                         | 30                      |
 | enable_mkldnn           | Whether to enable mkldnn                                                                                                                                                                                                       | FALSE                   |
+| use_zero_copy_run           | Whether to forward by zero_copy_run                                                                                                                                                                               | FALSE                   |
+| lang                     | The support language, now only Chinese(ch)、English(en)、French(french)、German(german)、Korean(korean)、Japanese(japan) are supported                                                                                                                                                                                                  | ch                    |
 | det                     | Enable detction when `ppocr.ocr` func exec                                                                                                                                                                                                   | TRUE                    |
-| rec                     | Enable detction when `ppocr.ocr` func exec                                                                                                                                                                                                   | TRUE                    |
+| rec                     | Enable recognition when `ppocr.ocr` func exec                                                                                                                                                                                                   | TRUE                    |
+| cls                     | Enable classification when `ppocr.ocr` func exec                                                                                                                                                                                                   | FALSE                    |
--- a/doc/french.ttf
+++ b/doc/french.ttf
--- a/doc/german.ttf
+++ b/doc/german.ttf
--- a/doc/imgs/french_0.jpg
+++ b/doc/imgs/french_0.jpg
--- a/doc/imgs/ger_1.jpg
+++ b/doc/imgs/ger_1.jpg
--- a/doc/imgs/ger_2.jpg
+++ b/doc/imgs/ger_2.jpg
--- a/doc/imgs/japan_1.jpg
+++ b/doc/imgs/japan_1.jpg
--- a/doc/imgs/japan_2.jpg
+++ b/doc/imgs/japan_2.jpg
--- a/doc/imgs/korean_1.jpg
+++ b/doc/imgs/korean_1.jpg
--- a/doc/imgs_results/1101.jpg
+++ b/doc/imgs_results/1101.jpg
--- a/doc/imgs_results/1102.jpg
+++ b/doc/imgs_results/1102.jpg
--- a/doc/imgs_results/1103.jpg
+++ b/doc/imgs_results/1103.jpg
--- a/doc/imgs_results/1104.jpg
+++ b/doc/imgs_results/1104.jpg
--- a/doc/imgs_results/1105.jpg
+++ b/doc/imgs_results/1105.jpg
--- a/doc/imgs_results/1106.jpg
+++ b/doc/imgs_results/1106.jpg
--- a/doc/imgs_results/1110.jpg
+++ b/doc/imgs_results/1110.jpg
--- a/doc/imgs_results/1112.jpg
+++ b/doc/imgs_results/1112.jpg
--- a/doc/imgs_results/img_12.jpg
+++ b/doc/imgs_results/img_12.jpg
--- a/doc/imgs_words/french/1.jpg
+++ b/doc/imgs_words/french/1.jpg