wangsen / MinerU
Commit 8f8b8c4c
Authored May 22, 2025 by Xiaomeng Zhao
Committed by GitHub, May 22, 2025
Merge pull request #2501 from myhloli/dev
feat(ocr): add PP-OCRv5 models and update configurations
Parents: b122b86e, 048f6af4
Showing 3 changed files with 18430 additions and 1 deletion
magic_pdf/model/sub_modules/ocr/paddleocr2pytorch/pytorchocr/utils/resources/arch_config.yaml  +42 -0
magic_pdf/model/sub_modules/ocr/paddleocr2pytorch/pytorchocr/utils/resources/dict/ppocrv5_dict.txt  +18383 -0
magic_pdf/model/sub_modules/ocr/paddleocr2pytorch/pytorchocr/utils/resources/models_config.yml  +5 -1
magic_pdf/model/sub_modules/ocr/paddleocr2pytorch/pytorchocr/utils/resources/arch_config.yaml
@@ -104,6 +104,22 @@ ch_PP-OCRv4_det_infer:
         name: DBHead
         k: 50
 
+ch_PP-OCRv5_det_infer:
+    model_type: det
+    algorithm: DB
+    Transform: null
+    Backbone:
+        name: PPLCNetV3
+        scale: 0.75
+        det: True
+    Neck:
+        name: RSEFPN
+        out_channels: 96
+        shortcut: True
+    Head:
+        name: DBHead
+        k: 50
+
 ch_PP-OCRv4_det_server_infer:
     model_type: det
     algorithm: DB
@@ -196,6 +212,32 @@ ch_PP-OCRv4_rec_server_doc_infer:
                 nrtr_dim: 384
                 max_text_length: 25
 
+ch_PP-OCRv5_rec_infer:
+    model_type: rec
+    algorithm: SVTR_HGNet
+    Transform:
+    Backbone:
+        name: PPLCNetV3
+        scale: 0.95
+    Head:
+        name: MultiHead
+        out_channels_list:
+            CTCLabelDecode: 18385
+        head_list:
+            - CTCHead:
+                Neck:
+                    name: svtr
+                    dims: 120
+                    depth: 2
+                    hidden_dims: 120
+                    kernel_size: [1, 3]
+                    use_guide: True
+                Head:
+                    fc_decay: 0.00001
+            - NRTRHead:
+                nrtr_dim: 384
+                max_text_length: 25
+
 chinese_cht_PP-OCRv3_rec_infer:
     model_type: rec
     algorithm: SVTR
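In the `ch_PP-OCRv5_rec_infer` entry, `head_list` is a list of single-key mappings, one per sub-head of `MultiHead`. The following is a minimal sketch of how that structure can be walked once the YAML has been parsed. The dict literal mirrors the values in the diff; `head_settings` is a hypothetical helper for illustration and is not part of pytorchocr.

```python
# Parsed form of the Head section of ch_PP-OCRv5_rec_infer (values copied from the diff).
HEAD = {
    "name": "MultiHead",
    "out_channels_list": {"CTCLabelDecode": 18385},
    "head_list": [
        {
            "CTCHead": {
                "Neck": {
                    "name": "svtr",
                    "dims": 120,
                    "depth": 2,
                    "hidden_dims": 120,
                    "kernel_size": [1, 3],
                    "use_guide": True,
                },
                "Head": {"fc_decay": 0.00001},
            }
        },
        {"NRTRHead": {"nrtr_dim": 384, "max_text_length": 25}},
    ],
}


def head_settings(head_cfg):
    """Hypothetical helper: flatten head_list into {head_name: settings}."""
    settings = {}
    for entry in head_cfg["head_list"]:
        # Each list item is expected to be a mapping with exactly one key.
        if len(entry) != 1:
            raise ValueError(f"expected a single-key mapping, got {entry!r}")
        ((name, cfg),) = entry.items()
        settings[name] = cfg
    return settings


heads = head_settings(HEAD)
print(sorted(heads))
print(heads["CTCHead"]["Neck"]["dims"])
```

Keeping each sub-head as its own single-key mapping lets the list preserve ordering while still naming the head type, which is why the helper rejects items with more than one key.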
magic_pdf/model/sub_modules/ocr/paddleocr2pytorch/pytorchocr/utils/resources/dict/ppocrv5_dict.txt
new file mode 100644
This diff is collapsed.
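The new dictionary file adds 18383 lines, while the recognition head in arch_config.yaml declares 18385 output channels for `CTCLabelDecode`. One plausible reading of the difference is sketched below. It assumes the common CTC decoding convention of reserving one class for the blank token and one for a space character; that convention is an assumption here, not something this commit states, and `ctc_num_classes` is a hypothetical helper.

```python
def ctc_num_classes(dict_entries: int, use_space_char: bool = True) -> int:
    """Hypothetical helper: number of CTC output classes for a character dictionary.

    Assumed convention: one extra class for the CTC blank token, plus one
    for the space character when use_space_char is enabled.
    """
    blank = 1
    space = 1 if use_space_char else 0
    return dict_entries + blank + space


# 18383 is the number of lines added in ppocrv5_dict.txt.
print(ctc_num_classes(18383))  # prints 18385
```

If the dictionary and `out_channels_list` ever drift apart, a check of this kind would flag the mismatch before the weights are loaded.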
magic_pdf/model/sub_modules/ocr/paddleocr2pytorch/pytorchocr/utils/resources/models_config.yml
 lang:
-  ch_lite:
+  ch_lite_v4:
     det: ch_PP-OCRv3_det_infer.pth
     rec: ch_PP-OCRv4_rec_infer.pth
     dict: ppocr_keys_v1.txt
+  ch_lite:
+    det: ch_PP-OCRv5_det_infer.pth
+    rec: ch_PP-OCRv5_rec_infer.pth
+    dict: ppocrv5_dict.txt
   ch_server:
     det: ch_PP-OCRv3_det_infer.pth
     rec: ch_PP-OCRv4_rec_server_infer.pth
...
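The net effect of this change is that `ch_lite` now points at the PP-OCRv5 detection and recognition weights with `ppocrv5_dict.txt`, while the previous combination stays reachable as `ch_lite_v4`. A minimal sketch of looking up a language key in such a mapping follows. The dict literal copies the two entries from the diff; `resolve_models` is a hypothetical helper and not the loader MinerU actually uses.

```python
# The two ch_lite entries of models_config.yml after this commit, as a parsed mapping.
MODELS_CONFIG = {
    "lang": {
        "ch_lite_v4": {
            "det": "ch_PP-OCRv3_det_infer.pth",
            "rec": "ch_PP-OCRv4_rec_infer.pth",
            "dict": "ppocr_keys_v1.txt",
        },
        "ch_lite": {
            "det": "ch_PP-OCRv5_det_infer.pth",
            "rec": "ch_PP-OCRv5_rec_infer.pth",
            "dict": "ppocrv5_dict.txt",
        },
    }
}

REQUIRED_KEYS = ("det", "rec", "dict")


def resolve_models(config, lang):
    """Hypothetical helper: return (det, rec, dict) file names for a language key."""
    try:
        entry = config["lang"][lang]
    except KeyError:
        raise ValueError(f"unknown lang {lang!r}") from None
    missing = [key for key in REQUIRED_KEYS if key not in entry]
    if missing:
        raise ValueError(f"lang {lang!r} is missing keys: {missing}")
    return tuple(entry[key] for key in REQUIRED_KEYS)


print(resolve_models(MODELS_CONFIG, "ch_lite"))
print(resolve_models(MODELS_CONFIG, "ch_lite_v4"))
```

Callers that pass `ch_lite` pick up the new models without code changes, and callers that need the earlier behavior can switch to `ch_lite_v4`.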