wangsen
MinerU
Commits
7512baaaa3a2aa1d1fe65d740df5feefe4902c85
mineru
magic_pdf
pdf_parse_by_ocr.py
12 Mar, 2024
4 commits
Refactor drow_bbox into a utility class
· 7512baaa
赵小蒙
authored
Mar 12, 2024
7512baaa
feat: complete self check
· 2611e853
许瑞
authored
Mar 12, 2024
2611e853
Adjust the intermediate-state structure of pdf_info_dict
· 61a0c62c
赵小蒙
authored
Mar 12, 2024
61a0c62c
Automatically draw layout regions and text regions when debugging
· f31117de
赵小蒙
authored
Mar 12, 2024
f31117de
08 Mar, 2024
6 commits
Add screenshot capture to OCR mode
· a5f8de98
赵小蒙
authored
Mar 08, 2024
a5f8de98
Update OCR pipeline
· 17b09f71
赵小蒙
authored
Mar 08, 2024
17b09f71
span->line merging is now based on the model's layout
· 864e9535
赵小蒙
authored
Mar 08, 2024
864e9535
Convert the model's layout coordinates
· f9bd0040
赵小蒙
authored
Mar 08, 2024
f9bd0040
Abstract the conversion logic between model and pymu coordinates into a method
· f62d1aa7
赵小蒙
authored
Mar 08, 2024
f62d1aa7
Remove header/page number/footnote/footer in OCR mode
· 388223f2
赵小蒙
authored
Mar 08, 2024
388223f2
07 Mar, 2024
2 commits
Add layout parsing for OCR mode
· fcea39d3
赵小蒙
authored
Mar 07, 2024
fcea39d3
Update OCR merging logic
· caa1588a
赵小蒙
authored
Mar 07, 2024
caa1588a
06 Mar, 2024
2 commits
Update parse_pdf_by_ocr logic
· a0be4652
赵小蒙
authored
Mar 06, 2024
a0be4652
Add OCR-based parsing
· 701f3849
赵小蒙
authored
Mar 06, 2024
701f3849