wangsen
MinerU
Commits
17b09f71773a004bb98e4cdd3a305dddd03cd36b
mineru
magic_pdf
pdf_parse_by_ocr.py
08 Mar, 2024
5 commits
Update OCR pipeline
· 17b09f71
赵小蒙
authored
Mar 08, 2024
span->line merging is now based on the model's layout
· 864e9535
赵小蒙
authored
Mar 08, 2024
Convert the model's layout coordinates
· f9bd0040
赵小蒙
authored
Mar 08, 2024
Extract the conversion logic between model and pymu coordinates into a method
· f62d1aa7
赵小蒙
authored
Mar 08, 2024
Remove header/page number/footnote/footer in OCR mode
· 388223f2
赵小蒙
authored
Mar 08, 2024
07 Mar, 2024
2 commits
Add layout parsing for OCR mode
· fcea39d3
赵小蒙
authored
Mar 07, 2024
Update OCR merging logic
· caa1588a
赵小蒙
authored
Mar 07, 2024
06 Mar, 2024
2 commits
Update parse_pdf_by_ocr logic
· a0be4652
赵小蒙
authored
Mar 06, 2024
Add OCR-based parsing
· 701f3849
赵小蒙
authored
Mar 06, 2024