Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
wangsen
MinerU
Commits
a77cb36d5149319fcc76f0bbf7648256fe360cb1
Switch branch/tag
mineru
magic_pdf
pdf_parse_by_ocr_v2.py
22 Apr, 2024
6 commits
block type 字段名修复
· 45ce99bf
赵小蒙
authored
Apr 22, 2024
增加remove_overlaps_min_blocks逻辑
45ce99bf
更新了para_split
· e31066ba
liukaiwen
authored
Apr 22, 2024
e31066ba
增加block嵌套问题的todo
· d8c5b7a7
赵小蒙
authored
Apr 22, 2024
d8c5b7a7
修复table的描述符号应该是"5"
· dbb9a1ab
赵小蒙
authored
Apr 22, 2024
dbb9a1ab
更新了para_split_by_model
· f519f63d
liukaiwen
authored
Apr 22, 2024
f519f63d
将ocr_parse逻辑切换到v2,并解决几个parse过程中的error
· dcf6e712
赵小蒙
authored
Apr 22, 2024
dcf6e712
19 Apr, 2024
1 commit
重构 parse_by_ocr_v2.py
· f5341e16
赵小蒙
authored
Apr 19, 2024
f5341e16
18 Apr, 2024
1 commit
重构parse_by_ocr_v2
· 7e8e9cab
赵小蒙
authored
Apr 18, 2024
7e8e9cab