wangsen / MinerU
Commits · f10b4a501f18e43cbb30910ce0137443371cb50d
mineru / magic_pdf / dict2md / ocr_mkcontent.py
15 Mar, 2024
3 commits
Unify s3_image_save_path configuration · f10b4a50
赵小蒙 authored Mar 15, 2024
Update span_type classification in mk_mm_markdown2 · 195998a0
赵小蒙 authored Mar 15, 2024
Change image addresses to fullpath when making multimodal markdown · f06a3213
赵小蒙 authored Mar 15, 2024
14 Mar, 2024
5 commits
Implement paragraph splitting within a layout · 084e9328
xuchao authored Mar 14, 2024
Escape special characters when making markdown · 59b0b0c3
赵小蒙 authored Mar 14, 2024
Update the Spark pipeline for OCR mode · 9bd6294b
赵小蒙 authored Mar 14, 2024
Abstract the content type in OCR mode · 26c23782
赵小蒙 authored Mar 14, 2024
Draw the bboxes of dropped elements in layout.pdf · b6f051d8
赵小蒙 authored Mar 14, 2024
12 Mar, 2024
1 commit
Add logic for generating multimodal markdown · ec1a6ef7
赵小蒙 authored Mar 12, 2024
07 Mar, 2024
1 commit
Fix an issue caused by a span that may have no content · 00f3e329
赵小蒙 authored Mar 07, 2024
06 Mar, 2024
1 commit
Add the OCR version of the parsing feature · 701f3849
赵小蒙 authored Mar 06, 2024