wangsen / MinerU
Commit 8061dfce (Unverified)
Authored Nov 16, 2024 by Xiaomeng Zhao
Committed by GitHub, Nov 16, 2024

Merge pull request #975 from myhloli/dev

docs: update feature description for table conversion

Parents: f485fa02, 87dc40f5
Showing 2 changed files with 2 additions and 2 deletions

README.md        +1 -1
README_zh-CN.md  +1 -1

README.md
@@ -121,7 +121,7 @@ https://github.com/user-attachments/assets/4bea02c9-6d54-4cd6-97ed-dff14340982c
 - Preserve the structure of the original document, including headings, paragraphs, lists, etc.
 - Extract images, image descriptions, tables, table titles, and footnotes.
 - Automatically recognize and convert formulas in the document to LaTeX format.
-- Automatically recognize and convert tables in the document to LaTeX or HTML format.
+- Automatically recognize and convert tables in the document to HTML format.
 - Automatically detect scanned PDFs and garbled PDFs and enable OCR functionality.
 - OCR supports detection and recognition of 84 languages.
 - Supports multiple output formats, such as multimodal and NLP Markdown, JSON sorted by reading order, and rich intermediate formats.
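
README_zh-CN.md
@@ -121,7 +121,7 @@ https://github.com/user-attachments/assets/4bea02c9-6d54-4cd6-97ed-dff14340982c
 - Preserve the structure of the original document, including headings, paragraphs, lists, etc.
 - Extract images, image descriptions, tables, table titles, and footnotes
 - Automatically recognize and convert formulas in the document to LaTeX format
-- Automatically recognize and convert tables in the document to LaTeX or HTML format
+- Automatically recognize and convert tables in the document to HTML format
 - Automatically detect scanned PDFs and garbled PDFs, and enable OCR
 - OCR supports detection and recognition of 84 languages
 - Supports multiple output formats, such as multimodal and NLP Markdown, JSON sorted by reading order, and information-rich intermediate formats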
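
Since the updated description says tables are converted to HTML only, a downstream consumer of the Markdown output would need to read `<table>` markup rather than LaTeX. The following is a minimal sketch of how that might look using only the Python standard library. It is not part of MinerU and does not use any MinerU API; `TableExtractor`, `extract_tables`, and the `sample` string are hypothetical names and data made up for illustration, and the sketch assumes simple, non-nested tables.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect the cell text of every <table> found in a Markdown/HTML string.

    Hypothetical helper for illustration; assumes tables are not nested.
    """

    def __init__(self):
        super().__init__()
        self.tables = []   # each table is a list of rows, each row a list of cell strings
        self._row = None   # cells of the row currently being read, or None
        self._cell = None  # text fragments of the cell currently being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that appears inside a table cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.tables[-1].append(self._row)
            self._row = None


def extract_tables(markdown_text):
    """Return all HTML tables in the text as nested lists of cell strings."""
    parser = TableExtractor()
    parser.feed(markdown_text)
    parser.close()
    return parser.tables


# Made-up sample resembling Markdown output with an embedded HTML table.
sample = """# Results

<table><tr><th>Model</th><th>Score</th></tr><tr><td>A</td><td>0.91</td></tr></table>
"""

if __name__ == "__main__":
    print(extract_tables(sample))
```

Text outside table cells is ignored, so the same function can be run over a whole Markdown file; attributes such as `rowspan` and `colspan` are not interpreted in this sketch.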