Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
wangsen
MinerU
Commits
29478dd6
Commit
29478dd6
authored
Jul 02, 2024
by
赵小蒙
Browse files
update readme
parent
4e80f0c1
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
6 additions
and
4 deletions
+6
-4
README.md
README.md
+3
-2
README_zh-CN.md
README_zh-CN.md
+3
-2
No files found.
README.md
View file @
29478dd6
...
@@ -20,7 +20,7 @@
...
@@ -20,7 +20,7 @@
## Introduction
## Introduction
MinerU is a one-stop, open-source data extraction tool,
primarily
includes the following features:
MinerU is a one-stop, open-source
, high-quality
data extraction tool, includes the following
primary
features:
-
[
Magic-PDF
](
#Magic-PDF
)
PDF Document Extraction
-
[
Magic-PDF
](
#Magic-PDF
)
PDF Document Extraction
-
[
Magic-Doc
](
#Magic-Doc
)
Webpage & E-book Extraction
-
[
Magic-Doc
](
#Magic-Doc
)
Webpage & E-book Extraction
...
@@ -86,9 +86,10 @@ pip install magic-pdf
...
@@ -86,9 +86,10 @@ pip install magic-pdf
###### simple
###### simple
```
bash
```
bash
cp
magic-pdf.template.json
to
~/magic-pdf.json
cp
magic-pdf.template.json ~/magic-pdf.json
magic-pdf pdf-command
--pdf
"pdf_path"
--model
"model_json_path"
magic-pdf pdf-command
--pdf
"pdf_path"
--model
"model_json_path"
```
```
After the program has finished, you can find the generated markdown files under the directory "/tmp/magic-pdf".
###### more
###### more
...
...
README_zh-CN.md
View file @
29478dd6
...
@@ -20,7 +20,7 @@
...
@@ -20,7 +20,7 @@
## 简介
## 简介
MinerU 是一款一站式开源数据提取工具,主要包含以下功能:
MinerU 是一款一站式
、
开源
、高质量的
数据提取工具,主要包含以下功能:
-
[
Magic-PDF
](
#Magic-PDF
)
PDF文档提取
-
[
Magic-PDF
](
#Magic-PDF
)
PDF文档提取
-
[
Magic-Doc
](
#Magic-Doc
)
网页与电子书提取
-
[
Magic-Doc
](
#Magic-Doc
)
网页与电子书提取
...
@@ -84,9 +84,10 @@ pip install magic-pdf
...
@@ -84,9 +84,10 @@ pip install magic-pdf
###### 直接使用
###### 直接使用
```
bash
```
bash
cp
magic-pdf.template.json
to
~/magic-pdf.json
cp
magic-pdf.template.json ~/magic-pdf.json
magic-pdf pdf-command
--pdf
"pdf_path"
--model
"model_json_path"
magic-pdf pdf-command
--pdf
"pdf_path"
--model
"model_json_path"
```
```
程序运行完成后,你可以在"/tmp/magic-pdf"目录下看到生成的markdown文件
###### 更多用法
###### 更多用法
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment