Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
zhougaofeng
magic_pdf
Commits
95d8d0dc
Commit
95d8d0dc
authored
Oct 22, 2024
by
zhougaofeng
Browse files
Update README.md
parent
116ce404
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
19 additions
and
12 deletions
+19
-12
README.md
README.md
+19
-12
No files found.
README.md
View file @
95d8d0dc
...
@@ -4,15 +4,20 @@
...
@@ -4,15 +4,20 @@
### 以下演示在223节点安装pdf解析模块(可以直接使用镜像:1177ea7959ce)
### 以下演示在223节点安装pdf解析模块(可以直接使用镜像:1177ea7959ce)
下载本项目
### 1、
下载本项目
`git clone http://developer.sourcefind.cn/codes/zhiAn123/magic_pdf.git`
`git clone http://developer.sourcefind.cn/codes/zhiAn123/magic_pdf.git`
下载需要的模型库
### 2、
下载需要的模型库
`git lfs clone https://www.modelscope.cn/opendatalab/PDF-Extract-Kit.git`
使用魔搭下载
下载qwen模型:
[
快速下载通道
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2-VL-7B-Instruct.git
)
下载PDF解析需要的模型:
(1)
`git lfs clone https://www.modelscope.cn/opendatalab/PDF-Extract-Kit.git`
(2)使用魔搭下载
`pip install modelscope`
`pip install modelscope`
...
@@ -21,29 +26,31 @@
...
@@ -21,29 +26,31 @@
`model_dir = snapshot_download('opendatalab/PDF-Extract-Kit')`
`model_dir = snapshot_download('opendatalab/PDF-Extract-Kit')`
### 3、安装需要的依赖库
### 1、安装需要的依赖库
进入主目录
`cd magic_pdf
-main
`
`cd magic_pdf`
#### pip install -e .
执行本地源码安装
###
2、安装需要的模型
###
# pip install -e .
###
#
修改magic-pdf.template.json
###
4、
修改magic-pdf.template.json
<div
align=
center
>
<div
align=
center
>
<img
src=
"doc/image (9).png"
/>
<img
src=
"doc/image (9).png"
/>
</div>
</div>
需要注意,"models-dir":"/home/practice/model/PDF-Extract-Kit/models" 路径指向PDF-Extract-Kit/models
"models-dir":"/home/practice/model/PDF-Extract-Kit/models" 路径指向PDF-Extract-Kit/models
将magic-pdf.template.json 拷贝到/root目录下并改名为magic-pdf.json
将magic-pdf.template.json 拷贝到/root目录下并改名为magic-pdf.json
<div
align=
center
>
<div
align=
center
>
<img
src=
"doc/image (10).png"
/>
<img
src=
"doc/image (10).png"
/>
</div>
</div>
### 4、启动qwen-ocr模块:
### 5、启动qwen-ocr模块:
下载qwen模型:
[
快速下载通道
](
http://113.200.138.88:18080/aimodels/qwen/Qwen2-VL-7B-Instruct.git
)
修改magic_pdf-main/magic_pdf/dict2md/ocr_server.py文件中模型路径地址
修改magic_pdf-main/magic_pdf/dict2md/ocr_server.py文件中模型路径地址
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment