Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
ModelZoo
PaddleOCR-VL_paddle
Commits
4c24053d
Commit
4c24053d
authored
Nov 04, 2025
by
chenych
Browse files
Update README
parent
92a367a9
Changes
3
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
with
9 additions
and
9 deletions
+9
-9
README.md
README.md
+5
-5
paddleocr-vl-image.py
paddleocr-vl-image.py
+2
-2
paddleocr-vl-pdf.py
paddleocr-vl-pdf.py
+2
-2
No files found.
README.md
View file @
4c24053d
...
...
@@ -18,13 +18,13 @@ PaddleOCR-VL 将复杂的文档解析任务分解为两个阶段。第一阶段
### 硬件需求
DCU型号:K100AI,节点数量:1台,卡数:1张。
`-v 路径`
、
`docker_name`
根据实际情况修改
`-v
挂载
路径`
、
`docker_name`
根据实际情况修改
### Docker(方法一)
```
bash
docker pull image.sourcefind.cn:5000/dcu/admin/base/custom:vllm0.9.2-ubuntu22.04-dtk25.04.2-py3.10-paddleocr-vl
docker run
-it
--shm-size
200g
--network
=
host
--name
{
docker_name
}
--privileged
--device
=
/dev/kfd
--device
=
/dev/dri
--device
=
/dev/mkfd
--group-add
video
--cap-add
=
SYS_PTRACE
--security-opt
seccomp
=
unconfined
-u
root
-v
/path/your_code_data/:/path/your_code_data/
-v
/opt/hyhal/:/opt/hyhal/:ro image.sourcefind.cn:5000/dcu/admin/base/vllm
:
0.9.2-ubuntu22.04-dtk25.04.2-py3.10 bash
docker run
-it
--shm-size
200g
--network
=
host
--name
{
docker_name
}
--privileged
--device
=
/dev/kfd
--device
=
/dev/dri
--device
=
/dev/mkfd
--group-add
video
--cap-add
=
SYS_PTRACE
--security-opt
seccomp
=
unconfined
-u
root
-v
/path/your_code_data/:/path/your_code_data/
-v
/opt/hyhal/:/opt/hyhal/:ro image.sourcefind.cn:5000/dcu/admin/base/
custom:
vllm0.9.2-ubuntu22.04-dtk25.04.2-py3.10
-paddleocr-vl
bash
cd
/your_code_path/paddleocr-vl_paddle
```
...
...
@@ -34,7 +34,7 @@ cd /your_code_path/paddleocr-vl_paddle
cd
docker
docker build
--no-cache
-t
paddleocr-vl:latest
.
docker run
-it
--shm-size
200g
--network
=
host
--name
{
docker_name
}
--privileged
--device
=
/dev/kfd
--device
=
/dev/dri
--device
=
/dev/mkfd
--group-add
video
--cap-add
=
SYS_PTRACE
--security-opt
seccomp
=
unconfined
-u
root
-v
/path/your_code_data/:/path/your_code_data/
-v
/opt/hyhal/:/opt/hyhal/:ro image.sourcefind.cn:5000/dcu/admin/base/vllm
:
0.9.2-ubuntu22.04-dtk25.04.2-py3.10 bash
docker run
-it
--shm-size
200g
--network
=
host
--name
{
docker_name
}
--privileged
--device
=
/dev/kfd
--device
=
/dev/dri
--device
=
/dev/mkfd
--group-add
video
--cap-add
=
SYS_PTRACE
--security-opt
seccomp
=
unconfined
-u
root
-v
/path/your_code_data/:/path/your_code_data/
-v
/opt/hyhal/:/opt/hyhal/:ro image.sourcefind.cn:5000/dcu/admin/base/
custom:
vllm0.9.2-ubuntu22.04-dtk25.04.2-py3.10
-paddleocr-vl
bash
cd
/your_code_path/paddleocr-vl_paddle
```
...
...
@@ -71,7 +71,7 @@ paddleocr doc_parser -i ./doc/paddleocr_vl_demo.png --device DCU --precision fp3
### vllm
serve端
```
bash
export
PADDLE_PDX_DISABLE_DEV_MODEL_WL
=
true
export
PADDLE_PDX_DISABLE_DEV_MODEL_WL
=
1
vllm serve PaddlePaddle/PaddleOCR-VL
--trust-remote-code
--max-model-len
16384
--max-num-batched-tokens
16384
--gpu-memory-utilization
0.8
--served-model-name
PaddleOCR-VL-0.9B
```
...
...
@@ -113,7 +113,7 @@ OCR
-
[
PaddleOCR-VL
](
https://huggingface.co/PaddlePaddle/PaddleOCR-VL
)
## 源码仓库及问题反馈
-
https://developer.sourcefind.cn/codes/modelzoo/
deepseek-ocr_pytorch
-
https://developer.sourcefind.cn/codes/modelzoo/
paddleocr-vl_paddle
## 参考资料
-
https://github.com/PaddlePaddle/PaddleOCR
...
...
paddleocr-vl-image.py
View file @
4c24053d
...
...
@@ -7,5 +7,5 @@ pipeline = PaddleOCRVL(device='DCU')
output
=
pipeline
.
predict
(
"./doc/paddleocr_vl_demo.png"
)
for
res
in
output
:
res
.
print
()
## 打印预测的结构化输出
res
.
save_to_json
(
save_path
=
"output"
)
## 保存当前图像的结构化json结果
res
.
save_to_markdown
(
save_path
=
"output"
)
## 保存当前图像的markdown格式的结果
res
.
save_to_json
(
save_path
=
"output
-jpg
"
)
## 保存当前图像的结构化json结果
res
.
save_to_markdown
(
save_path
=
"output
-jpg
"
)
## 保存当前图像的markdown格式的结果
paddleocr-vl-pdf.py
View file @
4c24053d
from
pathlib
import
Path
from
paddleocr
import
PaddleOCRVL
input_file
=
"./
your_pdf_file
.pdf"
output_path
=
Path
(
"./output"
)
input_file
=
"./
doc/PaddleOCR-VL
.pdf"
output_path
=
Path
(
"./output
-pdf
"
)
pipeline
=
PaddleOCRVL
(
device
=
'DCU'
)
output
=
pipeline
.
predict
(
input
=
input_file
)
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment