wangsen / MinerU
Commit b22e87cd (unverified), authored Feb 25, 2025 by Xiaomeng Zhao, committed by GitHub on Feb 25, 2025
Merge pull request #1769 from myhloli/dev
docs(ascend): update Ascend NPU acceleration documentation
Parents: ab551720, aea10cfe
Changes: 4 changed files with 21 additions and 6 deletions
README.md (+0 -1)
docs/README_Ascend_NPU_Acceleration_zh_CN.md (+19 -3)
docs/README_Ubuntu_CUDA_Acceleration_en_US.md (+1 -1)
docs/README_Ubuntu_CUDA_Acceleration_zh_CN.md (+1 -1)
README.md
@@ -47,7 +47,6 @@ Easier to use: Just grab MinerU Desktop. No coding, no login, just a simple inte
</div>
</div>
# Changelog
- 2025/02/24 1.2.0 released. This version includes several fixes and improvements to enhance parsing efficiency and accuracy:
  - Performance Optimization
    - Increased classification speed for PDF documents in auto mode.
...
docs/README_Ascend_NPU_Acceleration_zh_CN.md
@@ -2,11 +2,13 @@
 ## Introduction
-This document describes how to use MinerU on Ascend NPU. Its contents have been tested on a `华为Atlas 800T A2` server.
+This document describes how to use MinerU on Ascend NPU. Its contents have been tested on a `华为 Atlas 800T A2` server.
 ```
 CPU: Kunpeng 920 aarch64 2.6GHz
 NPU: Ascend 910B 64GB
-OS: openEuler 22.03 (LTS-SP3)
+OS: openEuler 22.03 (LTS-SP3) / Ubuntu 22.04.5 LTS
+CANN: 8.0.RC2
+Driver version: 24.1.rc2.1
 ```
 Because adapting the environment for Ascend NPU is fairly complex, running MinerU in a Docker container is recommended.
...
@@ -54,4 +56,18 @@ magic-pdf --help
 - paddleocr uses an embedded onnx model and recognizes Chinese and English at a reasonably fast speed only under the default language configuration
 - When a custom lang parameter is set, paddleocr slows down noticeably
 - The layout model crashes intermittently when layoutlmv3 is used; the default doclayout_yolo model is recommended
 - Table parsing has only been adapted for the rapid_table model; other models may not work
\ No newline at end of file
+## High-performance mode
+- In specific hardware environments, a plugin can enable high-performance mode, which improves overall speed by more than 300% compared with the default mode
+| System requirement | Version/model |
+|----------------|--------------|
+| Chip type | Ascend 910B |
+| CANN version | CANN 8.0.RC2 |
+| Driver version | 24.1.rc2.1 |
+| magic-pdf software version | \>= 1.2.0 |
+- The high-performance plugin requires certain hardware conditions and qualifications. To apply for access, fill out the following form: [MinerU High-Performance Version Cooperation Application Form](https://aicarrier.feishu.cn/share/base/form/shrcnb10VaoNQB8kQPA8DEfZC6d)
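Reviewer note: the requirements table in this hunk sets a minimum magic-pdf version of 1.2.0. The following is a minimal Python sketch of how that one requirement could be checked in a script. It assumes the package is installed under the distribution name `magic-pdf`; `parse_release`, `meets_minimum`, and `installed_version` are hypothetical helpers, not part of MinerU. The chip type, CANN version, and driver version are not checked here.

```python
import re
from importlib import metadata

MIN_MAGIC_PDF = "1.2.0"  # minimum stated in the requirements table


def parse_release(version):
    """Return the leading numeric components of a version string as a tuple."""
    # Hypothetical helper: "1.2.0" -> (1, 2, 0); a suffix such as "rc1" is ignored.
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(installed, minimum=MIN_MAGIC_PDF):
    """True if the installed release is at least the minimum release."""
    a, b = parse_release(installed), parse_release(minimum)
    width = max(len(a), len(b))
    # Pad with zeros so that "1.2" compares equal to "1.2.0".
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def installed_version(dist_name="magic-pdf"):
    """Return the installed version of dist_name, or None if it is absent."""
    # Assumption: the distribution name is "magic-pdf".
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    if found is None:
        print("magic-pdf is not installed")
    else:
        print(f"magic-pdf {found}: meets minimum = {meets_minimum(found)}")
```

The comparison deliberately ignores pre-release suffixes, so it is only a coarse gate; a stricter check would need full PEP 440 ordering.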
docs/README_Ubuntu_CUDA_Acceleration_en_US.md
@@ -117,7 +117,7 @@ If your graphics card has at least **8GB** of VRAM, follow these steps to test C
 1. Download `paddlepaddle-gpu`. Installation will automatically enable OCR acceleration.
 ```sh
-python -m pip install paddlepaddle-gpu==3.0.0b1 -i https://www.paddlepaddle.org.cn/packages/stable/cu118/
+python -m pip install paddlepaddle-gpu==3.0.0rc1 -i https://www.paddlepaddle.org.cn/packages/stable/cu118/
 ```
 2. Test OCR acceleration with the following command:
...
docs/README_Ubuntu_CUDA_Acceleration_zh_CN.md
@@ -122,7 +122,7 @@ magic-pdf -p small_ocr.pdf -o ./output
 **1. Download paddlepaddle-gpu; OCR acceleration is enabled automatically once installation completes**
 ```bash
-python -m pip install paddlepaddle-gpu==3.0.0b1 -i https://www.paddlepaddle.org.cn/packages/stable/cu118/
+python -m pip install paddlepaddle-gpu==3.0.0rc1 -i https://www.paddlepaddle.org.cn/packages/stable/cu118/
 ```
 **2. Run the following command to test OCR acceleration**
...
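Reviewer note: both CUDA guides now pin `paddlepaddle-gpu==3.0.0rc1`. Before running the OCR test, it may help to confirm which build is actually importable. The following is a minimal sketch, assuming the package imports as `paddle`; `paddle_status` is a hypothetical helper name, and it returns a message rather than raising when paddle is absent.

```python
def paddle_status():
    """Describe the local paddle install without raising if it is missing."""
    try:
        import paddle  # assumption: paddlepaddle-gpu is importable as "paddle"
    except ImportError:
        return "paddle is not installed"
    # Reports whether this build was compiled with CUDA support,
    # not whether a GPU is currently usable.
    cuda_build = paddle.device.is_compiled_with_cuda()
    return f"paddle {paddle.__version__} (compiled with CUDA: {cuda_build})"


if __name__ == "__main__":
    print(paddle_status())
```

After the upgrade in this commit, the reported version would be expected to be the rc1 build rather than b1; a CPU-only build here would suggest the wrong wheel was installed.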