Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
wangsen
MinerU
Commits
6e0d9a3e
"vscode:/vscode.git/clone" did not exist on "4422eaab6b9924b67e714af3006934b4049af199"
Unverified
Commit
6e0d9a3e
authored
Feb 25, 2025
by
Xiaomeng Zhao
Committed by
GitHub
Feb 25, 2025
Browse files
Merge pull request #1771 from opendatalab/release-1.2.0
Release 1.2.0
parents
815551fe
dd7af4d4
Changes
4
Hide whitespace changes
Inline
Side-by-side
Showing
4 changed files
with
21 additions
and
6 deletions
+21
-6
README.md
README.md
+0
-1
docs/README_Ascend_NPU_Acceleration_zh_CN.md
docs/README_Ascend_NPU_Acceleration_zh_CN.md
+19
-3
docs/README_Ubuntu_CUDA_Acceleration_en_US.md
docs/README_Ubuntu_CUDA_Acceleration_en_US.md
+1
-1
docs/README_Ubuntu_CUDA_Acceleration_zh_CN.md
docs/README_Ubuntu_CUDA_Acceleration_zh_CN.md
+1
-1
No files found.
README.md
View file @
6e0d9a3e
...
...
@@ -47,7 +47,6 @@ Easier to use: Just grab MinerU Desktop. No coding, no login, just a simple inte
</div>
# Changelog
-
2025/02/24 1.2.0 released. This version includes several fixes and improvements to enhance parsing efficiency and accuracy:
-
Performance Optimization
-
Increased classification speed for PDF documents in auto mode.
...
...
docs/README_Ascend_NPU_Acceleration_zh_CN.md
View file @
6e0d9a3e
...
...
@@ -2,11 +2,13 @@
## 简介
本文档介绍如何在 Ascend NPU 上使用 MinerU。本文档内容已在
`华为Atlas 800T A2`
服务器上测试通过。
本文档介绍如何在 Ascend NPU 上使用 MinerU。本文档内容已在
`华为
Atlas 800T A2`
服务器上测试通过。
```
CPU:鲲鹏 920 aarch64 2.6GHz
NPU:Ascend 910B 64GB
OS:openEuler 22.03 (LTS-SP3)
OS:openEuler 22.03 (LTS-SP3)/ Ubuntu 22.04.5 LTS
CANN:8.0.RC2
驱动版本:24.1.rc2.1
```
由于适配 Ascend NPU 的环境较为复杂,建议使用 Docker 容器运行 MinerU。
...
...
@@ -54,4 +56,18 @@ magic-pdf --help
-
paddleocr使用内嵌onnx模型,仅在默认语言配置下能以较快速度对中英文进行识别
-
自定义lang参数时,paddleocr速度会存在明显下降情况
-
layout模型使用layoutlmv3时会发生间歇性崩溃,建议使用默认配置的doclayout_yolo模型
-
表格解析仅适配了rapid_table模型,其他模型可能会无法使用
\ No newline at end of file
-
表格解析仅适配了rapid_table模型,其他模型可能会无法使用
## 高性能模式
-
在特定硬件环境可以通过插件开启高性能模式,整体速度相比默认模式提升300%以上
| 系统要求 | 版本/型号 |
|----------------|--------------|
| 芯片类型 | 昇腾910B |
| CANN版本 | CANN 8.0.RC2 |
| 驱动版本 | 24.1.rc2.1 |
| magic-pdf 软件版本 |
\>
= 1.2.0 |
-
高性能插件需满足一定的硬件条件和资质要求,如需申请使用请填写以下表单
[
MinerU高性能版本合作申请表
](
https://aicarrier.feishu.cn/share/base/form/shrcnb10VaoNQB8kQPA8DEfZC6d
)
docs/README_Ubuntu_CUDA_Acceleration_en_US.md
View file @
6e0d9a3e
...
...
@@ -117,7 +117,7 @@ If your graphics card has at least **8GB** of VRAM, follow these steps to test C
1.
Download
`paddlepaddle-gpu`
. Installation will automatically enable OCR acceleration.
```
sh
python
-m
pip
install
paddlepaddle-gpu
==
3.0.0
b
1
-i
https://www.paddlepaddle.org.cn/packages/stable/cu118/
python
-m
pip
install
paddlepaddle-gpu
==
3.0.0
rc
1
-i
https://www.paddlepaddle.org.cn/packages/stable/cu118/
```
2.
Test OCR acceleration with the following command:
```
sh
...
...
docs/README_Ubuntu_CUDA_Acceleration_zh_CN.md
View file @
6e0d9a3e
...
...
@@ -122,7 +122,7 @@ magic-pdf -p small_ocr.pdf -o ./output
**1.下载paddlepaddle-gpu, 安装完成后会自动开启ocr加速**
```
bash
python
-m
pip
install
paddlepaddle-gpu
==
3.0.0
b
1
-i
https://www.paddlepaddle.org.cn/packages/stable/cu118/
python
-m
pip
install
paddlepaddle-gpu
==
3.0.0
rc
1
-i
https://www.paddlepaddle.org.cn/packages/stable/cu118/
```
**2.运行以下命令测试ocr加速效果**
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment