Commits · 4067f6fdf424df8725fa0d6f0cde8dcaf654ee93 · wangsen / MinerU

03 Apr, 2025 18 commits

docs(readme): update changelog and highlight usability improvements · 4067f6fd

myhloli authored Apr 03, 2025

- Remove duplicate entries for paddleocr2torch and thread safety
- Add new entry for real-time progress bar implementation
- Update mfr model to unimernet(2503)
- Extend torch version compatibility
- Enhance cuda support for various GPU models
- Improve parsing speed on MPS devices

4067f6fd

docs(readme): update release notes for version 1.3.0 · 5c2e25ac

myhloli authored Apr 03, 2025

- Update release notes in both English and Chinese README files
- Highlight major optimizations and improvements in version 1.3.0
- Clarify compatibility changes for torch, CUDA, and Python versions
- Emphasize performance improvements and parsing speed enhancements
- Mention specific bug fixes and parsing effect optimizations

5c2e25ac

Merge pull request #2087 from icecraft/fix/convert_image_with_pymupdf · bb40b9b6
Xiaomeng Zhao authored Apr 03, 2025
```
fix: convert image with pymupdf
```
bb40b9b6
fix: convert image with pymupdf · 3e8ee23e
icecraft authored Apr 03, 2025

3e8ee23e
Merge pull request #2086 from icecraft/fix/support_non_pdf_in_batch · 14097d4e
Xiaomeng Zhao authored Apr 03, 2025
```
fix: support non-pdf file in batch mode
```
14097d4e
fix: support non-pdf file in batch mode · 3379f3b3
icecraft authored Apr 03, 2025

3379f3b3
Merge pull request #2083 from myhloli/dev · e38efb97
Xiaomeng Zhao authored Apr 03, 2025
```
feat(web_api): update configuration and remove unused code
```
e38efb97

feat(web_api): update configuration and remove unused code · 3a820305

myhloli authored Apr 03, 2025

- Comment out PaddlePaddle GPU installation in Dockerfile
- Add OCR model download URL in download_models.py
- Update config version in magic-pdf.json
- Remove outdated information and simplify README.md
- Remove volume creation for PaddleOCR models in Dockerfile

3a820305

Merge pull request #2081 from myhloli/dev · 5c46c791
Xiaomeng Zhao authored Apr 03, 2025
```
docs(user_guide): update installation guide and CUDA support
```
5c46c791

docs(user_guide): update installation guide and CUDA support · b51ac110

myhloli authored Apr 03, 2025

- Update CUDA version requirements to 12.4 and higher
- Add support for CUDA 12.6 and CANN environments
- Update Python version requirements to 3.10-3.12
- Remove paddlepaddle-gpu installation and related instructions
- Update magic-pdf installation command to use Aliyun mirror
- Add storage requirements and update memory requirements
- Update GPU hardware support list to include all GPUs with Tensor Cores
- Add support for Apple Silicon

b51ac110

Merge pull request #2079 from myhloli/dev · 9ffdd0df
Xiaomeng Zhao authored Apr 03, 2025
```
docs(readme): update changelog and compatibility information
```
9ffdd0df

docs(readme): update changelog and compatibility information · 0544996f

myhloli authored Apr 03, 2025

- Update changelog for version 1.3.0 release
- Clarify CUDA and GPU compatibility improvements
- Add information about batch processing speed improvements
- Update model download process and memory usage optimizations
- Include link to batch processing demo script

0544996f

Merge pull request #2077 from myhloli/dev · fe4e62a7
Xiaomeng Zhao authored Apr 03, 2025
```
feat(model): add tqdm progress bar to model prediction loops
```
fe4e62a7

refactor(magic_pdf): optimize table recognition and layout detection · 1fd72f5f

myhloli authored Apr 03, 2025

- Update table recognition logic to process each table individually
- Refactor layout detection to use tqdm for progress tracking
- Optimize OCR recognition by using a single tqdm wrapper
- Improve MFR prediction with a more accurate progress bar
- Simplify MFD prediction by removing unnecessary total calculation

1fd72f5f

refactor(magic_pdf): remove OCR timing measurement code · 795233d1

myhloli authored Apr 03, 2025

- Comment out OCR timing measurement code to improve readability and performance
- Remove unnecessary logging of OCR processing time

795233d1

refactor(magic_pdf): optimize code and improve logging · 553f250f

myhloli authored Apr 03, 2025

- Remove unused imports and comments
- Increase MIN_BATCH_INFERENCE_SIZE from 100 to 200
- Comment out VRAM cleaning and logging in batch_analyze.py
- Simplify code in doc_analyze_by_custom_model.py
- Add tqdm progress bar in pdf_parse_union_core_v2.py
- Enable tqdm in OCR processing

553f250f
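
The commit message does not say how MIN_BATCH_INFERENCE_SIZE is used. As a rough sketch, assuming it caps how many page images are grouped into one inference pass, the chunking could look like the following. The helper name `split_into_batches` is hypothetical and is not taken from the repository.

```python
# Value named in the commit message (raised from 100 to 200).
MIN_BATCH_INFERENCE_SIZE = 200


def split_into_batches(pages, batch_size=MIN_BATCH_INFERENCE_SIZE):
    """Group a sequence of pages into consecutive chunks of at most batch_size.

    Hypothetical helper: the real code may group pages differently.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    return [pages[i:i + batch_size] for i in range(0, len(pages), batch_size)]


if __name__ == "__main__":
    batches = split_into_batches(list(range(450)))
    print([len(b) for b in batches])
```

Under this assumption, a larger value means fewer, bigger batches: less per-batch overhead, at the cost of holding more pages in memory at once.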

docs(README): update model config examples and add tqdm dependency · 86058278

myhloli authored Apr 03, 2025

- Remove outdated comments in table-config examples
- Add tqdm to requirements in all Docker environments

86058278

feat(model): add tqdm progress bar to model prediction loops · 8e1c2339

myhloli authored Apr 03, 2025

- Add tqdm progress bar to batch prediction loops in multiple model modules
- Improve logging and error handling in batch analysis script
- Update table model initialization to use default sub-model if none specified
- Add tqdm dependency to requirements.txt

8e1c2339
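
For readers unfamiliar with the pattern, a batch prediction loop wrapped in a tqdm progress bar might look roughly like this. It is a minimal sketch, not the repository's code: `predict_in_batches` and `predict_batch` are hypothetical names, and the small fallback class exists only so the sketch runs where tqdm is not installed.

```python
try:
    from tqdm import tqdm  # real progress bar when tqdm is installed
except ImportError:
    class tqdm:  # minimal stand-in exposing only what is used below
        def __init__(self, total=None, desc=None):
            self.n = 0

        def update(self, n=1):
            self.n += n

        def __enter__(self):
            return self

        def __exit__(self, *exc):
            return False


def predict_in_batches(items, predict_batch, batch_size=16, desc="Predict"):
    """Run predict_batch over items in chunks, advancing one bar per item.

    Hypothetical helper; predict_batch stands in for any model call that
    maps a list of inputs to a list of outputs of the same length.
    """
    results = []
    with tqdm(total=len(items), desc=desc) as pbar:
        for start in range(0, len(items), batch_size):
            batch = items[start:start + batch_size]
            results.extend(predict_batch(batch))
            pbar.update(len(batch))  # advance by items processed, not by batch
    return results
```

Advancing the bar by `len(batch)` rather than by one per batch keeps the count accurate when the final batch is smaller than `batch_size`.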

02 Apr, 2025 22 commits

Merge pull request #2073 from myhloli/dev · 09bd890e
Xiaomeng Zhao authored Apr 03, 2025
```
feat(model): update Chinese OCR detection model to PP-OCRv3
```
09bd890e

feat(model): update Chinese OCR detection model to PP-OCRv3 · ddfeea94

myhloli authored Apr 03, 2025

- Replace ch_PP-OCRv4_det_infer.pth with ch_PP-OCRv3_det_infer.pth in models_config.yml
- Add new ch_PP-OCRv3_det_infer model configuration in arch_config.yaml

ddfeea94

Merge pull request #2071 from myhloli/dev · bb30f32e
Xiaomeng Zhao authored Apr 03, 2025
```
refactor(ocr): remove redundant code and improve code quality
```
bb30f32e

refactor(ocr): remove redundant code and improve code quality · c4010ae0

myhloli authored Apr 03, 2025

- Remove unnecessary GPU checks and cuda() calls
- Consolidate tensor device placement using .to(self.device)
- Add warning suppression for cleaner output
- Refactor conditional logic for better readability

c4010ae0

Merge pull request #2069 from myhloli/dev · a8888420
Xiaomeng Zhao authored Apr 02, 2025
```
refactor(demo): simplify batch_demo.py and update demo.py
```
a8888420

refactor(demo): simplify batch_demo.py and update demo.py · b0e220c5

myhloli authored Apr 02, 2025

- Remove unnecessary imports and code in batch_demo.py
- Update demo.py to use relative paths and improve code structure
- Adjust output directory structure in both scripts
- Remove redundant code and simplify functions

b0e220c5

Merge pull request #2066 from icecraft/feat/add_batch_example · dbdb99f9
Xiaomeng Zhao authored Apr 02, 2025
```
feat: add batch example
```
dbdb99f9
feat: add batch example · d9136715
icecraft authored Apr 02, 2025

d9136715
Merge pull request #2064 from myhloli/dev · 96ab0ad8
Xiaomeng Zhao authored Apr 02, 2025
```
update released note
```
96ab0ad8

docs: add RapidOCR and PaddleOCR2Pytorch to Acknowledgments list · 7a0b87d5

myhloli authored Apr 02, 2025

- Add RapidOCR and PaddleOCR2Pytorch to the Acknowledgments list in README.md
- Add RapidOCR and PaddleOCR2Pytorch to the Acknowledgments list in README_zh-CN.md

7a0b87d5

feat(README): update changelog for version 1.3.0 release · 0eff993a

myhloli authored Apr 02, 2025

- Installation and compatibility optimizations:
  - Replace PaddleOCR with paddleocr2torch to resolve conflicts between Paddle and PyTorch
  - Remove layoutlmv3 usage to solve compatibility issues with detectron2
  - Extend PyTorch version compatibility to 2.2~2.6
  - Extend CUDA compatibility to 11.8~12.6
  - Extend Python version compatibility to 3.10~3.12

- Performance optimizations:
  - Support batch processing for multiple PDF files
  - Optimize mfr model loading and usage to reduce memory consumption and improve speed
  - Reduce minimum memory requirement to 6GB
  - Improve running speed on MPS devices

- Parsing effect optimization:
  - Update mfr model to unimernet(2503) to fix line break issues in multi-line formulas

0eff993a
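
The compatibility ranges above can be checked at startup. The sketch below covers only the Python range (3.10~3.12) from the changelog entry. The function name `python_supported` is hypothetical and is not part of magic-pdf.

```python
import sys

# Range taken from the changelog entry above.
MIN_PYTHON = (3, 10)
MAX_PYTHON = (3, 12)


def python_supported(version=None):
    """Return True if the (major, minor) version lies in the supported range.

    Hypothetical helper; defaults to the running interpreter's version.
    """
    major_minor = tuple((version or sys.version_info)[:2])
    return MIN_PYTHON <= major_minor <= MAX_PYTHON


if __name__ == "__main__":
    if not python_supported():
        print("Unsupported Python version:", sys.version.split()[0])
```

Comparing `(major, minor)` tuples avoids the string-comparison pitfall where `"3.9"` sorts after `"3.10"`.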

Merge pull request #2063 from myhloli/dev · 7de9668d
Xiaomeng Zhao authored Apr 02, 2025
```
update docs
```
7de9668d

docs(gpu): update CUDA acceleration documentation · a778645b

myhloli authored Apr 02, 2025

- Update CUDA version requirements to 12.4
- Recommend nvidia-driver-570-server for Ubuntu
- Remove Python version specification for conda environment
- Update magic-pdf version requirement to 1.3.0
- Simplify CUDA acceleration testing instructions
- Remove OCR acceleration with paddlepaddle-gpu
- Update torch and torchvision installation instructions for Windows

a778645b

docs(README): update system requirements and GPU support · 298305dd

myhloli authored Apr 02, 2025

- Update Python version requirement to 3.10-3.12
- Expand CUDA environment options to 11.8/12.4/12.6
- Update GPU VRAM requirement to 6GB or more

298305dd

Merge pull request #2062 from myhloli/dev · f4ffdfe8
Xiaomeng Zhao authored Apr 02, 2025
```
feat: support 3.10~3.12 & remove paddle
```
f4ffdfe8

build(deps): update package versions for linux and macos · cb3a4314

myhloli authored Apr 02, 2025

- Update matplotlib minimum version to 3.10 for Linux and macOS
- Specify version ranges for PyYAML, ftfy, openai, shapely, pyclipper, and omegaconf
- Update dill to version <1 for compatibility

cb3a4314

build(dependencies): update PyMuPDF, pydantic and transformers · 90321855

myhloli authored Apr 02, 2025

- Update PyMuPDF to version <1.25.0
- Update pydantic to version <2.11
- Update transformers to version <5.0.0
- Remove always_apply parameter from alb.ToGray in image processing

90321855

feat(ocr): update OCR utility and dependencies · d09464be

myhloli authored Apr 02, 2025

- Update the default configuration path in pytorchocr_utility.py
- Add required dependencies for paddleocr2pytorch in setup.py:
  - shapely
  - pyclipper
  - omegaconf

d09464be

refactor(model): update OCR model and remove unused configs · c45a706c

myhloli authored Apr 02, 2025

- Remove unused UniMERNet and LayoutLMv3 model configurations
- Update OCR model path and dictionary path for PaddleOCR
- Modify README to update system requirements and installation instructions
- Update setup.py to include new package data

c45a706c

refactor(magic_pdf): remove unused imports and update dependencies · 243bc58c

myhloli authored Apr 02, 2025

- Remove unused imports for concurrent.futures, multiprocessing, and paddle
- Delete commented-out code
- Update numpy dependency to remove upper version limit
- Remove InferenceResult import that was commented out

243bc58c

refactor(docker): remove unused packages and simplify Dockerfile commands · ddaa7158

myhloli authored Apr 02, 2025

- Remove paddleocr, paddlepaddle, rapidocr-paddle, and rapidocr-onnxruntime from requirements.txt files
- Simplify pip install commands in Dockerfiles
- Remove installation of paddlepaddle-gpu in china and global Dockerfiles
- Update requirements.txt files across all Docker configurations

ddaa7158

refactor: comment out paddleocr model copying code · 3bd1e0e4

myhloli authored Apr 02, 2025

- Commented out the code that copies the paddleocr model to user directory
- This change affects both download_models.py and download_models_hf.py scripts

3bd1e0e4