Commits · ccd2a71fbb547e5ff53a510c8eb5d854ec0bad4b · wangsen / MinerU

15 Jun, 2025 1 commit
- fix: update sglang version to 0.4.7 and adjust changelog for compatibility issues · e1181ba8
  myhloli authored Jun 15, 2025
  
  e1181ba8
14 Jun, 2025 5 commits
- feat: add Docker Compose configuration and update README for container startup · 55cb4c90
  myhloli authored Jun 15, 2025
  
  55cb4c90
- docs: update README files with Docker run command for GPU support · f51c1acc
  myhloli authored Jun 15, 2025
  
  f51c1acc
- chore: update changelog for version 2.0.2 release with bug fixes and Dockerfile updates · 9b279553
  myhloli authored Jun 15, 2025
  
  9b279553
- fix: update Dockerfile and README files to use updated mineru installation commands · 4cb28fdf
  myhloli authored Jun 15, 2025
  
  4cb28fdf
- fix: add pdftext link to README and README_zh-CN for completeness · 6e399282
  myhloli authored Jun 14, 2025
  
  6e399282
13 Jun, 2025 9 commits
- fix: update Table of Contents in README and README_zh-CN for clarity and consistency · d8989ed1
  myhloli authored Jun 13, 2025
  
  d8989ed1
- feat: add MCP server documentation to README and README_zh-CN · 4338b633
  myhloli authored Jun 13, 2025
  
  4338b633
- fix: update README to clarify licensing implications of YOLO models and future... · 6526080e
  myhloli authored Jun 13, 2025
```
fix: update README to clarify licensing implications of YOLO models and future plans for permissive alternatives
```
  6526080e
- fix: update output file documentation links in README for English and remove Chinese references · 070c7f15
  myhloli authored Jun 13, 2025
  
  070c7f15
- feat: update PyPI links in README and README_zh-CN to reflect new package name · ced5a7b4
  myhloli authored Jun 13, 2025
  
  ced5a7b4
- feat: update PyPI links in README and README_zh-CN to reflect new package name · 1903d4a8
  myhloli authored Jun 13, 2025
  
  1903d4a8
- feat: add VLM demo section and update Hugging Face demo links in README and README_zh-CN · 1694a079
  myhloli authored Jun 13, 2025
  
  1694a079
- feat: add VLM demo section and update Hugging Face demo links in README and README_zh-CN · 7d77539c
  myhloli authored Jun 13, 2025
  
  7d77539c
- feat: update README and README_zh-CN to reflect MinerU 2.0 features,... · 7c0d119a
  myhloli authored Jun 13, 2025
```
feat: update README and README_zh-CN to reflect MinerU 2.0 features, installation instructions, and usage examples
```
  7c0d119a
24 May, 2025 1 commit
- feat(docs): update changelog for PP-OCRv5 model support and handwritten... · 73f0530d
  myhloli authored May 24, 2025
```
feat(docs): update changelog for PP-OCRv5 model support and handwritten document recognition enhancements
```
  73f0530d
14 May, 2025 1 commit
- docs(changelog): remove pdfminer.six version pinning from release notes · 51ceb480
  myhloli authored May 14, 2025
  
  51ceb480
09 May, 2025 1 commit
- docs(installation): update Python version and CUDA installation instructions · 9f0d45bb
  myhloli authored May 09, 2025
  
  9f0d45bb
29 Apr, 2025 1 commit

feat(model_utils): adjust table detection threshold and add features · 49a8f8be

myhloli authored Apr 29, 2025

- Adjust the threshold for considering tables inside other tables from2 to 3
- Add support for custom formula delimiters through user configuration
- Pin pdfminer.six to version 20250324 to prevent parsing failures

49a8f8be

27 Apr, 2025 1 commit

feat(pdf): optimize formula parsing and update pdfminer.six · 0807e971

myhloli authored Apr 27, 2025

- Improve formula parsing success rate for better formula rendering
- Upgrade pdfminer.six to the latest version to fix PDF parsing issues- Update changelog in both English and Chinese README files

0807e971

24 Apr, 2025 1 commit
- docs(README): fix typo · c1558af3
  小林在忙毕业设计 authored Apr 24, 2025
  
  c1558af3
23 Apr, 2025 2 commits
- docs(README): update changelog for version 1.3.8 release · e0dc6c84
  myhloli authored Apr 23, 2025
  
  e0dc6c84
- feat(ocr): add new Chinese OCR model and update language support · 4f88fcaa
  myhloli authored Apr 23, 2025
```
- Add new Chinese OCR model (ch_PP-OCRv4_rec_server_doc_infer) for server-side use
- Update language support in app.py to include new Chinese model
- Modify models_config.yml to add new model configuration
```
  4f88fcaa
22 Apr, 2025 2 commits

fix(lang|performance): resolve lang parameter issue and speed up OCR/table parsing · 9c4e779b

myhloli authored Apr 22, 2025

- Fix lang parameter ineffectiveness during table parsing model initialization
- Resolve significant slowdown in OCR and table parsing speed in CPU mode
- Update changelog in README.md and README_zh-CN.md

9c4e779b

fix(lang|performance): resolve lang parameter issue and speed up OCR/table parsing · 8d9070db

myhloli authored Apr 22, 2025

- Fix lang parameter ineffectiveness during table parsing model initialization
- Resolve significant slowdown in OCR and table parsing speed in CPU mode
- Update changelog in README.md and README_zh-CN.md

8d9070db

16 Apr, 2025 1 commit

docs(README): update changelog for v1.3.4 release · 1705958f

myhloli authored Apr 16, 2025

- Update README.md and README_zh-CN.md with the latest changes
- Add new release notes for version 1.3.4
- Include improvements in OCR detection speed and page-level sorting

1705958f

12 Apr, 2025 2 commits

docs(readme): update release notes for English and Chinese README files · a69b97c9

myhloli authored Apr 12, 2025

- Update version history in both English and Chinese README files
- Add note about model update required for fixing word concatenation issue- Ensure consistency between English and Chinese versions

a69b97c9

docs(README): update version history and installation instructions · 437311f5

myhloli authored Apr 12, 2025

- Update version history in README.md and README_zh-CN.md
- Add details for 1.3.2 release and previous versions
- Update Windows CUDA acceleration installation instructions
- Refactor changelog entries for better readability and organization

437311f5

08 Apr, 2025 5 commits

docs: update version number in README files · bc0ff1ac

myhloli authored Apr 08, 2025

- Correct version number from 1.3.2 to 1.3.1 in both README.md and README_zh-CN.md
- Update changelog entries for the latest release

bc0ff1ac

docs(README): update version number and changelog in README files · bd4728aa
myhloli authored Apr 08, 2025
```
- Update version number from 1.3.1 to 1.3.2
```
bd4728aa

docs(README): update version number in release notes · 0ab29cdb

myhloli authored Apr 08, 2025

- Update version from1.3.1 to 1.3.2 in both English and Chinese README files
- Keep other content unchanged

0ab29cdb

docs: update badges and project URLs- Update PyPI version badge to use shields.io · 90f0e737
myhloli authored Apr 08, 2025
```
- Add project URLs in setup.py for better discoverability
- Make consistent changes across README.md and README_zh-CN.md
```
90f0e737

docs(install): update Python version requirements and simplify torch installation · 4fd8d626

myhloli authored Apr 08, 2025

- Update Python version requirements to >=3.10
- Simplify torch installation command- Remove numpy version restriction
- Update CUDA compatibility information
- Adjust environment creation commands across multiple documentation files

4fd8d626

03 Apr, 2025 4 commits

docs(readme): update changelog and highlight usability improvements · 4067f6fd

myhloli authored Apr 03, 2025

- Remove duplicate entries for paddleocr2torch and thread safety
- Add new entry for real-time progress bar implementation
- Update mfr model to unimernet(2503)
- Extend torch version compatibility
- Enhance cuda support for various GPU models
- Improve parsing speed on MPS devices

4067f6fd

docs(readme): update release notes for version 1.3.0 · 5c2e25ac

myhloli authored Apr 03, 2025

- Update release notes in both English and Chinese README files
- Highlight major optimizations and improvements in version 1.3.0
- Clarify compatibility changes for torch, CUDA, and Python versions
- Emphasize performance improvements and parsing speed enhancements
- Mention specific bug fixes and parsing effect optimizations

5c2e25ac

docs(readme): update changelog and compatibility information · 0544996f

myhloli authored Apr 03, 2025

- Update changelog for version 1.3.0 release
- Clarify CUDA and GPU compatibility improvements
- Add information about batch processing speed improvements
- Update model download process and memory usage optimizations
- Include link to batch processing demo script

0544996f

docs(README): update model config examples and add tqdm dependency · 86058278
myhloli authored Apr 03, 2025
```
- Remove outdated comments in table-config examples
- Add tqdm to requirements in all Docker environments
```
86058278

02 Apr, 2025 3 commits

docs: add RapidOCR and PaddleOCR2Pytorch to Acknowledgments list · 7a0b87d5

myhloli authored Apr 02, 2025

- Add RapidOCR and PaddleOCR2Pytorch to the Acknowledgments list in README.md
- Add RapidOCR and PaddleOCR2Pytorch to the Acknowledgments list in README_zh-CN.md

7a0b87d5

feat(README): update changelog for version 1.3.0 release · 0eff993a

myhloli authored Apr 02, 2025

- Installation and compatibility optimizations:
- Replace PaddleOCR with paddleocr2torch to resolve conflicts between Paddle and PyTorch
  - Remove layoutlmv3 usage to solve compatibility issues with detectron2
  - Extend PyTorch version compatibility to2.2~2.6  - Extend CUDA compatibility to 11.8~12.6
  - Extend Python version compatibility to 3.10~3.12

- Performance optimizations:
 - Support batch processing for multiple PDF files
  - Optimize mfr model loading and usage to reduce memory consumption and improve speed
  - Reduce minimum memory requirement to 6GB
  - Improve running speed on MPS devices

- Parsing effect optimization:
  - Update mfr model to unimernet(2503) to fix line break issues in multi-line formulas

0eff993a

docs(README): update system requirements and GPU support · 298305dd

myhloli authored Apr 02, 2025

- Update Python version requirement to 3.10-3.12
- Expand CUDA environment options to 11.8/12.4/12.6
- Update GPU VRAM requirement to 6GB or more
-

298305dd