- 15 Jun, 2025 1 commit
-
-
myhloli authored
-
- 14 Jun, 2025 5 commits
- 13 Jun, 2025 9 commits
-
-
myhloli authored
-
myhloli authored
-
myhloli authored
fix: update README to clarify licensing implications of YOLO models and future plans for permissive alternatives
-
myhloli authored
-
myhloli authored
-
myhloli authored
-
myhloli authored
-
myhloli authored
-
myhloli authored
feat: update README and README_zh-CN to reflect MinerU 2.0 features, installation instructions, and usage examples
-
- 24 May, 2025 1 commit
-
-
myhloli authored
feat(docs): update changelog for PP-OCRv5 model support and handwritten document recognition enhancements
-
- 14 May, 2025 1 commit
-
-
myhloli authored
-
- 09 May, 2025 1 commit
-
-
myhloli authored
-
- 29 Apr, 2025 1 commit
-
-
myhloli authored
- Adjust the threshold for considering tables inside other tables from2 to 3 - Add support for custom formula delimiters through user configuration - Pin pdfminer.six to version 20250324 to prevent parsing failures
-
- 27 Apr, 2025 1 commit
-
-
myhloli authored
- Improve formula parsing success rate for better formula rendering - Upgrade pdfminer.six to the latest version to fix PDF parsing issues- Update changelog in both English and Chinese README files
-
- 24 Apr, 2025 1 commit
-
-
小林在忙毕业设计 authored
-
- 23 Apr, 2025 2 commits
- 22 Apr, 2025 2 commits
-
-
myhloli authored
- Fix lang parameter ineffectiveness during table parsing model initialization - Resolve significant slowdown in OCR and table parsing speed in CPU mode - Update changelog in README.md and README_zh-CN.md
-
myhloli authored
- Fix lang parameter ineffectiveness during table parsing model initialization - Resolve significant slowdown in OCR and table parsing speed in CPU mode - Update changelog in README.md and README_zh-CN.md
-
- 16 Apr, 2025 1 commit
-
-
myhloli authored
- Update README.md and README_zh-CN.md with the latest changes - Add new release notes for version 1.3.4 - Include improvements in OCR detection speed and page-level sorting
-
- 12 Apr, 2025 2 commits
-
-
myhloli authored
- Update version history in both English and Chinese README files - Add note about model update required for fixing word concatenation issue- Ensure consistency between English and Chinese versions
-
myhloli authored
- Update version history in README.md and README_zh-CN.md - Add details for 1.3.2 release and previous versions - Update Windows CUDA acceleration installation instructions - Refactor changelog entries for better readability and organization
-
- 08 Apr, 2025 5 commits
-
-
myhloli authored
- Correct version number from 1.3.2 to 1.3.1 in both README.md and README_zh-CN.md - Update changelog entries for the latest release
-
myhloli authored
- Update version number from 1.3.1 to 1.3.2
-
myhloli authored
- Update version from1.3.1 to 1.3.2 in both English and Chinese README files - Keep other content unchanged
-
myhloli authored
- Add project URLs in setup.py for better discoverability - Make consistent changes across README.md and README_zh-CN.md
-
myhloli authored
- Update Python version requirements to >=3.10 - Simplify torch installation command- Remove numpy version restriction - Update CUDA compatibility information - Adjust environment creation commands across multiple documentation files
-
- 03 Apr, 2025 4 commits
-
-
myhloli authored
- Remove duplicate entries for paddleocr2torch and thread safety - Add new entry for real-time progress bar implementation - Update mfr model to unimernet(2503) - Extend torch version compatibility - Enhance cuda support for various GPU models - Improve parsing speed on MPS devices
-
myhloli authored
- Update release notes in both English and Chinese README files - Highlight major optimizations and improvements in version 1.3.0 - Clarify compatibility changes for torch, CUDA, and Python versions - Emphasize performance improvements and parsing speed enhancements - Mention specific bug fixes and parsing effect optimizations
-
myhloli authored
- Update changelog for version 1.3.0 release - Clarify CUDA and GPU compatibility improvements - Add information about batch processing speed improvements - Update model download process and memory usage optimizations - Include link to batch processing demo script
-
myhloli authored
- Remove outdated comments in table-config examples - Add tqdm to requirements in all Docker environments
-
- 02 Apr, 2025 3 commits
-
-
myhloli authored
- Add RapidOCR and PaddleOCR2Pytorch to the Acknowledgments list in README.md - Add RapidOCR and PaddleOCR2Pytorch to the Acknowledgments list in README_zh-CN.md
-
myhloli authored
- Installation and compatibility optimizations: - Replace PaddleOCR with paddleocr2torch to resolve conflicts between Paddle and PyTorch - Remove layoutlmv3 usage to solve compatibility issues with detectron2 - Extend PyTorch version compatibility to2.2~2.6 - Extend CUDA compatibility to 11.8~12.6 - Extend Python version compatibility to 3.10~3.12 - Performance optimizations: - Support batch processing for multiple PDF files - Optimize mfr model loading and usage to reduce memory consumption and improve speed - Reduce minimum memory requirement to 6GB - Improve running speed on MPS devices - Parsing effect optimization: - Update mfr model to unimernet(2503) to fix line break issues in multi-line formulas
-
myhloli authored
- Update Python version requirement to 3.10-3.12 - Expand CUDA environment options to 11.8/12.4/12.6 - Update GPU VRAM requirement to 6GB or more -
-