- 16 Apr, 2025 3 commits
-
-
myhloli authored
- Update README.md and README_zh-CN.md with the latest changes
- Add new release notes for version 1.3.4
- Include improvements in OCR detection speed and page-level sorting
-
myhloli authored
- Modify `ocr_detect_all_bboxes.py` to return footnote blocks
- Update `pdf_parse_union_core_v2.py` to handle footnote blocks in line sorting and layout splitting
- This change improves the accuracy of layout analysis by considering footnote blocks separately
-
myhloli authored
- Rename test file from test_tablemaster.py to test_rapidtable.py
- Replace TableMasterPaddleModel with RapidTableModel
- Update test case to use new model and adjust assertions accordingly
- Remove some outdated assertions and comments
-
- 15 Apr, 2025 3 commits
-
-
Xiaomeng Zhao authored
feat(model): add text region handling and improve overlap resolution
-
myhloli authored
- Add text region handling in get_res_list_from_layout_res function
- Implement remove_overlaps_min_blocks function to handle overlapping blocks
- Update OCR region handling to include text regions
- Improve overlap resolution for all regions in layout results
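The overlap handling described in this commit can be illustrated with a minimal sketch. It is not the repository's remove_overlaps_min_blocks; the function names, the `(x0, y0, x1, y1)` box format, and the 0.8 threshold are all assumptions made for illustration. The idea shown: when two blocks overlap by more than a given share of the smaller block's area, keep the larger block and drop the smaller one.

```python
def area(box):
    """Area of an (x0, y0, x1, y1) box; degenerate boxes count as 0."""
    x0, y0, x1, y1 = box
    return max(0, x1 - x0) * max(0, y1 - y0)


def overlap_ratio_of_smaller(a, b):
    """Intersection area divided by the area of the smaller box."""
    inter = area((max(a[0], b[0]), max(a[1], b[1]),
                  min(a[2], b[2]), min(a[3], b[3])))
    smaller = min(area(a), area(b))
    return inter / smaller if smaller > 0 else 0.0


def drop_smaller_overlapping_blocks(blocks, threshold=0.8):
    """Hypothetical helper: drop the smaller block of each heavily overlapping pair."""
    dropped = set()
    for i in range(len(blocks)):
        for j in range(i + 1, len(blocks)):
            if i in dropped or j in dropped:
                continue
            if overlap_ratio_of_smaller(blocks[i], blocks[j]) > threshold:
                # Keep the larger block, mark the smaller one for removal.
                dropped.add(i if area(blocks[i]) <= area(blocks[j]) else j)
    return [b for k, b in enumerate(blocks) if k not in dropped]


blocks = [(0, 0, 100, 100), (10, 10, 50, 50), (200, 200, 300, 300)]
kept = drop_smaller_overlapping_blocks(blocks)
```

Here the second block lies entirely inside the first, so only the first and third blocks are kept.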
-
myhloli authored
- Add pip install command for torch and torchvision
- Specify version 2.3.1 for both packages
- Use Aliyun mirror for faster download
-
- 14 Apr, 2025 14 commits
-
-
Xiaomeng Zhao authored
Dev
-
Xiaomeng Zhao authored
master->dev
-
github-actions[bot] authored
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.3
-
Xiaomeng Zhao authored
build(deps): downgrade dill to 0.3.8 for doclayout_yolo compatibility
-
Xiaomeng Zhao authored
build(deps): downgrade dill to 0.3.8 for doclayout_yolo compatibility
-
myhloli authored
- Change dill dependency from >=0.3.9,<1 to >=0.3.8,<1
- Update dill version in both general and specific requirements
-
Xiaomeng Zhao authored
fix(magic_pdf): correct range for images in document analysis
-
myhloli authored
- Update the range used to generate images_with_extra_info to match the number of images
- This fixes a potential IndexError when the number of images differs from the dataset length
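The kind of bug this commit describes can be reproduced in a few lines. The sketch below is an illustration only: the function names, the tuple layout, and the `lang` field are hypothetical and not taken from the repository. It shows why indexing a list of images with a range derived from a different length can raise IndexError, and how iterating over `len(images)` avoids it.

```python
def build_with_dataset_range(images, dataset_len, lang):
    """Buggy variant: the range comes from the dataset length, not the image list."""
    return [(images[i], lang) for i in range(dataset_len)]


def build_with_image_range(images, lang):
    """Fixed variant: the range always matches the number of images."""
    return [(images[i], lang) for i in range(len(images))]


images = ["page-0.png", "page-1.png"]  # two rendered images
dataset_len = 3                        # dataset reports three pages

try:
    build_with_dataset_range(images, dataset_len, "en")
    raised = False
except IndexError:
    raised = True

fixed = build_with_image_range(images, "en")
```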
-
Xiaomeng Zhao authored
refactor(footnote_detection): adjust footnote detection threshold
-
github-actions[bot] authored
-
Xiaomeng Zhao authored
refactor(footnote_detection): adjust footnote detection threshold
-
myhloli authored
- Change footnote detection threshold from 50% of page height to 30%
- Improve accuracy of footnote identification in PDF processing
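The commit message does not say how the threshold is applied. One plausible reading, assumed here purely for illustration, is that a block is treated as a footnote candidate only if its top edge falls within the bottom fraction of the page. The function name, its parameters, and the top-down y coordinate convention are hypothetical.

```python
def is_footnote_candidate(block_top_y, page_height, bottom_fraction=0.3):
    """Assumed rule: the block starts inside the bottom `bottom_fraction` of the page.

    Coordinates are assumed to grow downward, so y = 0 is the top of the page.
    """
    return block_top_y >= page_height * (1 - bottom_fraction)


page_height = 1000
# A block starting at 60% of the page height passes the old 50% rule
# but is rejected by the stricter 30% rule.
old_rule = is_footnote_candidate(600, page_height, bottom_fraction=0.5)
new_rule = is_footnote_candidate(600, page_height, bottom_fraction=0.3)
```

Under this reading, narrowing the band from 50% to 30% excludes mid-page blocks that were previously eligible.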
-
- 13 Apr, 2025 5 commits
-
-
Xiaomeng Zhao authored
master->dev
-
Xiaomeng Zhao authored
build(docker): remove requirements.txt and update package installation
-
Xiaomeng Zhao authored
build(docker): remove requirements.txt and update package installation
-
Xiaomeng Zhao authored
build(docker): remove requirements.txt and update package installation
-
myhloli authored
- Remove requirements.txt files for all Docker configurations
- Update package installation process in Dockerfiles:
  - Use magic-pdf[full] instead of individual requirements
  - Simplify installation steps and reduce image size
- Remove redundant wget commands for requirements.txt
-
- 12 Apr, 2025 13 commits
-
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.2
-
Xiaomeng Zhao authored
Dev
-
Xiaomeng Zhao authored
docs(readme): update release notes for English and Chinese README files
-
myhloli authored
- Update version history in both English and Chinese README files
- Add note about model update required for fixing word concatenation issue
- Ensure consistency between English and Chinese versions
-
Xiaomeng Zhao authored
Dev
-
Xiaomeng Zhao authored
docs(README): update version history and installation instructions
-
myhloli authored
- Update version history in README.md and README_zh-CN.md
- Add details for 1.3.2 release and previous versions
- Update Windows CUDA acceleration installation instructions
- Refactor changelog entries for better readability and organization
-
Xiaomeng Zhao authored
feat(magic_pdf): add logging for batch image processing
-
myhloli authored
- Add batch processing logs to track the progress of image analysis
- Display the current batch number, total batches, and the number of processed pages
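A progress log of the kind described here might look roughly like the sketch below. It uses the standard-library `logging` module and a hypothetical `process_in_batches` function; the message format, the names, and the choice of logger are assumptions, not the project's actual implementation.

```python
import logging
import math

logger = logging.getLogger("batch_demo")  # hypothetical logger name


def process_in_batches(pages, batch_size):
    """Walk over pages in batches and log batch number, total batches, and pages done."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    total_batches = math.ceil(len(pages) / batch_size)
    processed = 0
    messages = []
    for idx in range(total_batches):
        batch = pages[idx * batch_size:(idx + 1) * batch_size]
        # Real image analysis would run on `batch` here.
        processed += len(batch)
        msg = f"Batch {idx + 1}/{total_batches}: {processed} pages processed"
        logger.info(msg)
        messages.append(msg)
    return messages


messages = process_in_batches(list(range(5)), batch_size=2)
```

Returning the messages alongside logging them is only a convenience for inspecting the output in this sketch.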
-
Xiaomeng Zhao authored
build(setup): update package versions and constraints
-
myhloli authored
-
myhloli authored
- Update matplotlib version range to >=3.10,<4
- Add an upper version bound for ultralytics: <9
- Remove redundant version ranges for full_old_linux
-
- 11 Apr, 2025 2 commits
-
-
Xiaomeng Zhao authored
refactor(model): optimize batch processing and inference
-
myhloli authored
- Remove unnecessary line breaks and adjust indentation
- Update function call to use named arguments for better readability
- Modify _do_parse function call to use MakeMode.MM_MD instead of
-