- 17 Apr, 2025 6 commits
-
-
dt-yy authored
-
Xiaomeng Zhao authored
master->dev
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.5
-
Xiaomeng Zhao authored
refactor(office_to_pdf): simplify font checking and add logging
-
myhloli authored
- Remove specific Chinese font list and detailed font checking - Add logging warning if no Chinese fonts are detected - Make font checking more robust and less platform-specific
-
- 16 Apr, 2025 14 commits
-
-
myhloli authored
- Temporarily disable Chinese font check for Windows systems - This change allows bypassing the font check when the required fonts are not present
-
myhloli authored
- Update magic-pdf installation to include specific version with full dependencies - Add numpy, decorator, attrs, absl-py, cloudpickle, ml-dtypes, tornado, and einops as separate packages - Specify numpy version to be less than 2
-
Xiaomeng Zhao authored
fix(test_table): update image path to use relative path
-
myhloli authored
- Replace hardcoded image path with dynamic path generation - Use os.path.join to create platform-independent file paths - Improve code maintainability and portability across different environments
-
Xiaomeng Zhao authored
master->dev
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.4
-
Xiaomeng Zhao authored
Dev
-
Xiaomeng Zhao authored
docs(README): update changelog for v1.3.4 release
-
myhloli authored
- Update README.md and README_zh-CN.md with the latest changes - Add new release notes for version 1.3.4 - Include improvements in OCR detection speed and page-level sorting
-
Xiaomeng Zhao authored
feat(pdf_parse): add footnote block handling in layout split
-
myhloli authored
- Modify `ocr_detect_all_bboxes.py` to return footnote blocks - Update `pdf_parse_union_core_v2.py` to handle footnote blocks in line sorting and layout splitting - This change improves the accuracy of layout analysis by considering footnote blocks separately
-
Xiaomeng Zhao authored
test(table): update unit test to use RapidTable model
-
myhloli authored
- Rename test file from test_tablemaster.py to test_rapidtable.py - Replace TableMasterPaddleModel with RapidTableModel - Update test case to use new model and adjust assertions accordingly - Remove some outdated assertions and comments
-
- 15 Apr, 2025 9 commits
-
-
Xiaomeng Zhao authored
feat(model): add text region handling and improve overlap resolution
-
Xiaomeng Zhao authored
feat(model): add text region handling and improve overlap resolution
-
myhloli authored
- Add text region handling in get_res_list_from_layout_res function - Implement remove_overlaps_min_blocks function to handle overlapping blocks - Update OCR region handling to include text regions - Improve overlap resolution for all regions in layout results
-
Xiaomeng Zhao authored
fix:Chinese Character Garbling in PPTX/DOCX Conversion by Adding Font Check and Installation
-
Doge2077 authored
-
Xiaomeng Zhao authored
build(docker): add torch and torchvision installation
-
Xiaomeng Zhao authored
build(docker): add torch and torchvision installation
-
Xiaomeng Zhao authored
build(docker): add torch and torchvision installation
-
myhloli authored
- Add pip install command for torch and torchvision - Specify version2.3.1 for both packages - Use Aliyun mirror for faster download
-
- 14 Apr, 2025 11 commits
-
-
Doge2077 authored
-
Xiaomeng Zhao authored
Dev
-
Xiaomeng Zhao authored
master->dev
-
github-actions[bot] authored
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.3
-
Xiaomeng Zhao authored
build(deps): downgrade dill to 0.3.8 for doclayout_yolo compatibility
-
Xiaomeng Zhao authored
build(deps): downgrade dill to 0.3.8 for doclayout_yolo compatibility
-
myhloli authored
- Change dill dependency from >=0.3.9,<1 to >=0.3.8,<1 - Update dill version in both general and specific requirements
-
Doge2077 authored
-
Xiaomeng Zhao authored
fix(magic_pdf): correct range for images in document analysis
-