- 09 Apr, 2025 9 commits
-
-
myhloli authored
- Comment out the line that updates det_count in batch_analyze.py - Add a new OCR model configuration for Chinese (ch_lite) in models_config.yml- Update the Chinese OCR model configuration to use a different recognition model
-
myhloli authored
- Change `bits` to `self._data_bits` for language detection - This fixes the TypeError when opening PDF files
-
myhloli authored
- Simplify aspect ratio calculation using direct coordinate subtraction - Remove unnecessary list append operation - Improve code readability and performance in table rotation detection
-
myhloli authored
- Implement table orientation detection to identify if a table is in portrait mode - Add rotation logic to turn portrait tables 90 degrees clockwise before OCR - Update OCR processing to work with potentially rotated images - Improve text box analysis to determine if a table is rotated
-
myhloli authored
- Update predict_rec.py to check for NaN values in recognition results - Replace NaN scores with 0.0 to ensure stability and consistency
-
Xiaomeng Zhao authored
fix: support page range
-
Xiaomeng Zhao authored
feat(model): improve table recognition by merging and filtering tables
-
myhloli authored
- Add functions to calculate IoU, check if tables are inside each other, and merge tables - Implement table merging for high IoU tables - Add filtering to remove nested tables that don't overlap but cover a large area - Update table_res_list and layout_res to reflect these changes
-
icecraft authored
-
- 08 Apr, 2025 28 commits
-
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
docs: update version number in README files
-
myhloli authored
- Correct version number from 1.3.2 to 1.3.1 in both README.md and README_zh-CN.md - Update changelog entries for the latest release
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
Dev
-
Xiaomeng Zhao authored
docs(README): update version number and changelog in README files
-
myhloli authored
- Update version number from 1.3.1 to 1.3.2
-
Xiaomeng Zhao authored
refactor(ocr): improve OCR score precision to three decimal places
-
myhloli authored
- Update OCR score formatting in batch_analyze.py and pdf_parse_union_core_v2.py - Change score rounding method to preserve three decimal places - Enhance accuracy representation without significantly altering the score value
-
myhloli authored
- Update version from1.3.1 to 1.3.2 in both English and Chinese README files - Keep other content unchanged
-
Xiaomeng Zhao authored
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.1
-
Xiaomeng Zhao authored
Dev
-
Xiaomeng Zhao authored
docs: update badges and project URLs- Update PyPI version badge to us…
-
myhloli authored
- Add project URLs in setup.py for better discoverability - Make consistent changes across README.md and README_zh-CN.md
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
fix(table): add model path for slanet-plus to resolve RapidTableError
-
Xiaomeng Zhao authored
fix(table): add model path for slanet-plus to resolve RapidTableError
-
myhloli authored
- Import os and pathlib modules to handle file paths - Define the path to the slanet-plus model - Update RapidTableInput initialization to include the model path
-
Xiaomeng Zhao authored
update 1.3.1
-
myhloli authored
- Add support for Python3.11, 3.12, and 3.13 - Replace requirements.txt based installation with editable install
-
myhloli authored
- Update Python version requirements to >=3.10 - Simplify torch installation command- Remove numpy version restriction - Update CUDA compatibility information - Adjust environment creation commands across multiple documentation files
-
myhloli authored
- Remove rapid_table from install_requires in setup.py
-
myhloli authored
- Add "3.13" option for Python version - Remove "3.9" option for Python version - Update dependency version options: - Remove "0.8.x", "0.9.x", "0.10.x" - Add "1.1.x", "1.2.x", "1.3.x"
-
myhloli authored
- Update transformers to exclude version 4.51.0 due to compatibility issues- Rapid table version range expanded to >=1.0.5,<2.0.0 - Add separate 'full_old_linux' extras_require for better support of older Linux systems - Update matplotlib version requirements for different platforms - Remove platform-specific paddlepaddle versions,
-
myhloli authored
- Add rapid_table==1.0.3 to old_linux specific dependencies - This version is compatible with Linux systems from 2019 and earlier - Newer versions of rapid_table depend on onnxruntime, which is not supported on older Linux systems
-
- 07 Apr, 2025 3 commits
-
-
Xiaomeng Zhao authored
fix(model): improve VRAM detection and handling
-
myhloli authored
- Refactor VRAM detection logic for better readability and efficiency - Add fallback mechanism for unknown VRAM sizes - Improve device checking in get_vram function
-
Xiaomeng Zhao authored
fix: image dataset add lang field
-