- 01 Apr, 2025 1 commit
-
-
myhloli authored
- Added warnings module to import list - Implemented a warning catcher to ignore FutureWarning from the transformers module - This change prevents unnecessary warning messages during model inference
-
- 31 Mar, 2025 3 commits
-
-
myhloli authored
- Replace direct OCR model access with AtomModelSingleton for better model management - Round OCR scores to 2 decimal places for consistency - Improve error handling and logging in batch analysis - Simplify OCR result processing in pdf_parse_union_core_v2.py
-
myhloli authored
- Add support for multiple languages in OCR processing - Create separate lists for each language to improve processing efficiency - Update OCR model initialization to use PytorchPaddleOCR instead of ModifiedPaddleOCR - Modify get_ocr_result_list function to include language information- Improve logging for OCR detection and recognition
-
myhloli authored
- Split OCR process into detection and recognition stages - Update batch analysis and document analysis pipelines - Modify OCR result formatting and handling - Remove unused imports and optimize code structure
-
- 27 Mar, 2025 4 commits
-
-
Xiaomeng Zhao authored
-
myhloli authored
-
myhloli authored
- Add base model structure for OCR in pytorch - Implement data augmentation and transformation modules - Create utilities for dictionary handling and state dict conversion - Include post-processing modules for OCR - Add weight initialization and loading functions
-
Xiaomeng Zhao authored
feat: remove old inference code
-
- 26 Mar, 2025 3 commits
-
-
icecraft authored
-
Xiaomeng Zhao authored
feat: batch inference with ocr and lang flag
-
icecraft authored
-
- 24 Mar, 2025 10 commits
-
-
Xiaomeng Zhao authored
refactor(pdf_parse): adjust line calculation for block height
-
myhloli authored
- Remove unnecessary addition of 1 when calculating lines for block height - This change affects the logic for both potential double-column and triple-column structures
-
Xiaomeng Zhao authored
refactor(pdf_parse): adjust line calculation for block height
-
myhloli authored
- Remove unnecessary addition of 1 when calculating lines for block height - This change affects the logic for both potential double-column and triple-column structures
-
Xiaomeng Zhao authored
fix(pre_proc): improve character overlap handling in OCR processing
-
myhloli authored
- Add condition to check for identical or space characters when resolving overlaps - Skip non-conflicting character pairs to prevent unnecessary removals
-
Xiaomeng Zhao authored
fix: support auto method and auto lang
-
icecraft authored
-
Xiaomeng Zhao authored
fix(magic_pdf): improve image resizing and padding in UnimerSwinn model
-
myhloli authored
- Comment out margin cropping to prevent errors with broken files - Refactor image resizing to preserve aspect ratio - Update padding calculation and application using OpenCV
-
- 22 Mar, 2025 2 commits
-
-
Xiaomeng Zhao authored
refactor(ocr): improve ONNX model initialization and resource handling
-
myhloli authored
- Replace deprecated importlib.resources.path with importlib.resources.files - Simplify code structure and improve readability - Remove unnecessary comments and empty lines
-
- 21 Mar, 2025 4 commits
-
-
Xiaomeng Zhao authored
feat(pre_proc): add function to remove x-overlapping characters in spans
-
myhloli authored
-
myhloli authored
- Implement `remove_x_overlapping_chars` function in `ocr_span_list_modify.py` - Integrate the new function in `pdf_parse_union_core_v2.py` to process spans - Remove unnecessary character replacement functions and comments
-
myhloli authored
- Comment out LayoutLMv3, TableMaster, and StructEqTable models - Update MFR model path to unimernet_hf_small_2503- Remove unused import in Unimernet.py
-
- 20 Mar, 2025 13 commits
-
-
Xiaomeng Zhao authored
Dev push
-
myhloli authored
- Remove separate condition for GPU memory >= 24GB - Simplify logic to use a single threshold of 16GB
-
myhloli authored
- Increase batch ratio to 32 for GPU memory >= 24GB - Set batch ratio to 16 for GPU memory >= 16GB - Reduce batch ratio to 8 for GPU memory >= 12GB - Lower batch ratio to 4 for GPU memory >= 8GB - Set batch ratio to 2 for GPU memory >= 6GB - Keep batch ratio at 1 for lower GPU memory sizes
-
myhloli authored
- Add bf_16_support check for CUDA and MPS devices - Use bfloat16 precision for layoutreader model on supported devices - Improve performance on devices with bf_16 support
-
Xiaomeng Zhao authored
refactor: remove torchtext deprecation warning handling
-
myhloli authored
- Remove torchtext version check and deprecation warning handling from multiple files - This code was unnecessary and potentially caused issues when torchtext was not installed
-
myhloli authored
-
Xiaomeng Zhao authored
refactor(magic_pdf): remove unnecessary half() calls for CPU devices
-
myhloli authored
- Remove half() calls for DocLayoutYOLO and YOLOv8 models - This change prevents potential errors when running models on CPU
-
Xiaomeng Zhao authored
build(docker&setup): add ftfy package
-
myhloli authored
- Added ftfy package to requirements.txt files for Ascend NPU, China, and Global Docker images - This change ensures consistent text formatting across all supported platforms
-
myhloli authored
- Add 'ftfy' package to the list of dependencies in setup.py - This change ensures that the ftfy package is installed along with other dependencies when setting up the project
-
Xiaomeng Zhao authored
Dev push
-