- 02 Apr, 2025 6 commits
-
-
myhloli authored
- Update PyMuPDF to version <1.25.0 - Update pydantic to version <2.11 - Update transformers to version <5.0.0 - Remove always_apply parameter from alb.ToGray in image processing
-
myhloli authored
- Update the default configuration path in pytorchocr_utility.py - Add required dependencies for paddleocr2pytorch in setup.py: shapely, pyclipper, omegaconf
-
myhloli authored
- Remove unused UniMERNet and LayoutLMv3 model configurations - Update OCR model path and dictionary path for PaddleOCR - Modify README to update system requirements and installation instructions - Update setup.py to include new package data
-
myhloli authored
- Add newline at the beginning of arabic_dict.txt - Change mode of multiple dictionary files
-
myhloli authored
- Remove OCR utils, modified PaddleOCR, and StructEqTable model - Delete related import statements and model definitions - Update dependencies in setup.py to remove paddlepaddle and related OCR packages
-
myhloli authored
- Comment out print statements in base_ocr_v20.py and pytorch_paddle.py - Update table model initialization to use lang parameter instead of ocr_engine - Remove unused RapidOCR initialization in rapid_table.py
-
- 01 Apr, 2025 2 commits
-
-
myhloli authored
- Remove unused OCR dictionaries for Arabic, Belarusian, Bulgarian and Armenian languages - Update model configurations in arch_config.yaml: comment out 'out_channels' for various language models; rename Arabic, Korean, Japanese, Tamil and Devanagari model configurations to use 'v3' instead of 'v4' - Delete ar_dict.txt, be_dict.txt and bg_dict.txt files - Update arabic_dict.txt to remove blank line at the start
-
myhloli authored
- Remove unused imports and code - Simplify model architecture by removing unnecessary components - Update initialization and forward pass logic - Rename variables for consistency
-
- 31 Mar, 2025 2 commits
-
-
myhloli authored
- Add support for multiple languages in OCR processing - Create separate lists for each language to improve processing efficiency - Update OCR model initialization to use PytorchPaddleOCR instead of ModifiedPaddleOCR - Modify get_ocr_result_list function to include language information - Improve logging for OCR detection and recognition
-
myhloli authored
- Split OCR process into detection and recognition stages - Update batch analysis and document analysis pipelines - Modify OCR result formatting and handling - Remove unused imports and optimize code structure
-
- 27 Mar, 2025 1 commit
-
-
myhloli authored
- Add base model structure for OCR in pytorch - Implement data augmentation and transformation modules - Create utilities for dictionary handling and state dict conversion - Include post-processing modules for OCR - Add weight initialization and loading functions
-
- 24 Mar, 2025 1 commit
-
-
myhloli authored
- Comment out margin cropping to prevent errors with broken files - Refactor image resizing to preserve aspect ratio - Update padding calculation and application using OpenCV
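The aspect-ratio-preserving resize and padding calculation mentioned in this commit can be illustrated with a minimal sketch. It only computes the geometry; the function name `fit_with_padding`, the centered padding, and the rounding rule are assumptions for illustration, not the project's actual code. The resulting values are the kind one would pass to OpenCV's `cv2.resize` (which takes the size as width, height) and `cv2.copyMakeBorder` (top, bottom, left, right).

```python
def fit_with_padding(height, width, target_height, target_width):
    """Return the resized size and per-side padding that fit an image
    into a target box without changing its aspect ratio.

    Hypothetical helper: centered padding is an assumption.
    """
    if min(height, width, target_height, target_width) <= 0:
        raise ValueError("all dimensions must be positive")

    # One scale factor for both axes keeps the aspect ratio intact.
    scale = min(target_height / height, target_width / width)

    # Clamp so rounding never exceeds the target or collapses to zero.
    new_height = max(1, min(target_height, round(height * scale)))
    new_width = max(1, min(target_width, round(width * scale)))

    # Split the leftover space between the two sides of each axis.
    pad_vertical = target_height - new_height
    pad_horizontal = target_width - new_width
    top = pad_vertical // 2
    left = pad_horizontal // 2

    return {
        "size": (new_height, new_width),
        "top": top,
        "bottom": pad_vertical - top,
        "left": left,
        "right": pad_horizontal - left,
    }
```

Using a single scale factor is what prevents the distortion that an independent per-axis resize would introduce.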
-
- 22 Mar, 2025 1 commit
-
-
myhloli authored
- Replace deprecated importlib.resources.path with importlib.resources.files - Simplify code structure and improve readability - Remove unnecessary comments and empty lines
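A minimal sketch of the `importlib.resources.files` style that replaces the deprecated `importlib.resources.path`, assuming Python 3.9 or later. The helper names below are hypothetical, and the standard-library `json` package is used only as a stand-in for a package that ships data files.

```python
from importlib.resources import as_file, files


def read_package_text(package, resource):
    """Read a text resource bundled inside an importable package."""
    # files() returns a Traversable rooted at the package.
    return files(package).joinpath(resource).read_text(encoding="utf-8")


def resource_is_file(package, resource):
    """Check a resource through a real filesystem path."""
    # as_file() yields a concrete path even when the package is zipped.
    with as_file(files(package).joinpath(resource)) as path:
        return path.is_file()


if __name__ == "__main__":
    text = read_package_text("json", "__init__.py")
    print(len(text) > 0, resource_is_file("json", "__init__.py"))
```

`as_file` is only needed when a caller requires an actual filesystem path; reading through the `Traversable` directly is usually enough.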
-
- 21 Mar, 2025 1 commit
-
-
myhloli authored
- Comment out LayoutLMv3, TableMaster, and StructEqTable models - Update MFR model path to unimernet_hf_small_2503 - Remove unused import in Unimernet.py
-
- 20 Mar, 2025 3 commits
-
-
myhloli authored
- Remove half() calls for DocLayoutYOLO and YOLOv8 models - This change prevents potential errors when running models on CPU
-
myhloli authored
- Update config version to 1.2.0 - Refactor model initialization in model_init.py - Update dependencies in requirements.txt files - Remove unused imports and models - Add conditional imports for table models
-
myhloli authored
- Add support for Apple M1 chips (mps device) - Refactor image processing for better performance and compatibility - Update model loading and inference for various devices - Adjust batch processing and memory management
-
- 19 Mar, 2025 1 commit
-
-
myhloli authored
- Add UnimerMBartConfig and UnimerSwinConfig classes - Implement UnimerSwinImageProcessor for image preprocessing - Create necessary __init__.py files for module structure
-
- 13 Mar, 2025 4 commits
- 12 Mar, 2025 1 commit
-
-
myhloli authored
- Remove unnecessary __getitem__ method - Simplify image cropping in detect_math_formula_region - Improve code readability and efficiency
-
- 10 Mar, 2025 1 commit
-
-
myhloli authored
- Remove unused @ImportPIL decorator from load_images_from_pdf function - Update image shape handling in YOLOv11.py for better compatibility. These changes improve code readability and performance without altering the original functionality.
-
- 07 Mar, 2025 2 commits
-
-
myhloli authored
- Replace PIL with cv2 for image processing - Fix issues with image cropping and resizing - Add boundary checks and error handling - Optimize code for better performance and readability
-
myhloli authored
- Remove PIL usage across multiple files - Convert image processing functions to use NumPy arrays - Update crop_img function to work with NumPy arrays - Modify image loading and resizing to use NumPy and OpenCV - Clean up unused imports and comments related to PIL
-
- 03 Mar, 2025 1 commit
-
-
myhloli authored
- Sort detected images by area before processing to enhance MFR accuracy - Implement stable sorting to maintain original order of images with equal
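A minimal sketch of a stable sort by area, assuming each detected region is an `(x0, y0, x1, y1)` box and that ascending order is wanted; both are assumptions, as are the function names. Python's `list.sort` is stable, so boxes that compare equal keep their original relative order, and keeping the original index makes it possible to map results back afterwards.

```python
def box_area(box):
    """Area of an (x0, y0, x1, y1) box; degenerate boxes count as zero."""
    x0, y0, x1, y1 = box
    return max(0, x1 - x0) * max(0, y1 - y0)


def sort_by_area(boxes):
    """Return (original_index, box) pairs ordered by ascending area.

    Hypothetical helper: the sort direction is an assumption.
    """
    indexed = list(enumerate(boxes))
    # list.sort is stable: equal-area boxes stay in input order.
    indexed.sort(key=lambda item: box_area(item[1]))
    return indexed


if __name__ == "__main__":
    boxes = [(0, 0, 4, 4), (0, 0, 2, 2), (10, 10, 12, 12), (0, 0, 3, 1)]
    print([index for index, _ in sort_by_area(boxes)])
```

Grouping similarly sized crops next to each other is one plausible reason such ordering helps batched inference, since it reduces padding inside a batch.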
-
- 23 Feb, 2025 1 commit
-
-
myhloli authored
- Add license ID information to the log for better traceability - Improve logging format to include both license ID and expiration date
-
- 21 Feb, 2025 2 commits
-
-
myhloli authored
- Add ImportError handling to silence known import-related exceptions - Improve generic exception handling to log error messages - Maintain existing specific exception handlers for license-related issues
-
myhloli authored
- Add license verification logic for Ascend plugin - Handle different license-related exceptions with appropriate error messages - Log success message with license expiration date if verification passes - Fall back to CPU model if license verification fails or plugin is not available
-
- 10 Feb, 2025 2 commits
-
-
myhloli authored
- Remove redundant imports for StructTableModel and TableMasterPaddleModel - Reorder imports to group related modules together - Update import structure for better readability and maintainability
-
myhloli authored
- Remove unused utility functions - Update import statements for better readability - Add conditional imports for Ascend plugin - Refactor table model initialization to support NPU
-
- 09 Feb, 2025 1 commit
-
-
myhloli authored
- Increase batch size from 8 to 256 for language detection inference - Add timing measurement for language detection process
-
- 21 Jan, 2025 1 commit
-
-
myhloli authored
refactor(magic_pdf): adjust VRAM allocation and MFR batch size - Update VRAM allocation logic to use 'VIRTUAL_VRAM_SIZE' environment variable - Reduce MFR (Math Formula Recognition) batch size from 64 to 32
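A minimal sketch of an environment-variable override for the VRAM figure. Only the variable name `VIRTUAL_VRAM_SIZE` comes from the commit message; the function name, the gigabyte unit, and the fallback behavior are assumptions made for illustration.

```python
import os


def resolve_vram_gb(detected_gb, env=None):
    """Return the VRAM size to plan with, preferring VIRTUAL_VRAM_SIZE.

    Hypothetical helper: the unit (GB) and fallback rules are assumptions.
    """
    env = os.environ if env is None else env
    raw = env.get("VIRTUAL_VRAM_SIZE")
    if raw is None:
        return detected_gb
    try:
        value = int(raw)
    except ValueError:
        # Ignore malformed overrides rather than failing at startup.
        return detected_gb
    return value if value > 0 else detected_gb


if __name__ == "__main__":
    print(resolve_vram_gb(8, {"VIRTUAL_VRAM_SIZE": "16"}))
```

Passing the environment as a parameter keeps the function easy to test without mutating `os.environ`.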
-
- 20 Jan, 2025 2 commits
- 17 Jan, 2025 1 commit
-
-
myhloli authored
- Import get_device function from magic_pdf.libs.config_reader - Update RapidTableModel initialization to include device parameter for Unitable model
-
- 16 Jan, 2025 1 commit
-
-
myhloli authored
- Update RapidTable dependency to version 1.0.3 - Add support for sub-models in RapidTable - Update magic-pdf configuration to include table sub-model - Modify table model initialization to support sub-models - Update table prediction logic to handle new output format
-
- 14 Jan, 2025 1 commit
-
-
myhloli authored
- Merge title blocks that are close to each other horizontally - Adjust line insertion logic for title blocks - Increase image size and decrease confidence threshold for layout detection - Update DocLayoutYOLO model weights - Refactor drawing of bounding boxes for different block types
-
- 09 Jan, 2025 1 commit
-
-
myhloli authored
- Remove conditional logic for OCR engine selection - Always use RapidOCR as the OCR engine - Simplify the __init__ method by removing unused code
-