- 20 Mar, 2025 1 commit
-
-
myhloli authored
- Add support for Apple M1 chips (mps device) - Refactor image processing for better performance and compatibility - Update model loading and inference for various devices - Adjust batch processing and memory management
-
- 19 Mar, 2025 1 commit
-
-
myhloli authored
- Add UnimerMBartConfig and UnimerSwinConfig classes - Implement UnimerSwinImageProcessor for image preprocessing - Create necessary __init__.py files for module structure
-
- 13 Mar, 2025 4 commits
- 12 Mar, 2025 1 commit
-
-
myhloli authored
- Remove unnecessary __getitem__ method - Simplify image cropping in detect_math_formula_region - Improve code readability and efficiency
-
- 10 Mar, 2025 1 commit
-
-
myhloli authored
- Remove unused @ImportPIL decorator from load_images_from_pdf function - Update image shape handling in YOLOv11.py for better compatibility. These changes improve code readability and performance without altering the original functionality.
-
- 07 Mar, 2025 2 commits
-
-
myhloli authored
- Replace PIL with cv2 for image processing - Fix issues with image cropping and resizing - Add boundary checks and error handling - Optimize code for better performance and readability
-
myhloli authored
- Remove PIL usage across multiple files - Convert image processing functions to use NumPy arrays - Update crop_img function to work with NumPy arrays - Modify image loading and resizing to use NumPy and OpenCV - Clean up unused imports and comments related to PIL
-
- 03 Mar, 2025 1 commit
-
-
myhloli authored
- Sort detected images by area before processing to enhance MFR accuracy - Implement stable sorting to maintain original order of images with equal
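A minimal sketch of area-based stable sorting, not the project's actual code. The `(x0, y0, x1, y1)` box layout, the helper names, and the ascending order are assumptions made for illustration; the commit message does not state them. Python's built-in `sorted()` is guaranteed stable, so entries with the same key keep their input order.

```python
from typing import List, Tuple

# Hypothetical box layout: (x0, y0, x1, y1) in pixels.
Box = Tuple[float, float, float, float]


def box_area(box: Box) -> float:
    """Return the area of a box, treating inverted boxes as empty."""
    x0, y0, x1, y1 = box
    return max(0.0, x1 - x0) * max(0.0, y1 - y0)


def sort_by_area(boxes: List[Box]) -> List[Box]:
    """Sort boxes by area (ascending is an assumption).

    sorted() is stable, so boxes whose areas tie stay in their
    original relative order.
    """
    return sorted(boxes, key=box_area)


boxes = [(0, 0, 4, 2), (0, 0, 2, 2), (5, 5, 7, 9), (1, 1, 3, 3)]
ordered = sort_by_area(boxes)
```

Passing `reverse=True` to `sorted()` would give largest-first order while still preserving the input order of ties.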
-
- 23 Feb, 2025 1 commit
-
-
myhloli authored
- Add license ID information to the log for better traceability - Improve logging format to include both license ID and expiration date
-
- 21 Feb, 2025 2 commits
-
-
myhloli authored
- Add ImportError handling to silence known import-related exceptions - Improve generic exception handling to log error messages - Maintain existing specific exception handlers for license-related issues
-
myhloli authored
- Add license verification logic for Ascend plugin - Handle different license-related exceptions with appropriate error messages - Log success message with license expiration date if verification passes - Fall back to CPU model if license verification fails or plugin is not available
-
- 10 Feb, 2025 2 commits
-
-
myhloli authored
- Remove redundant imports for StructTableModel and TableMasterPaddleModel - Reorder imports to group related modules together - Update import structure for better readability and maintainability
-
myhloli authored
- Remove unused utility functions - Update import statements for better readability - Add conditional imports for Ascend plugin - Refactor table model initialization to support NPU
-
- 09 Feb, 2025 1 commit
-
-
myhloli authored
- Increase batch size from 8 to 256 for language detection inference - Add timing measurement for language detection process
-
- 21 Jan, 2025 1 commit
-
-
myhloli authored
refactor(magic_pdf): adjust VRAM allocation and MFR batch size - Update VRAM allocation logic to use 'VIRTUAL_VRAM_SIZE' environment variable - Reduce MFR (Math Formula Recognition) batch size from 64 to 32
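A rough sketch of how an environment variable can override a detected VRAM value. Only the variable name `VIRTUAL_VRAM_SIZE` comes from the commit message; the function name `get_vram_size`, the integer parsing, the unit, and the fallback behaviour are assumptions, not the project's implementation.

```python
import os
from typing import Optional


def get_vram_size(detected: Optional[int] = None) -> Optional[int]:
    """Return the VRAM size to plan around.

    Hypothetical helper: if VIRTUAL_VRAM_SIZE is set to a valid
    integer it wins; otherwise the detected value is returned.
    """
    raw = os.getenv("VIRTUAL_VRAM_SIZE")
    if raw is not None:
        try:
            return int(raw)
        except ValueError:
            # Ignore a malformed override and fall back (assumed behaviour).
            pass
    return detected
```

Under these assumptions, running with `VIRTUAL_VRAM_SIZE=16` in the environment would make `get_vram_size(8)` return 16.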
-
- 20 Jan, 2025 2 commits
- 17 Jan, 2025 1 commit
-
-
myhloli authored
- Import get_device function from magic_pdf.libs.config_reader - Update RapidTableModel initialization to include device parameter for Unitable model
-
- 16 Jan, 2025 1 commit
-
-
myhloli authored
- Update RapidTable dependency to version 1.0.3 - Add support for sub-models in RapidTable - Update magic-pdf configuration to include table sub-model - Modify table model initialization to support sub-models - Update table prediction logic to handle new output format
-
- 14 Jan, 2025 1 commit
-
-
myhloli authored
- Merge title blocks that are close to each other horizontally - Adjust line insertion logic for title blocks - Increase image size and decrease confidence threshold for layout detection - Update DocLayoutYOLO model weights - Refactor drawing of bounding boxes for different block types
-
- 09 Jan, 2025 3 commits
-
-
myhloli authored
- Remove conditional logic for OCR engine selection - Always use RapidOCR as the OCR engine - Simplify the __init__ method by removing unused code
-
myhloli authored
- Remove YOLO v11 language detection model from model_configs.yaml - Update language detection utils to use a fixed model path instead of dynamic configuration - Remove unused model weight parameter for YOLO v11 language detection
-
myhloli authored
- Remove LangDetectMode and related conditional logic - Use a single model weight for language detection - Add logging for language detection results - Update model initialization and prediction methods
-
- 08 Jan, 2025 2 commits
-
-
myhloli authored
- Add language detection model initialization and integration - Update model list to include language detection - Refactor language detection utils for better model management
-
myhloli authored
- Add separate models for Chinese/Japanese and English/French/German detection - Implement mode-based detection to use appropriate models for different languages - Update language detection process to use higher DPI for better accuracy - Modify model initialization and prediction logic to support new language-specific models
-
- 06 Jan, 2025 1 commit
-
-
myhloli authored
- Add check for empty OCR result when using PaddleOCR model - Assign None to ocr_result if no text is detected, preventing further errors
-
- 05 Jan, 2025 1 commit
-
-
myhloli authored
- Add missing 'else' statement in OCR model selection logic - Ensure consistent formatting of 'if' statements for better readability - Remove unnecessary empty line in the 'app.py' file
-
- 03 Jan, 2025 2 commits
-
-
myhloli authored
- Remove logger.info() call for additional_ocr_params to reduce log verbosity
-
myhloli authored
- Implement ONNXModelSingleton to manage ONNX models - Modify ModifiedPaddleOCR to use ONNX models on ARM CPUs without CUDA - Update RapidTableModel to use RapidOCR with ONNXRuntime on CPU - Add rapidocr_onnxruntime dependency in setup.py
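The singleton idea named in this commit can be sketched generically as below. This is not the project's `ONNXModelSingleton`: the class name `ModelSingleton`, the `get_model` method, and the `loader` callback are hypothetical. The loader stands in for whatever actually builds the model, so the sketch needs no ONNX runtime.

```python
from typing import Any, Callable, Dict, Hashable


class ModelSingleton:
    """Process-wide cache so each model is built at most once (sketch)."""

    _instance = None
    _models: Dict[Hashable, Any] = {}

    def __new__(cls):
        # Always hand back the same instance.
        if cls._instance is None:
            cls._instance = super().__new__(cls)
        return cls._instance

    def get_model(self, key: Hashable, loader: Callable[[], Any]) -> Any:
        """Return the cached model for `key`, calling `loader` on first use."""
        if key not in self._models:
            self._models[key] = loader()
        return self._models[key]


load_count = {"n": 0}


def fake_loader() -> object:
    # Stand-in for an expensive model load.
    load_count["n"] += 1
    return object()


first = ModelSingleton().get_model("det", fake_loader)
second = ModelSingleton().get_model("det", fake_loader)
```

Keying the cache on the model configuration (a name or path, for example) means different models can coexist while repeated requests reuse the loaded one.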
-
- 30 Dec, 2024 2 commits
-
-
myhloli authored
- Remove use_npu variable initialization - Comment out device assignment and npu check - Comment out use_npu parameter in ModifiedPaddleOCR constructor
-
myhloli authored
- Update `clean_memory.py` to use `torch_npu.npu` instead of `torch.npu` - Update `model_utils.py` to use `torch_npu.npu` instead of `torch.npu` - Simplify NPU availability check and bfloat16 support in `pdf_parse_union_core_v2.py`
-
- 26 Dec, 2024 2 commits
-
-
myhloli authored
- Update clean_memory function to support both CUDA and NPU devices - Implement get_device function to centralize device selection logic - Modify model initialization and memory cleaning to use the selected device - Update RapidTableModel to support both RapidOCR and PaddleOCR engines
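One way to centralize device selection is sketched below, under the assumption that the choice is made by probing the runtime. The function name `pick_device` and the probing order are illustrative only; the project's own `get_device` may work differently (for instance by reading configuration). `torch_npu` is the Ascend plugin mentioned in later commits, and the sketch falls back to CPU whenever a backend is missing.

```python
def pick_device() -> str:
    """Return 'cuda', 'npu', or 'cpu' (hypothetical selection order)."""
    try:
        import torch
    except ImportError:
        # No PyTorch installed: nothing but CPU is possible.
        return "cpu"

    if torch.cuda.is_available():
        return "cuda"

    try:
        # Optional Ascend plugin; may be absent or fail to initialize.
        import torch_npu

        if torch_npu.npu.is_available():
            return "npu"
    except Exception:
        pass

    return "cpu"


device = pick_device()
```

Callers such as model initialization and memory cleanup can then branch on the returned string instead of repeating the availability checks.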
-
myhloli authored
- Add NPU support for memory cleaning and model initialization - Optimize table model initialization and prediction process - Update memory utils to support NPU - Add language parameter for table model
-
- 17 Dec, 2024 1 commit
-
-
myhloli authored
- Add YOLOv11 language detection model for PDF documents - Implement language detection in PymuDocDataset - Update app.py to include 'auto' language option - Create language detection utilities and constants
-
- 16 Dec, 2024 1 commit
-
-
myhloli authored
- Remove YOLO_VERBOSE environment variable from multiple files - Set verbose=False in YOLOv8 prediction method to suppress logger output
-
- 12 Dec, 2024 2 commits