- 03 Apr, 2025 2 commits
-
-
myhloli authored
- Update table recognition logic to process each table individually - Refactor layout detection to use tqdm for progress tracking - Optimize OCR recognition by using a single tqdm wrapper - Improve MFR prediction with a more accurate progress bar - Simplify MFD prediction by removing unnecessary total calculation
-
myhloli authored
- Add tqdm progress bar to batch prediction loops in multiple model modules - Improve logging and error handling in batch analysis script - Update table model initialization to use default sub-model if none specified - Add tqdm dependency to requirements.txt
-
- 02 Apr, 2025 1 commit
-
-
myhloli authored
- Update PyMuPDF to version <1.25.0 - Update pydantic to version <2.11 - Update transformers to version < 5.0.0 - Remove always_apply parameter from alb.ToGray in image processing
-
- 24 Mar, 2025 1 commit
-
-
myhloli authored
- Comment out margin cropping to prevent errors with broken files - Refactor image resizing to preserve aspect ratio - Update padding calculation and application using OpenCV
-
- 21 Mar, 2025 1 commit
-
-
myhloli authored
- Comment out LayoutLMv3, TableMaster, and StructEqTable models - Update MFR model path to unimernet_hf_small_2503- Remove unused import in Unimernet.py
-
- 20 Mar, 2025 1 commit
-
-
myhloli authored
- Add support for Apple M1 chips (mps device) - Refactor image processing for better performance and compatibility - Update model loading and inference for various devices - Adjust batch processing and memory management
-
- 19 Mar, 2025 1 commit
-
-
myhloli authored
- Add UnimerMBartConfig and UnimerSwinConfig classes - Implement UnimerSwinImageProcessor for image preprocessing- Create necessary __init__.py files for module structure
-
- 12 Mar, 2025 1 commit
-
-
myhloli authored
- Remove unnecessary __getitem__ method - Simplify image cropping in detect_math_formula_region - Improve code readability and efficiency
-
- 07 Mar, 2025 1 commit
-
-
myhloli authored
- Remove PIL usage across multiple files - Convert image processing functions to use NumPy arrays - Update crop_img function to work with NumPy arrays - Modify image loading and resizing to use NumPy and OpenCV - Clean up unused imports and comments related to PIL
-
- 03 Mar, 2025 1 commit
-
-
myhloli authored
- Sort detected images by area before processing to enhance MFR accuracy - Implement stable sorting to maintain original order of images with equal
-
- 21 Jan, 2025 1 commit
-
-
myhloli authored
refactor(magic_pdf): adjust VRAM allocation and MFR batch size- Update VRAM allocation logic to use 'VIRTUAL_VRAM_SIZE' environment variable - Reduce MFR (Math Formula Recognition) batch size from 64 to 32
-
- 12 Dec, 2024 1 commit
-
-
Suven authored
-
- 15 Nov, 2024 1 commit
-
-
myhloli authored
-