1. 02 Apr, 2025 17 commits
    • myhloli's avatar
      docs: add RapidOCR and PaddleOCR2Pytorch to Acknowledgments list · 7a0b87d5
      myhloli authored
      - Add RapidOCR and PaddleOCR2Pytorch to the Acknowledgments list in README.md
      - Add RapidOCR and PaddleOCR2Pytorch to the Acknowledgments list in README_zh-CN.md
      7a0b87d5
    • myhloli's avatar
      feat(README): update changelog for version 1.3.0 release · 0eff993a
      myhloli authored
      - Installation and compatibility optimizations:
      - Replace PaddleOCR with paddleocr2torch to resolve conflicts between Paddle and PyTorch
        - Remove layoutlmv3 usage to solve compatibility issues with detectron2
        - Extend PyTorch version compatibility to2.2~2.6  - Extend CUDA compatibility to 11.8~12.6
        - Extend Python version compatibility to 3.10~3.12
      
      - Performance optimizations:
       - Support batch processing for multiple PDF files
        - Optimize mfr model loading and usage to reduce memory consumption and improve speed
        - Reduce minimum memory requirement to 6GB
        - Improve running speed on MPS devices
      
      - Parsing effect optimization:
        - Update mfr model to unimernet(2503) to fix line break issues in multi-line formulas
      0eff993a
    • myhloli's avatar
      docs(gpu): update CUDA acceleration documentation · a778645b
      myhloli authored
      - Update CUDA version requirements to12.4
      - Recommend nvidia-driver-570-server for Ubuntu
      - Remove Python version specification for conda environment
      - Update magic-pdf version requirement to 1.3.0
      - Simplify CUDA acceleration testing instructions
      - Remove OCR acceleration with paddlepaddle-gpu
      - Update torch and torchvision installation instructions for Windows
      a778645b
    • myhloli's avatar
      docs(README): update system requirements and GPU support · 298305dd
      myhloli authored
      - Update Python version requirement to 3.10-3.12
      - Expand CUDA environment options to 11.8/12.4/12.6
      - Update GPU VRAM requirement to 6GB or more
      -
      298305dd
    • myhloli's avatar
      build(deps): update package versions for linux and macos · cb3a4314
      myhloli authored
      - Update matplotlib minimum version to 3.10 for Linux and MacOS
      - Specify version ranges for PyYAML, ftfy, openai, shapely, pyclipper, and omegaconf
      - Update dill to version <1 for compatibility
      cb3a4314
    • myhloli's avatar
      build(dependencies): update PyMuPDF, pydantic and transformers · 90321855
      myhloli authored
      - Update PyMuPDF to version <1.25.0
      - Update pydantic to version <2.11
      - Update transformers to version < 5.0.0
      - Remove always_apply parameter from alb.ToGray in image processing
      90321855
    • myhloli's avatar
      feat(ocr): update OCR utility and dependencies · d09464be
      myhloli authored
      - Update the default configuration path in pytorchocr_utility.py
      - Add required dependencies for paddleocr2pytorch in setup.py:
        - shapely
        - pyclipper
        - omegaconf
      d09464be
    • myhloli's avatar
      refactor(model): update OCR model and remove unused configs · c45a706c
      myhloli authored
      - Remove unused UniMERNet and LayoutLMv3 model configurations
      - Update OCR model path and dictionary path for PaddleOCR
      - Modify README to update system requirements and installation instructions
      - Update setup.py to include new package data
      c45a706c
    • myhloli's avatar
      refactor(magic_pdf): remove unused imports and update dependencies · 243bc58c
      myhloli authored
      - Remove unused imports for concurrent.futures, multiprocessing, and paddle
      - Delete commented-out code
      - Update numpy dependency to remove upper version limit
      - Remove InferenceResult import that was commented out
      243bc58c
    • myhloli's avatar
      refactor(docker): remove unused packages and simplify Dockerfile commands · ddaa7158
      myhloli authored
      - Remove paddleocr, paddlepaddle, rapidocr-paddle, and rapidocr-onnxruntime from requirements.txt files
      - Simplify pip install commands in Dockerfiles
      - Remove installation of paddlepaddle-gpu in china and global Dockerfiles
      - Update requirements.txt files across all Docker configurations
      ddaa7158
    • myhloli's avatar
      refactor: comment out paddleocr model copying code · 3bd1e0e4
      myhloli authored
      - Commented out the code that copies the paddleocr model to user directory
      - This change affects both download_models.py and download_models_hf.py scripts
      3bd1e0e4
    • myhloli's avatar
      fix(scripts): update model download scripts for OCR · 5237a385
      myhloli authored
      - Update download_models.py and download_models_hf.py scripts
      - Change OCR model path from paddleocr to paddleocr_torch
      5237a385
    • myhloli's avatar
      chore: update dictionary files · 3b5d3fc8
      myhloli authored
      - Add newline at the beginning of arabic_dict.txt
      - Change mode of multiple dictionary files
      3b5d3fc8
    • myhloli's avatar
      refactor(model): remove unused OCR and table models · d8ebd92f
      myhloli authored
      - Remove OCR utils, modified PaddleOCR, and StructEqTable model
      - Delete related import statements and model definitions
      - Update dependencies in setup.py to remove paddlepaddle and related OCR packages
      d8ebd92f
    • myhloli's avatar
      refactor(ocr): comment out print statements and update table model initialization · 5252c46e
      myhloli authored
      - Comment out print statements in base_ocr_v20.py and pytorch_paddle.py
      - Update table model initialization to use lang parameter instead of ocr_engine
      - Remove unused RapidOCR initialization in rapid_table.py
      5252c46e
    • myhloli's avatar
      9b3339f1
    • myhloli's avatar
      feat(ocr): implement dynamic OCR processing for text spans with low contrast · a024c30f
      myhloli authored
      - Comment out OCR model initialization and execution for low-contrast spans
      - Add batch OCR processing for collected image spans
      - Adjust contrast threshold for OCR processing
      - Remove unnecessary OCR processing for high-contrast spans
      - Implement more efficient OCR workflow by processing multiple spans at once
      a024c30f
  2. 01 Apr, 2025 5 commits
  3. 31 Mar, 2025 3 commits
    • myhloli's avatar
      refactor(model): integrate AtomModelSingleton for OCR and improve OCR result handling · 59d6b195
      myhloli authored
      - Replace direct OCR model access with AtomModelSingleton for better model management
      - Round OCR scores to 2 decimal places for consistency
      - Improve error handling and logging in batch analysis
      - Simplify OCR result processing in pdf_parse_union_core_v2.py
      59d6b195
    • myhloli's avatar
      feat(ocr): implement language-specific OCR processing · d7d85a28
      myhloli authored
      - Add support for multiple languages in OCR processing
      - Create separate lists for each language to improve processing efficiency
      - Update OCR model initialization to use PytorchPaddleOCR instead of ModifiedPaddleOCR
      - Modify get_ocr_result_list function to include language information- Improve logging for OCR detection and recognition
      d7d85a28
    • myhloli's avatar
      feat(ocr): implement separate detection and recognition processes · a330651d
      myhloli authored
      - Split OCR process into detection and recognition stages
      - Update batch analysis and document analysis pipelines
      - Modify OCR result formatting and handling
      - Remove unused imports and optimize code structure
      a330651d
  4. 27 Mar, 2025 4 commits
  5. 26 Mar, 2025 3 commits
  6. 24 Mar, 2025 8 commits