1. 10 Jun, 2025 3 commits
  2. 03 Jun, 2025 1 commit
  3. 09 May, 2025 1 commit
  4. 14 Apr, 2025 1 commit
  5. 12 Apr, 2025 1 commit
  6. 08 Apr, 2025 5 commits
  7. 02 Apr, 2025 4 commits
    • build(deps): update package versions for linux and macos · cb3a4314
      myhloli authored
      - Update matplotlib minimum version to 3.10 for Linux and MacOS
      - Specify version ranges for PyYAML, ftfy, openai, shapely, pyclipper, and omegaconf
      - Update dill to version <1 for compatibility
      cb3a4314
    • feat(ocr): update OCR utility and dependencies · d09464be
      myhloli authored
      - Update the default configuration path in pytorchocr_utility.py
      - Add required dependencies for paddleocr2pytorch in setup.py:
        - shapely
        - pyclipper
        - omegaconf
      d09464be
    • refactor(model): update OCR model and remove unused configs · c45a706c
      myhloli authored
      - Remove unused UniMERNet and LayoutLMv3 model configurations
      - Update OCR model path and dictionary path for PaddleOCR
      - Modify README to update system requirements and installation instructions
      - Update setup.py to include new package data
      c45a706c
    • refactor(model): remove unused OCR and table models · d8ebd92f
      myhloli authored
      - Remove OCR utils, modified PaddleOCR, and StructEqTable model
      - Delete related import statements and model definitions
      - Update dependencies in setup.py to remove paddlepaddle and related OCR packages
      d8ebd92f
  8. 20 Mar, 2025 3 commits
  9. 13 Feb, 2025 2 commits
  10. 16 Jan, 2025 1 commit
    • feat(table): upgrade RapidTable to 1.0.3 and add sub-model support · 79c8a5c8
      myhloli authored
      - Update RapidTable dependency to version 1.0.3
      - Add support for sub-models in RapidTable
      - Update magic-pdf configuration to include table sub-model
      - Modify table model initialization to support sub-models
      - Update table prediction logic to handle new output format
      79c8a5c8
  11. 14 Jan, 2025 1 commit
    • feat(layout): improve title block handling and layout detection · c20e9a1e
      myhloli authored
      - Merge title blocks that are close to each other horizontally
      - Adjust line insertion logic for title blocks
      - Increase image size and decrease confidence threshold for layout detection
      - Update DocLayoutYOLO model weights
      - Refactor drawing of bounding boxes for different block types
      c20e9a1e
  12. 09 Jan, 2025 1 commit
  13. 03 Jan, 2025 1 commit
    • feat(model): add onnxruntime support for paddleocr on cpu · 512adb67
      myhloli authored
      - Implement ONNXModelSingleton to manage ONNX models
      - Modify ModifiedPaddleOCR to use ONNX models on ARM CPUs without CUDA
      - Update RapidTableModel to use RapidOCR with ONNXRuntime on CPU
      - Add rapidocr_onnxruntime dependency in setup.py
      512adb67
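The commit above names an ONNXModelSingleton but does not show its code. As a rough illustration only, the following sketch shows one common way a singleton model cache can be written. The method name `get_model`, the `loader` callable, and the cache layout are assumptions made for this example, not the project's actual implementation.

```python
import threading


class ONNXModelSingleton:
    """Illustrative process-wide cache so each model is loaded at most once."""

    _instance = None
    _lock = threading.Lock()

    def __new__(cls):
        # Create the shared instance lazily; the lock guards concurrent first calls.
        with cls._lock:
            if cls._instance is None:
                cls._instance = super().__new__(cls)
                cls._instance._models = {}
        return cls._instance

    def get_model(self, key, loader):
        # `loader` is a hypothetical zero-argument callable that builds the model,
        # for example a function that opens an ONNX inference session.
        if key not in self._models:
            self._models[key] = loader()
        return self._models[key]
```

Every call to `ONNXModelSingleton()` returns the same object, so repeated `get_model` calls with the same key reuse the already-loaded model instead of loading it again.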
  14. 26 Dec, 2024 1 commit
  15. 25 Dec, 2024 1 commit
    • feat(llm_aided): add title optimization feature · 0a468eca
      myhloli authored
      - Implement llm_aided_title function to optimize document titles using LLM
      - Update pdf_parse_union_core_v2.py to include title optimization
      - Modify ocr_mkcontent.py to use optimized title levels
      - Add openai SDK dependency in setup.py
      0a468eca
  16. 11 Dec, 2024 1 commit
    • build(deps): update torch and torchvision version requirements · 9a96362d
      myhloli authored
      - Specify torch==2.3.1 and torchvision==0.18.1 for Windows CUDA installation
      - Add torch and torchvision version constraints in setup.py:
        - torch>=2.2.2,<=2.3.1
        - torchvision>=0.17.2,<=0.18.1
      - Update installation instructions in both English and Chinese README files
      9a96362d
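The two version ranges in this commit can be written as requirement strings, as in the minimal sketch below. Only the ranges themselves come from the commit message. The list name `INSTALL_REQUIRES` and the helpers `parse_version` and `in_range` are hypothetical, and the helpers handle plain numeric versions only (not local tags such as `+cu118`).

```python
# Hypothetical excerpt; only the two version ranges come from the commit message.
INSTALL_REQUIRES = [
    "torch>=2.2.2,<=2.3.1",
    "torchvision>=0.17.2,<=0.18.1",
]


def parse_version(text):
    """Turn '2.3.1' into (2, 3, 1) for simple numeric comparison."""
    return tuple(int(part) for part in text.split("."))


def in_range(version, lower, upper):
    """Return True if lower <= version <= upper (both bounds inclusive)."""
    return parse_version(lower) <= parse_version(version) <= parse_version(upper)
```

For instance, `in_range("2.3.1", "2.2.2", "2.3.1")` is true because the upper bound is inclusive, while a later release such as 2.4.0 falls outside the range.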
  17. 09 Dec, 2024 2 commits
  18. 06 Dec, 2024 1 commit
  19. 18 Nov, 2024 1 commit
    • build(setup): add old_linux specific dependencies · d0f633e2
      myhloli authored
      - Add albumentations package with version <=1.4.20 for old_linux
      - This version is compatible with Linux systems from 2019 and earlier
      - Version 1.4.21 and above introduced simsimd which is not supported on older Linux systems
      d0f633e2
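A platform-specific pin like this is typically expressed as a setuptools extra. The sketch below assumes the extra is named `old_linux`, as the commit title suggests. The dictionary name `EXTRAS_REQUIRE` and the helper `requirements_for` are illustrative and not taken from the project.

```python
# Hypothetical excerpt; the pin itself is the one stated in the commit message.
EXTRAS_REQUIRE = {
    # albumentations 1.4.21 and above pull in simsimd, which the commit
    # reports as unsupported on older Linux systems.
    "old_linux": ["albumentations<=1.4.20"],
}


def requirements_for(extra):
    """Return the extra's requirement strings, or an empty list if unknown."""
    return list(EXTRAS_REQUIRE.get(extra, []))
```

With a mapping like this passed to `setup(extras_require=...)`, users on older systems would opt in by installing the package with the `[old_linux]` extra.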
  20. 15 Nov, 2024 1 commit
  21. 08 Nov, 2024 2 commits
    • feat(table): add RapidOCR support for RapidTable model · fe2c2c0d
      myhloli authored
      - Integrate RapidOCR with RapidTable model for table recognition
      - Improve memory management for devices with <= 8GB VRAM
      - Update table recognition process to use RapidOCR for RapidTable
      - Add rapidocr-paddle dependency in setup.py
      fe2c2c0d
    • feat(table): integrate RapidTable model for table recognition · 240fe99e
      myhloli authored
      - Add RapidTable model support for table recognition
      - Update table model configuration and initialization
      - Modify table recognition process to use RapidTable when specified
      - Add RapidTable dependency to setup.py
      240fe99e
  22. 04 Nov, 2024 1 commit
  23. 23 Oct, 2024 1 commit
  24. 10 Sep, 2024 2 commits
    • Update setup.py · 20212a37
      Xiaomeng Zhao authored
      update UniMERNet to 0.2.1
      20212a37
    • refactor(pdf_extract_kit): update model config and weight paths for UniMERNet-0.2.0 · 3e9bc7a4
      myhloli authored
      Update the paths to model weights and configuration files for the UniMERNet architecture
      in both the demo.yaml and model_configs.yaml files. Adjust the mfr_model_init function to
      reflect the new weight and configuration paths. The changes include specifying more detailed
      paths to the unimernet_base directory and changing the weight file extension to .pth.
      3e9bc7a4
  25. 04 Aug, 2024 1 commit
    • fix(setup): allow latest matplotlib versions on non-Windows platforms · 25213909
      myhloli authored
      The restriction on the matplotlib version has been updated to only apply on Windows
      platforms, where precompiled packages are not available starting from version 3.9.1.
      This change enables users on Linux and macOS to install newer versions of matplotlib,
      addressing compatibility issues with recent bug fixes.
      25213909
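A Windows-only version cap of this kind can be expressed with PEP 508 environment markers. The sketch below is an assumption about how such requirement strings might look. The `3.9.0` cap is inferred from the statement that precompiled packages are missing from 3.9.1 onward, and the function name `matplotlib_requirements` is hypothetical.

```python
def matplotlib_requirements(windows_cap="3.9.0"):
    """Build requirement strings that cap matplotlib only on Windows.

    The default cap is an assumption inferred from the commit message,
    not a value confirmed by the project's setup.py.
    """
    return [
        # Applies only when pip evaluates the marker on Windows.
        f"matplotlib<={windows_cap}; platform_system == 'Windows'",
        # Everywhere else (Linux, macOS) the version is left unconstrained.
        "matplotlib; platform_system != 'Windows'",
    ]
```

pip evaluates the marker after the semicolon at install time, so a single requirements list can serve all platforms.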