1. 02 Apr, 2025 3 commits
    • myhloli's avatar
      feat(ocr): update OCR utility and dependencies · d09464be
      myhloli authored
      - Update the default configuration path in pytorchocr_utility.py
      - Add required dependencies for paddleocr2pytorch in setup.py:
        - shapely
        - pyclipper
        - omegaconf
      d09464be
    • myhloli's avatar
      refactor(model): update OCR model and remove unused configs · c45a706c
      myhloli authored
      - Remove unused UniMERNet and LayoutLMv3 model configurations
      - Update OCR model path and dictionary path for PaddleOCR
      - Modify README to update system requirements and installation instructions
      - Update setup.py to include new package data
      c45a706c
    • myhloli's avatar
      refactor(model): remove unused OCR and table models · d8ebd92f
      myhloli authored
      - Remove OCR utils, modified PaddleOCR, and StructEqTable model
      - Delete related import statements and model definitions
      - Update dependencies in setup.py to remove paddlepaddle and related OCR packages
      d8ebd92f
  2. 20 Mar, 2025 3 commits
  3. 13 Feb, 2025 2 commits
  4. 16 Jan, 2025 1 commit
    • myhloli's avatar
      feat(table): upgrade RapidTable to1.0.3 and add sub-model support · 79c8a5c8
      myhloli authored
      - Update RapidTable dependency to version 1.0.3
      - Add support for sub-models in RapidTable
      - Update magic-pdf configuration to include table sub-model
      - Modify table model initialization to support sub-models
      - Update table prediction logic to handle new output format
      79c8a5c8
  5. 14 Jan, 2025 1 commit
    • myhloli's avatar
      feat(layout): improve title block handling and layout detection · c20e9a1e
      myhloli authored
      - Merge title blocks that are close to each other horizontally
      - Adjust line insertion logic for title blocks- Increase image size and decrease confidence threshold for layout detection
      - Update DocLayoutYOLO model weights
      - Refactor drawing of bounding boxes for different block types
      c20e9a1e
  6. 09 Jan, 2025 1 commit
  7. 03 Jan, 2025 1 commit
    • myhloli's avatar
      feat(model): add onnxruntime support for paddleocr on cpu · 512adb67
      myhloli authored
      - Implement ONNXModelSingleton to manage ONNX models
      - Modify ModifiedPaddleOCR to use ONNX models on ARM CPUs without CUDA
      - Update RapidTableModel to use RapidOCR with ONNXRuntime on CPU
      - Add rapidocr_onnxruntime dependency in setup.py
      512adb67
  8. 26 Dec, 2024 1 commit
  9. 25 Dec, 2024 1 commit
    • myhloli's avatar
      feat(llm_aided): add title optimization feature · 0a468eca
      myhloli authored
      - Implement llm_aided_title function to optimize document titles using LLM
      - Update pdf_parse_union_core_v2.py to include title optimization
      - Modify ocr_mkcontent.py to use optimized title levels- Add openai SDK dependency in setup.py
      0a468eca
  10. 11 Dec, 2024 1 commit
    • myhloli's avatar
      build(deps): update torch and torchvision version requirements · 9a96362d
      myhloli authored
      - Specify torch==2.3.1 and torchvision==0.18.1 for Windows CUDA installation
      - Add torch and torchvision version constraints in setup.py:
        - torch>=2.2.2,<=2.3.1
        - torchvision>=0.17.2,<=0.18.1
      - Update installation instructions in both English and Chinese README files
      9a96362d
  11. 09 Dec, 2024 2 commits
  12. 06 Dec, 2024 1 commit
  13. 18 Nov, 2024 1 commit
    • myhloli's avatar
      build(setup): add old_linux specific dependencies · d0f633e2
      myhloli authored
      - Add albumentations package with version <=1.4.20 for old_linux
      - This version is compatible with Linux systems from 2019 and earlier
      - Version 1.4.21 and above introduced simsimd which is not supported on older Linux systems
      d0f633e2
  14. 15 Nov, 2024 1 commit
  15. 08 Nov, 2024 2 commits
    • myhloli's avatar
      feat(table): add RapidOCR support for RapidTable model · fe2c2c0d
      myhloli authored
      - Integrate RapidOCR with RapidTable model for table recognition
      - Improve memory management for devices with <= 8GB VRAM
      - Update table recognition process to use RapidOCR for RapidTable
      - Add rapidocr-paddle dependency in setup.py
      fe2c2c0d
    • myhloli's avatar
      feat(table): integrate RapidTable model for table recognition · 240fe99e
      myhloli authored
      - Add RapidTable model support for table recognition
      - Update table model configuration and initialization
      - Modify table recognition process to use RapidTable when specified
      - Add RapidTable dependency to setup.py
      240fe99e
  16. 04 Nov, 2024 1 commit
  17. 23 Oct, 2024 1 commit
  18. 10 Sep, 2024 2 commits
    • Xiaomeng Zhao's avatar
      Update setup.py · 20212a37
      Xiaomeng Zhao authored
      update UniMERNet to 0.2.1
      20212a37
    • myhloli's avatar
      refactor(pdf_extract_kit): update model config and weight paths for UniMERNet-0.2.0 · 3e9bc7a4
      myhloli authored
      Update the paths to model weights and configuration files for the UniMERNet architecture
      in both the demo.yaml and model_configs.yaml files. Adjust the mfr_model_init function toreflect the new weight and configuration paths. The changes include specifying more detailed
      paths to the unimernet_base directory and changing the weight file extension to .pth.
      3e9bc7a4
  19. 04 Aug, 2024 2 commits
  20. 01 Aug, 2024 1 commit
  21. 30 Jul, 2024 1 commit
  22. 28 Jul, 2024 1 commit
  23. 23 Jul, 2024 1 commit
  24. 12 Jul, 2024 2 commits
  25. 11 Jul, 2024 1 commit
    • myhloli's avatar
      feat(model): add model mode selection for PDF analysis · bc0f6932
      myhloli authored
      Introduce a new feature that allows users to choose between a "lite" and a "full"
      model mode for PDF document analysis. The "lite" mode uses a faster, less
      accurate model, while the "full" mode employs a higher-precision model at the
      cost of speed. This selection can be made through the CLI or API, providing
      flexibility for different use cases.
      bc0f6932
  26. 08 Jul, 2024 1 commit
  27. 25 Jun, 2024 1 commit
  28. 20 Jun, 2024 2 commits
  29. 18 Jun, 2024 1 commit