1. 20 Mar, 2025 1 commit
  2. 13 Feb, 2025 2 commits
  3. 16 Jan, 2025 1 commit
    • myhloli's avatar
      feat(table): upgrade RapidTable to1.0.3 and add sub-model support · 79c8a5c8
      myhloli authored
      - Update RapidTable dependency to version 1.0.3
      - Add support for sub-models in RapidTable
      - Update magic-pdf configuration to include table sub-model
      - Modify table model initialization to support sub-models
      - Update table prediction logic to handle new output format
      79c8a5c8
  4. 14 Jan, 2025 1 commit
    • myhloli's avatar
      feat(layout): improve title block handling and layout detection · c20e9a1e
      myhloli authored
      - Merge title blocks that are close to each other horizontally
      - Adjust line insertion logic for title blocks- Increase image size and decrease confidence threshold for layout detection
      - Update DocLayoutYOLO model weights
      - Refactor drawing of bounding boxes for different block types
      c20e9a1e
  5. 09 Jan, 2025 1 commit
  6. 03 Jan, 2025 1 commit
    • myhloli's avatar
      feat(model): add onnxruntime support for paddleocr on cpu · 512adb67
      myhloli authored
      - Implement ONNXModelSingleton to manage ONNX models
      - Modify ModifiedPaddleOCR to use ONNX models on ARM CPUs without CUDA
      - Update RapidTableModel to use RapidOCR with ONNXRuntime on CPU
      - Add rapidocr_onnxruntime dependency in setup.py
      512adb67
  7. 26 Dec, 2024 1 commit
  8. 25 Dec, 2024 1 commit
    • myhloli's avatar
      feat(llm_aided): add title optimization feature · 0a468eca
      myhloli authored
      - Implement llm_aided_title function to optimize document titles using LLM
      - Update pdf_parse_union_core_v2.py to include title optimization
      - Modify ocr_mkcontent.py to use optimized title levels- Add openai SDK dependency in setup.py
      0a468eca
  9. 11 Dec, 2024 1 commit
    • myhloli's avatar
      build(deps): update torch and torchvision version requirements · 9a96362d
      myhloli authored
      - Specify torch==2.3.1 and torchvision==0.18.1 for Windows CUDA installation
      - Add torch and torchvision version constraints in setup.py:
        - torch>=2.2.2,<=2.3.1
        - torchvision>=0.17.2,<=0.18.1
      - Update installation instructions in both English and Chinese README files
      9a96362d
  10. 09 Dec, 2024 2 commits
  11. 06 Dec, 2024 1 commit
  12. 18 Nov, 2024 1 commit
    • myhloli's avatar
      build(setup): add old_linux specific dependencies · d0f633e2
      myhloli authored
      - Add albumentations package with version <=1.4.20 for old_linux
      - This version is compatible with Linux systems from 2019 and earlier
      - Version 1.4.21 and above introduced simsimd which is not supported on older Linux systems
      d0f633e2
  13. 15 Nov, 2024 1 commit
  14. 08 Nov, 2024 2 commits
    • myhloli's avatar
      feat(table): add RapidOCR support for RapidTable model · fe2c2c0d
      myhloli authored
      - Integrate RapidOCR with RapidTable model for table recognition
      - Improve memory management for devices with <= 8GB VRAM
      - Update table recognition process to use RapidOCR for RapidTable
      - Add rapidocr-paddle dependency in setup.py
      fe2c2c0d
    • myhloli's avatar
      feat(table): integrate RapidTable model for table recognition · 240fe99e
      myhloli authored
      - Add RapidTable model support for table recognition
      - Update table model configuration and initialization
      - Modify table recognition process to use RapidTable when specified
      - Add RapidTable dependency to setup.py
      240fe99e
  15. 04 Nov, 2024 1 commit
  16. 23 Oct, 2024 1 commit
  17. 10 Sep, 2024 2 commits
    • Xiaomeng Zhao's avatar
      Update setup.py · 20212a37
      Xiaomeng Zhao authored
      update UniMERNet to 0.2.1
      20212a37
    • myhloli's avatar
      refactor(pdf_extract_kit): update model config and weight paths for UniMERNet-0.2.0 · 3e9bc7a4
      myhloli authored
      Update the paths to model weights and configuration files for the UniMERNet architecture
      in both the demo.yaml and model_configs.yaml files. Adjust the mfr_model_init function toreflect the new weight and configuration paths. The changes include specifying more detailed
      paths to the unimernet_base directory and changing the weight file extension to .pth.
      3e9bc7a4
  18. 04 Aug, 2024 2 commits
  19. 01 Aug, 2024 1 commit
  20. 30 Jul, 2024 1 commit
  21. 28 Jul, 2024 1 commit
  22. 23 Jul, 2024 1 commit
  23. 12 Jul, 2024 2 commits
  24. 11 Jul, 2024 1 commit
    • myhloli's avatar
      feat(model): add model mode selection for PDF analysis · bc0f6932
      myhloli authored
      Introduce a new feature that allows users to choose between a "lite" and a "full"
      model mode for PDF document analysis. The "lite" mode uses a faster, less
      accurate model, while the "full" mode employs a higher-precision model at the
      cost of speed. This selection can be made through the CLI or API, providing
      flexibility for different use cases.
      bc0f6932
  25. 08 Jul, 2024 1 commit
  26. 25 Jun, 2024 1 commit
  27. 20 Jun, 2024 2 commits
  28. 18 Jun, 2024 1 commit
  29. 05 Jun, 2024 1 commit
  30. 04 Jun, 2024 2 commits
  31. 03 Jun, 2024 1 commit
  32. 30 May, 2024 1 commit