1. 03 Jan, 2025 1 commit
    • myhloli's avatar
      feat(model): add onnxruntime support for paddleocr on cpu · 512adb67
      myhloli authored
      - Implement ONNXModelSingleton to manage ONNX models
      - Modify ModifiedPaddleOCR to use ONNX models on ARM CPUs without CUDA
      - Update RapidTableModel to use RapidOCR with ONNXRuntime on CPU
      - Add rapidocr_onnxruntime dependency in setup.py
      512adb67
  2. 26 Dec, 2024 1 commit
  3. 25 Dec, 2024 1 commit
    • myhloli's avatar
      feat(llm_aided): add title optimization feature · 0a468eca
      myhloli authored
      - Implement llm_aided_title function to optimize document titles using LLM
      - Update pdf_parse_union_core_v2.py to include title optimization
      - Modify ocr_mkcontent.py to use optimized title levels- Add openai SDK dependency in setup.py
      0a468eca
  4. 11 Dec, 2024 1 commit
    • myhloli's avatar
      build(deps): update torch and torchvision version requirements · 9a96362d
      myhloli authored
      - Specify torch==2.3.1 and torchvision==0.18.1 for Windows CUDA installation
      - Add torch and torchvision version constraints in setup.py:
        - torch>=2.2.2,<=2.3.1
        - torchvision>=0.17.2,<=0.18.1
      - Update installation instructions in both English and Chinese README files
      9a96362d
  5. 09 Dec, 2024 2 commits
  6. 06 Dec, 2024 1 commit
  7. 18 Nov, 2024 1 commit
    • myhloli's avatar
      build(setup): add old_linux specific dependencies · d0f633e2
      myhloli authored
      - Add albumentations package with version <=1.4.20 for old_linux
      - This version is compatible with Linux systems from 2019 and earlier
      - Version 1.4.21 and above introduced simsimd which is not supported on older Linux systems
      d0f633e2
  8. 15 Nov, 2024 1 commit
  9. 08 Nov, 2024 2 commits
    • myhloli's avatar
      feat(table): add RapidOCR support for RapidTable model · fe2c2c0d
      myhloli authored
      - Integrate RapidOCR with RapidTable model for table recognition
      - Improve memory management for devices with <= 8GB VRAM
      - Update table recognition process to use RapidOCR for RapidTable
      - Add rapidocr-paddle dependency in setup.py
      fe2c2c0d
    • myhloli's avatar
      feat(table): integrate RapidTable model for table recognition · 240fe99e
      myhloli authored
      - Add RapidTable model support for table recognition
      - Update table model configuration and initialization
      - Modify table recognition process to use RapidTable when specified
      - Add RapidTable dependency to setup.py
      240fe99e
  10. 04 Nov, 2024 1 commit
  11. 23 Oct, 2024 1 commit
  12. 10 Sep, 2024 2 commits
    • Xiaomeng Zhao's avatar
      Update setup.py · 20212a37
      Xiaomeng Zhao authored
      update UniMERNet to 0.2.1
      20212a37
    • myhloli's avatar
      refactor(pdf_extract_kit): update model config and weight paths for UniMERNet-0.2.0 · 3e9bc7a4
      myhloli authored
      Update the paths to model weights and configuration files for the UniMERNet architecture
      in both the demo.yaml and model_configs.yaml files. Adjust the mfr_model_init function toreflect the new weight and configuration paths. The changes include specifying more detailed
      paths to the unimernet_base directory and changing the weight file extension to .pth.
      3e9bc7a4
  13. 04 Aug, 2024 2 commits
  14. 01 Aug, 2024 1 commit
  15. 30 Jul, 2024 1 commit
  16. 28 Jul, 2024 1 commit
  17. 23 Jul, 2024 1 commit
  18. 12 Jul, 2024 2 commits
  19. 11 Jul, 2024 1 commit
    • myhloli's avatar
      feat(model): add model mode selection for PDF analysis · bc0f6932
      myhloli authored
      Introduce a new feature that allows users to choose between a "lite" and a "full"
      model mode for PDF document analysis. The "lite" mode uses a faster, less
      accurate model, while the "full" mode employs a higher-precision model at the
      cost of speed. This selection can be made through the CLI or API, providing
      flexibility for different use cases.
      bc0f6932
  20. 08 Jul, 2024 1 commit
  21. 25 Jun, 2024 1 commit
  22. 20 Jun, 2024 2 commits
  23. 18 Jun, 2024 1 commit
  24. 05 Jun, 2024 1 commit
  25. 04 Jun, 2024 2 commits
  26. 03 Jun, 2024 1 commit
  27. 30 May, 2024 2 commits
  28. 05 Mar, 2024 3 commits
  29. 04 Mar, 2024 2 commits