1. 17 Jan, 2025 1 commit
  2. 16 Jan, 2025 1 commit
  3. 15 Jan, 2025 1 commit
  4. 14 Jan, 2025 1 commit
  5. 03 Jan, 2025 1 commit
    • myhloli's avatar
      feat(model): add onnxruntime support for paddleocr on cpu · 512adb67
      myhloli authored
      - Implement ONNXModelSingleton to manage ONNX models
      - Modify ModifiedPaddleOCR to use ONNX models on ARM CPUs without CUDA
      - Update RapidTableModel to use RapidOCR with ONNXRuntime on CPU
      - Add rapidocr_onnxruntime dependency in setup.py
      512adb67
  6. 25 Dec, 2024 2 commits
    • myhloli's avatar
      refactor(magic_pdf): remove unnecessary logging statements · 192047a1
      myhloli authored
      - Comment out logging statements for title list, title completion, and length comparison
      - Improve code readability and reduce clutter by removing unused debug information
      192047a1
    • myhloli's avatar
      feat(llm_aided): add title optimization feature · 0a468eca
      myhloli authored
      - Implement llm_aided_title function to optimize document titles using LLM
      - Update pdf_parse_union_core_v2.py to include title optimization
      - Modify ocr_mkcontent.py to use optimized title levels- Add openai SDK dependency in setup.py
      0a468eca
  7. 24 Dec, 2024 1 commit
    • myhloli's avatar
      feat(llm): add LLM-aided formula and text correction · c660fdc8
      myhloli authored
      - Add LLM-aided formula and text correction functionality
      - Update config reader to include LLM-aided settings
      - Create new LLM-aided processing module
      - Update main processing script to incorporate LLM-aided corrections
      - Modify download scripts to check for new config version
      c660fdc8