1. 02 Jan, 2025 1 commit
  2. 27 Dec, 2024 1 commit
  3. 25 Dec, 2024 2 commits
    • refactor(magic_pdf): remove unnecessary logging statements · 192047a1
      myhloli authored
      - Comment out logging statements for title list, title completion, and length comparison
      - Improve code readability and reduce clutter by removing unused debug information
    • feat(llm_aided): add title optimization feature · 0a468eca
      myhloli authored
      - Implement llm_aided_title function to optimize document titles using LLM
      - Update pdf_parse_union_core_v2.py to include title optimization
      - Modify ocr_mkcontent.py to use optimized title levels
      - Add openai SDK dependency in setup.py
  4. 24 Dec, 2024 1 commit
    • feat(llm): add LLM-aided formula and text correction · c660fdc8
      myhloli authored
      - Add LLM-aided formula and text correction functionality
      - Update config reader to include LLM-aided settings
      - Create new LLM-aided processing module
      - Update main processing script to incorporate LLM-aided corrections
      - Modify download scripts to check for new config version
  5. 20 Dec, 2024 1 commit
  6. 19 Dec, 2024 1 commit
  7. 18 Dec, 2024 7 commits
  8. 17 Dec, 2024 3 commits
  9. 16 Dec, 2024 1 commit
  10. 13 Dec, 2024 3 commits
  11. 12 Dec, 2024 4 commits
  12. 11 Dec, 2024 14 commits
  13. 10 Dec, 2024 1 commit
    • refactor(model): update import paths for PaddleOCR modules · 061c03a0
      myhloli authored
      - Change import paths from paddleocr.ppocr to ppocr for utility functions
      - Update import paths for logging and utility modules in ppocr_273_mod.py
      - Modify import paths for tablemaster_paddle.py to use ppstructure instead of paddleocr.ppstructure