1. 22 Nov, 2024 2 commits
    • feat(README): update for v0.10.0 · d9cfdad1
      myhloli authored
      - Introduced hybrid OCR text extraction capabilities in v0.10.0
      - Significantly improved parsing performance in complex text distribution scenarios
      - Combined the accurate content extraction and faster speed of text mode with the more precise span/line region recognition of OCR mode
      - Updated both English and Chinese README files
    • refactor(para): improve line stop flag and remove unused debug mode · 5d6cbcb1
      myhloli authored
      - Add '-' and '–' to LINE_STOP_FLAG in pdf_parse_union_core_v2.py
      - Remove unused debug_mode parameter from para_split function in para_split_v3.py
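      A minimal sketch of what the LINE_STOP_FLAG change in 5d6cbcb1 might look like in use. It assumes LINE_STOP_FLAG is a tuple of characters that mark a line as finished. The tuple below is a hypothetical subset, not the actual contents of pdf_parse_union_core_v2.py, and the helper ends_with_stop_flag is an illustrative name that does not come from the repository.

```python
# Hypothetical subset of stop characters; the real tuple is not shown in this log.
# The commit adds '-' (hyphen) and '–' (en dash) to the set.
LINE_STOP_FLAG = ('.', '!', '?', '。', '！', '？', '-', '–')


def ends_with_stop_flag(line_text: str) -> bool:
    """Return True if the line's last non-whitespace character is a stop flag."""
    stripped = line_text.rstrip()
    # str.endswith accepts a tuple of suffixes and checks each one.
    return bool(stripped) and stripped.endswith(LINE_STOP_FLAG)


if __name__ == "__main__":
    for text in ("A finished sentence.", "a hyphen-", "pages 3–", "continued on"):
        print(repr(text), ends_with_stop_flag(text))
```

      Under this assumption, a line ending in a hyphen or en dash would now be treated the same way as one ending in sentence punctuation when lines are classified.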
  2. 21 Nov, 2024 19 commits
  3. 20 Nov, 2024 1 commit
  4. 19 Nov, 2024 4 commits
  5. 18 Nov, 2024 14 commits