  1. 21 Oct, 2024 1 commit
    • refactor(para): improve paragraph splitting algorithm · 8cc76c49
      myhloli authored
      - Adjust the threshold for identifying index blocks from 3 lines to 2 lines
      - Add a new function __is_list_group to detect if a group of blocks is a list
      - Modify the paragraph merging logic to handle list groups differently
  2. 15 Oct, 2024 3 commits
  3. 14 Oct, 2024 1 commit
    • feat(list&index block): detect and merge list and index blocks · 1f1dd353
      myhloli authored
      - Add detection for list and index blocks in OCR processing
      - Implement merging of list and index blocks across pages
      - Update block types to include list and index categories
      - Adjust text merging logic to handle new block types
      - Modify layout drawing to distinguish list and index blocks
  4. 10 Oct, 2024 2 commits