1. 18 Nov, 2024 1 commit
    • refactor(para): improve paragraph splitting logic · 517fbe5b
      myhloli authored
      - Add page size information to blocks
      - Calculate block width ratio relative to page width
      - Adjust threshold for determining right side indentation
      - Implement additional checks for merging blocks across pages
      - Improve logic for identifying list structures
  2. 11 Nov, 2024 1 commit
  3. 03 Nov, 2024 1 commit
  4. 02 Nov, 2024 2 commits
    • feat(list): improve list detection algorithm · 2bf6c268
      myhloli authored
      - Add center_close_num and external_sides_not_close_num variables to analyze line positioning
      - Implement new list detection condition for centered lines
      - Enhance existing list detection logic with additional checks
    • fix(list): improve list identification accuracy · a8f2e7d6
      myhloli authored
      - Adjust the threshold for determining right-side spacing to 0.26 * block_weight
      - Add TODO comment for special list identification with all centered lines
      - Modify the condition for recognizing short item lists with left alignment
      - Update the condition for identifying the end of a list item
  5. 21 Oct, 2024 1 commit
    • refactor(para): improve paragraph splitting algorithm · 8cc76c49
      myhloli authored
      - Adjust the threshold for identifying index blocks from 3 lines to 2 lines
      - Add a new function __is_list_group to detect if a group of blocks is a list
      - Modify the paragraph merging logic to handle list groups differently
  6. 15 Oct, 2024 3 commits
  7. 14 Oct, 2024 1 commit
    • feat(list&index block): detect and merge list and index blocks · 1f1dd353
      myhloli authored
      - Add detection for list and index blocks in OCR processing
      - Implement merging of list and index blocks across pages
      - Update block types to include list and index categories
      - Adjust text merging logic to handle new block types
      - Modify layout drawing to distinguish list and index blocks
  8. 10 Oct, 2024 2 commits