1. 09 Dec, 2024 1 commit
  2. 07 Dec, 2024 1 commit
  3. 06 Dec, 2024 9 commits
  4. 05 Dec, 2024 1 commit
    • myhloli's avatar
      perf(model): add threading lock for OCR model initialization · 04478095
      myhloli authored
      - Introduce a lock to synchronize access to OCR model initialization- This change improves thread safety when multiple threads access the OCR model concurrently
      - The lock ensures that the OCR model is initialized only once, even in multi-threaded scenarios
      04478095
  5. 03 Dec, 2024 5 commits
  6. 29 Nov, 2024 1 commit
  7. 28 Nov, 2024 1 commit
  8. 27 Nov, 2024 2 commits
  9. 26 Nov, 2024 2 commits
  10. 24 Nov, 2024 2 commits
  11. 22 Nov, 2024 2 commits
  12. 21 Nov, 2024 2 commits
    • myhloli's avatar
      refactor(txt_parse): improve text extraction accuracy with new algorithm · 309be741
      myhloli authored
      - Implement new text extraction method (txt_spans_extract_v2) to enhance accuracy
      - Add character filling in spans for better text reconstruction
      - Introduce empty span handling using OCR for missed text
      - Optimize span filtering and overlap removal
      309be741
    • myhloli's avatar
      feat(ocr): improve text detection and OCR accuracy · b2e37a2d
      myhloli authored
      - Update OCR utils to handle different box formats and improve angle calculation
      - Modify PDF extraction kit to support OCR option and optimize processing flow
      - Enhance PPOCR model to sort and filter detection boxes, improving text splitting accuracy
      b2e37a2d
  13. 19 Nov, 2024 1 commit
  14. 18 Nov, 2024 2 commits
  15. 15 Nov, 2024 1 commit
  16. 08 Nov, 2024 2 commits
    • myhloli's avatar
      feat(table): add RapidOCR support for RapidTable model · fe2c2c0d
      myhloli authored
      - Integrate RapidOCR with RapidTable model for table recognition
      - Improve memory management for devices with <= 8GB VRAM
      - Update table recognition process to use RapidOCR for RapidTable
      - Add rapidocr-paddle dependency in setup.py
      fe2c2c0d
    • myhloli's avatar
      feat(table): integrate RapidTable model for table recognition · 240fe99e
      myhloli authored
      - Add RapidTable model support for table recognition
      - Update table model configuration and initialization
      - Modify table recognition process to use RapidTable when specified
      - Add RapidTable dependency to setup.py
      240fe99e
  17. 07 Nov, 2024 1 commit
    • myhloli's avatar
      feat(model): add xycut algorithm for block sorting · 7d5850e3
      myhloli authored
      - Implement xycut algorithm to sort blocks when layoutreader fails
      - Add recursive_xy_cut function to perform the xycut algorithm- Update pdf_parse_union_core_v2.py to use xycut when layoutreader fails
      - Modify draw_bbox.py to handle cases where layoutreader fails to sort blocks
      7d5850e3
  18. 06 Nov, 2024 1 commit
  19. 05 Nov, 2024 1 commit
  20. 04 Nov, 2024 2 commits