1. 28 Oct, 2024 1 commit
  2. 27 Oct, 2024 2 commits
  3. 26 Oct, 2024 1 commit
    • myhloli's avatar
      feat(draw_bbox): update bounding box drawing for tables and images · 0e8d5893
      myhloli authored
      - Add support for drawing bounding boxes of table and image sub-blocks
      - Implement sorting of table blocks based on type order
      - Update bounding box drawing for text and title blocks
      - Refactor code to handle different block types and their sub-blocks
      0e8d5893
  4. 25 Oct, 2024 7 commits
  5. 24 Oct, 2024 3 commits
  6. 23 Oct, 2024 1 commit
    • myhloli's avatar
      feat(model): add support for DocLayout-YOLO model · 1279f2cd
      myhloli authored
      - Add new layout model option: DocLayout-YOLO
      - Implement model initialization and prediction for DocLayout-YOLO
      - Update configuration options to include new model- Modify existing code to support both LayoutLMv3 and DocLayout-YOLO models
      - Update Gradio app to support more Custom Switch
      1279f2cd
  7. 21 Oct, 2024 2 commits
  8. 18 Oct, 2024 1 commit
  9. 17 Oct, 2024 1 commit
  10. 15 Oct, 2024 4 commits
  11. 14 Oct, 2024 2 commits
    • myhloli's avatar
      fix(magic_pdf): include List and Index block types in processing · 0a9a6d3e
      myhloli authored
      Add List and Index to the list of block types being processed in the draw_bbox.py file. This inclusion ensures that these block types are handled similarly to other text-containing blocks, improving the overall document processing accuracy and consistency.
      0a9a6d3e
    • myhloli's avatar
      feat(list&index block): detect and merge list and index blocks · 1f1dd353
      myhloli authored
      - Add detection for list and index blocks in OCR processing- Implement merging of list and index blocks across pages
      - Update block types to include list and index categories
      - Adjust text merging logic to handle new block types
      - Modify layout drawing to distinguish list and index blocks
      1f1dd353
  12. 10 Oct, 2024 2 commits
  13. 08 Oct, 2024 5 commits
  14. 06 Oct, 2024 1 commit
    • myhloli's avatar
      refactor(model): improve timing information and performance · be1b1ae7
      myhloli authored
      - Enhance timing output precision to two decimal places for better readability- Calculate and log document analysis speed in pages per second
      - Optimize logging for YOLO and table recognition processes
      - Remove unnecessary comments and improve code efficiency
      be1b1ae7
  15. 30 Sep, 2024 1 commit
  16. 29 Sep, 2024 2 commits
  17. 28 Sep, 2024 3 commits
    • myhloli's avatar
      refactor(magic_pdf): import model helpers directly for clarity · 42a7d792
      myhloli authored
      Update import statements in `pdf_parse_union_core_v2.py` to directly import
      `prepare_inputs`, `boxes2inputs`, and `parse_logits` from `magic_pdf.model.v3.helpers`
      instead of from `magic_pdf.model.v3`. This change streamlines the imports, making the
      code more readable and maintaining a cleaner approach to modular design.
      42a7d792
    • myhloli's avatar
      refactor(pdf_parse_union_core_v2): update import paths to use new package structure · 5522d0a3
      myhloli authored
      Adapt import statements in `pdf_parse_union_core_v2.py` to reflect the updated packagestructure, changing from the `magic_pdf.v3.helpers` module to the `magic_pdf.model.v3`
      module. This ensures compatibility with the revised directory layout.
      5522d0a3
    • myhloli's avatar
      fix(pdf_parse): handle blocks without lines and enable bf16 on compatible devices · 2145a8b6
      myhloli authored
      Blocks without lines are now correctly indexed even when they contain textual content rendered
      as images. The sorting logic has been updated to accommodate this scenario. Additionally, the
      LayoutLMv3 model initialization has been enhanced to utilize bfloat16 precision on devices that
      support it, offering potential performance benefits on supported hardware.
      2145a8b6
  18. 27 Sep, 2024 1 commit
    • myhloli's avatar
      refactor(pdf_parse): remove redundant sorting and optimize block indexing · 177ab08e
      myhloli authored
      Removed redundant sorting of lines by model and optimized calculation of block
      indexes by using a single pass through the sorted lines. This change simplifies the
      code and potentially improves performance by reducing the number of sortingoperations and unnecessary iterations over blocks without lines.
      177ab08e