1. 16 Jan, 2025 1 commit
    • myhloli's avatar
      feat(table): upgrade RapidTable to1.0.3 and add sub-model support · 79c8a5c8
      myhloli authored
      - Update RapidTable dependency to version 1.0.3
      - Add support for sub-models in RapidTable
      - Update magic-pdf configuration to include table sub-model
      - Modify table model initialization to support sub-models
      - Update table prediction logic to handle new output format
      79c8a5c8
  2. 14 Jan, 2025 1 commit
    • myhloli's avatar
      feat(layout): improve title block handling and layout detection · c20e9a1e
      myhloli authored
      - Merge title blocks that are close to each other horizontally
      - Adjust line insertion logic for title blocks- Increase image size and decrease confidence threshold for layout detection
      - Update DocLayoutYOLO model weights
      - Refactor drawing of bounding boxes for different block types
      c20e9a1e
  3. 09 Jan, 2025 3 commits
  4. 08 Jan, 2025 2 commits
  5. 06 Jan, 2025 1 commit
  6. 05 Jan, 2025 1 commit
    • myhloli's avatar
      fix(magic-pdf): update OCR model selection logic · 16a0a350
      myhloli authored
      - Add missing 'else' statement in OCR model selection logic
      - Ensure consistent formatting of 'if' statements for better readability
      - Remove unnecessary empty line in the 'app.py' file
      16a0a350
  7. 03 Jan, 2025 2 commits
  8. 30 Dec, 2024 2 commits
    • myhloli's avatar
      refactor(magic_pdf): comment out npu-related code · 88b909e2
      myhloli authored
      - Remove use_npu variable initialization
      - Comment out device assignment and npu check
      - Comment out use_npu parameter in ModifiedPaddleOCR constructor
      88b909e2
    • myhloli's avatar
      fix(npu): correct module name for NPU operations · 2684e775
      myhloli authored
      - Update `clean_memory.py` to use `torch_npu.npu` instead of `torch.npu`
      - Update `model_utils.py` to use `torch_npu.npu` instead of `torch.npu`
      - Simplify NPU availability check and bfloat16 support in `pdf_parse_union_core_v2.py`
      2684e775
  9. 26 Dec, 2024 2 commits
    • myhloli's avatar
      refactor(device): optimize memory cleaning and device selection · 50f48417
      myhloli authored
      - Update clean_memory function to support both CUDA and NPU devices
      - Implement get_device function to centralize device selection logic
      - Modify model initialization and memory cleaning to use the selected device
      - Update RapidTableModel to support both RapidOCR and PaddleOCR engines
      50f48417
    • myhloli's avatar
      feat(model): add npu support and optimize table model · 7990e7df
      myhloli authored
      - Add NPU support for memory cleaning and model initialization
      - Optimize table model initialization and prediction process
      - Update memory utils to support NPU
      - Add language parameter for table model
      7990e7df
  10. 17 Dec, 2024 1 commit
  11. 16 Dec, 2024 1 commit
  12. 12 Dec, 2024 2 commits
  13. 10 Dec, 2024 3 commits
  14. 06 Dec, 2024 5 commits
  15. 05 Dec, 2024 1 commit
    • myhloli's avatar
      perf(model): add threading lock for OCR model initialization · 04478095
      myhloli authored
      - Introduce a lock to synchronize access to OCR model initialization- This change improves thread safety when multiple threads access the OCR model concurrently
      - The lock ensures that the OCR model is initialized only once, even in multi-threaded scenarios
      04478095
  16. 03 Dec, 2024 2 commits
    • myhloli's avatar
      fix(vram): improve VRAM checking logic · 104273cc
      myhloli authored
      - Update VRAM checking logic in app.py and model_utils.py
      - Add None and type checks for VRAM values
      - Adjust concurrency limit calculation in app.py
      - Modify clean_vram function to handle cases with no VRAM information
      104273cc
    • myhloli's avatar
      feat(gradio_app): implement dynamic concurrency limit based on VRAM · b1fe9d4f
      myhloli authored
      - Add get_concurrency_limit function to calculate concurrency limit based on VRAM
      - Update clean_vram function and rename to get_vram for better clarity
      - Apply concurrency limit to the to_markdown function in the Gradio app
      b1fe9d4f
  17. 29 Nov, 2024 1 commit
  18. 27 Nov, 2024 1 commit
  19. 26 Nov, 2024 1 commit
  20. 22 Nov, 2024 1 commit
  21. 21 Nov, 2024 2 commits
    • myhloli's avatar
      refactor(txt_parse): improve text extraction accuracy with new algorithm · 309be741
      myhloli authored
      - Implement new text extraction method (txt_spans_extract_v2) to enhance accuracy
      - Add character filling in spans for better text reconstruction
      - Introduce empty span handling using OCR for missed text
      - Optimize span filtering and overlap removal
      309be741
    • myhloli's avatar
      feat(ocr): improve text detection and OCR accuracy · b2e37a2d
      myhloli authored
      - Update OCR utils to handle different box formats and improve angle calculation
      - Modify PDF extraction kit to support OCR option and optimize processing flow
      - Enhance PPOCR model to sort and filter detection boxes, improving text splitting accuracy
      b2e37a2d
  22. 19 Nov, 2024 1 commit
  23. 18 Nov, 2024 1 commit
    • myhloli's avatar
      feat(ocr): improve handling of angled text boxes · 4fd966eb
      myhloli authored
      - Add calculate_is_angle function to detect angled text boxes
      - Update update_det_boxes and merge_det_boxes functions to handle angled text boxes
      - Modify angle detection logic in various parts of the code
      4fd966eb
  24. 15 Nov, 2024 1 commit