1. 02 Apr, 2025 6 commits
  2. 01 Apr, 2025 2 commits
    • myhloli's avatar
      refactor(ocr): remove unused OCR dictionaries and update model configurations · 41f1fb8a
      myhloli authored
      - Remove unused OCR dictionaries for Arabic, Belarusian, Bulgarian and Armenian languages
      - Update model configurations in arch_config.yaml:
      - Comment out 'out_channels' for various language models
        - Rename Arabic, Korean, Japanese, Tamil and Devanagari model configurations to use 'v3' instead of 'v4'
      - Delete ar_dict.txt, be_dict.txt and bg_dict.txt files
      - Update arabic_dict.txt to remove blank line at the start
      41f1fb8a
    • myhloli's avatar
      refactor(ocr): remove unused code and simplify model architecture · b3d6785d
      myhloli authored
      - Remove unused imports and code
      - Simplify model architecture by removing unnecessary components
      - Update initialization and forward pass logic
      - Rename variables for consistency
      b3d6785d
  3. 31 Mar, 2025 2 commits
    • myhloli's avatar
      feat(ocr): implement language-specific OCR processing · d7d85a28
      myhloli authored
      - Add support for multiple languages in OCR processing
      - Create separate lists for each language to improve processing efficiency
      - Update OCR model initialization to use PytorchPaddleOCR instead of ModifiedPaddleOCR
      - Modify get_ocr_result_list function to include language information- Improve logging for OCR detection and recognition
      d7d85a28
    • myhloli's avatar
      feat(ocr): implement separate detection and recognition processes · a330651d
      myhloli authored
      - Split OCR process into detection and recognition stages
      - Update batch analysis and document analysis pipelines
      - Modify OCR result formatting and handling
      - Remove unused imports and optimize code structure
      a330651d
  4. 27 Mar, 2025 1 commit
    • myhloli's avatar
      feat(model): add OCR model base structure and utilities · a7a899f6
      myhloli authored
      - Add base model structure for OCR in pytorch
      - Implement data augmentation and transformation modules
      - Create utilities for dictionary handling and state dict conversion
      - Include post-processing modules for OCR
      - Add weight initialization and loading functions
      a7a899f6
  5. 24 Mar, 2025 1 commit
  6. 22 Mar, 2025 1 commit
  7. 21 Mar, 2025 1 commit
  8. 20 Mar, 2025 3 commits
  9. 19 Mar, 2025 1 commit
  10. 13 Mar, 2025 4 commits
  11. 12 Mar, 2025 1 commit
  12. 10 Mar, 2025 1 commit
  13. 07 Mar, 2025 2 commits
  14. 03 Mar, 2025 1 commit
  15. 23 Feb, 2025 1 commit
  16. 21 Feb, 2025 2 commits
    • myhloli's avatar
      fix(model): handle import errors and improve exception logging · 66f0899a
      myhloli authored
      - Add ImportError handling to silence known import-related exceptions
      - Improve generic exception handling to log error messages- Maintain existing specific exception handlers for license-related issues
      66f0899a
    • myhloli's avatar
      feat(model_init): implement license verification for Ascend plugin · d5f6fbc6
      myhloli authored
      - Add license verification logic for Ascend plugin
      - Handle different license-related exceptions with appropriate error messages
      - Log success message with license expiration date if verification passes
      - Fall back to CPU model if license verification fails or plugin is not available
      d5f6fbc6
  17. 10 Feb, 2025 2 commits
  18. 09 Feb, 2025 1 commit
  19. 21 Jan, 2025 1 commit
  20. 20 Jan, 2025 2 commits
  21. 17 Jan, 2025 1 commit
  22. 16 Jan, 2025 1 commit
    • myhloli's avatar
      feat(table): upgrade RapidTable to1.0.3 and add sub-model support · 79c8a5c8
      myhloli authored
      - Update RapidTable dependency to version 1.0.3
      - Add support for sub-models in RapidTable
      - Update magic-pdf configuration to include table sub-model
      - Modify table model initialization to support sub-models
      - Update table prediction logic to handle new output format
      79c8a5c8
  23. 14 Jan, 2025 1 commit
    • myhloli's avatar
      feat(layout): improve title block handling and layout detection · c20e9a1e
      myhloli authored
      - Merge title blocks that are close to each other horizontally
      - Adjust line insertion logic for title blocks- Increase image size and decrease confidence threshold for layout detection
      - Update DocLayoutYOLO model weights
      - Refactor drawing of bounding boxes for different block types
      c20e9a1e
  24. 09 Jan, 2025 1 commit