1. 03 Apr, 2025 8 commits
    • myhloli's avatar
      feat(web_api): update configuration and remove unused code · 3a820305
      myhloli authored
      - Comment out PaddlePaddle GPU installation in Dockerfile
      - Add OCR model download URL in download_models.py
      - Update config version in magic-pdf.json
      - Remove outdated information and simplify README.md
      - Remove volume creation for PaddleOCR models in Dockerfile
      3a820305
    • myhloli's avatar
      docs(user_guide): update installation guide and CUDA support · b51ac110
      myhloli authored
      - Update CUDA version requirements to 12.4 and higher
      - Add support for CUDA 12.6 and CANN environments- Update Python version requirements to 3.10-3.12
      - Remove paddlepaddle-gpu installation and related instructions
      - Update magic-pdf installation command to use Aliyun mirror
      - Add storage requirements and update memory requirements
      - Update GPU hardware support list to include all GPUs with Tensor Cores
      - Add support for Apple Silicon
      b51ac110
    • myhloli's avatar
      docs(readme): update changelog and compatibility information · 0544996f
      myhloli authored
      - Update changelog for version 1.3.0 release
      - Clarify CUDA and GPU compatibility improvements
      - Add information about batch processing speed improvements
      - Update model download process and memory usage optimizations
      - Include link to batch processing demo script
      0544996f
    • myhloli's avatar
      refactor(magic_pdf): optimize table recognition and layout detection · 1fd72f5f
      myhloli authored
      - Update table recognition logic to process each table individually
      - Refactor layout detection to use tqdm for progress tracking
      - Optimize OCR recognition by using a single tqdm wrapper
      - Improve MFR prediction with a more accurate progress bar
      - Simplify MFD prediction by removing unnecessary total calculation
      1fd72f5f
    • myhloli's avatar
      refactor(magic_pdf): remove OCR timing measurement code · 795233d1
      myhloli authored
      - Comment out OCR timing measurement code to improve readability and performance
      - Remove unnecessary logging of OCR processing time
      795233d1
    • myhloli's avatar
      refactor(magic_pdf): optimize code and improve logging · 553f250f
      myhloli authored
      - Remove unused imports and comments
      - Increase MIN_BATCH_INFERENCE_SIZE from 100 to 200
      - Comment out VRAM cleaning and logging in batch_analyze.py
      - Simplify code in doc_analyze_by_custom_model.py- Add tqdm progress bar in pdf_parse_union_core_v2.py
      - Enable tqdm in OCR processing
      553f250f
    • myhloli's avatar
      docs(README): update model config examples and add tqdm dependency · 86058278
      myhloli authored
      - Remove outdated comments in table-config examples
      - Add tqdm to requirements in all Docker environments
      86058278
    • myhloli's avatar
      feat(model): add tqdm progress bar to model prediction loops · 8e1c2339
      myhloli authored
      - Add tqdm progress bar to batch prediction loops in multiple model modules
      - Improve logging and error handling in batch analysis script
      - Update table model initialization to use default sub-model if none specified
      - Add tqdm dependency to requirements.txt
      8e1c2339
  2. 02 Apr, 2025 25 commits
  3. 01 Apr, 2025 5 commits
  4. 31 Mar, 2025 2 commits
    • myhloli's avatar
      refactor(model): integrate AtomModelSingleton for OCR and improve OCR result handling · 59d6b195
      myhloli authored
      - Replace direct OCR model access with AtomModelSingleton for better model management
      - Round OCR scores to 2 decimal places for consistency
      - Improve error handling and logging in batch analysis
      - Simplify OCR result processing in pdf_parse_union_core_v2.py
      59d6b195
    • myhloli's avatar
      feat(ocr): implement language-specific OCR processing · d7d85a28
      myhloli authored
      - Add support for multiple languages in OCR processing
      - Create separate lists for each language to improve processing efficiency
      - Update OCR model initialization to use PytorchPaddleOCR instead of ModifiedPaddleOCR
      - Modify get_ocr_result_list function to include language information- Improve logging for OCR detection and recognition
      d7d85a28