Commits · 383fed52ea42660ec5b7b8c84987dedeb6355a45 · wangsen / MinerU

05 Jun, 2025 1 commit
- Update batch_analyze.py · 400bb763
  seedclaimer authored Jun 05, 2025
```
fix absence of sorted_boxes, merge_det_boxes, update_det_boxes.
```
  400bb763
04 Jun, 2025 2 commits
- fix(batch): refactor OCR detection integration and area ratio calculation · ddf5a878
  myhloli authored Jun 04, 2025
  
  ddf5a878
- fix(batch): disable OCR detection batch processing by default · 7c1d7dff
  myhloli authored Jun 04, 2025
  
  7c1d7dff
29 May, 2025 1 commit
- Update batch_analyze.py · 54950551
  Xiaomeng Zhao authored May 29, 2025
  
  54950551
28 May, 2025 1 commit
- 支持batch-ocr-det，速度约提升3倍（200页pdf在3090上） · 99d4c97a
  speta authored May 28, 2025
  
  99d4c97a
24 May, 2025 2 commits
- fix(ocr): adjust area ratio threshold and update fitz document handling in image conversion · 1e01ffcf
  myhloli authored May 24, 2025
  
  1e01ffcf
- feat(ocr): add area ratio calculation for OCR results and enhance get_coords_and_area function · a2b84813
  myhloli authored May 24, 2025
  
  a2b84813
22 Apr, 2025 1 commit

refactor(table): replace ocr_engine with lang in table model prediction · 1d1c7ba9

myhloli authored Apr 22, 2025

- Remove OCR engine instantiation inside the loop
- Pass language directly to the table model instead of OCR engine
- Simplify code structure and improve readability

1d1c7ba9

11 Apr, 2025 1 commit

refactor(model): optimize batch processing and inference · d2fc9dab

myhloli authored Apr 11, 2025

- Update batch processing logic for improved efficiency
- Refactor image analysis and inference methods
- Optimize dataset handling and image retrieval
- Improve error handling and logging in batch processes

d2fc9dab

09 Apr, 2025 2 commits

refactor(ocr): comment out det_count update and update OCR models · f8323ae0

myhloli authored Apr 09, 2025

- Comment out the line that updates det_count in batch_analyze.py
- Add a new OCR model configuration for Chinese (ch_lite) in models_config.yml- Update the Chinese OCR model configuration to use a different recognition model

f8323ae0

feat(model): improve table recognition by merging and filtering tables · df7ae404

myhloli authored Apr 09, 2025

- Add functions to calculate IoU, check if tables are inside each other, and merge tables
- Implement table merging for high IoU tables
- Add filtering to remove nested tables that don't overlap but cover a large area
- Update table_res_list and layout_res to reflect these changes

df7ae404

08 Apr, 2025 1 commit

refactor(ocr): improve OCR score precision to three decimal places · ea730ae2

myhloli authored Apr 08, 2025

- Update OCR score formatting in batch_analyze.py and pdf_parse_union_core_v2.py
- Change score rounding method to preserve three decimal places
- Enhance accuracy representation without significantly altering the score value

ea730ae2

03 Apr, 2025 3 commits

refactor(magic_pdf): optimize table recognition and layout detection · 1fd72f5f

myhloli authored Apr 03, 2025

- Update table recognition logic to process each table individually
- Refactor layout detection to use tqdm for progress tracking
- Optimize OCR recognition by using a single tqdm wrapper
- Improve MFR prediction with a more accurate progress bar
- Simplify MFD prediction by removing unnecessary total calculation

1fd72f5f

refactor(magic_pdf): optimize code and improve logging · 553f250f

myhloli authored Apr 03, 2025

- Remove unused imports and comments
- Increase MIN_BATCH_INFERENCE_SIZE from 100 to 200
- Comment out VRAM cleaning and logging in batch_analyze.py
- Simplify code in doc_analyze_by_custom_model.py- Add tqdm progress bar in pdf_parse_union_core_v2.py
- Enable tqdm in OCR processing

553f250f

feat(model): add tqdm progress bar to model prediction loops · 8e1c2339

myhloli authored Apr 03, 2025

- Add tqdm progress bar to batch prediction loops in multiple model modules
- Improve logging and error handling in batch analysis script
- Update table model initialization to use default sub-model if none specified
- Add tqdm dependency to requirements.txt

8e1c2339

31 Mar, 2025 3 commits

refactor(model): integrate AtomModelSingleton for OCR and improve OCR result handling · 59d6b195

myhloli authored Mar 31, 2025

- Replace direct OCR model access with AtomModelSingleton for better model management
- Round OCR scores to 2 decimal places for consistency
- Improve error handling and logging in batch analysis
- Simplify OCR result processing in pdf_parse_union_core_v2.py

59d6b195

feat(ocr): implement language-specific OCR processing · d7d85a28

myhloli authored Mar 31, 2025

- Add support for multiple languages in OCR processing
- Create separate lists for each language to improve processing efficiency
- Update OCR model initialization to use PytorchPaddleOCR instead of ModifiedPaddleOCR
- Modify get_ocr_result_list function to include language information- Improve logging for OCR detection and recognition

d7d85a28

feat(ocr): implement separate detection and recognition processes · a330651d

myhloli authored Mar 31, 2025

- Split OCR process into detection and recognition stages
- Update batch analysis and document analysis pipelines
- Modify OCR result formatting and handling
- Remove unused imports and optimize code structure

a330651d

26 Mar, 2025 1 commit
- feat: batch inference with ocr and lang flag · bbba2a12
  icecraft authored Mar 26, 2025
  
  bbba2a12
07 Mar, 2025 1 commit

refactor(magic_pdf): replace PIL with NumPy for image processing · 1b34f7e4

myhloli authored Mar 07, 2025

- Remove PIL usage across multiple files
- Convert image processing functions to use NumPy arrays
- Update crop_img function to work with NumPy arrays
- Modify image loading and resizing to use NumPy and OpenCV
- Clean up unused imports and comments related to PIL

1b34f7e4

21 Jan, 2025 1 commit

perf(model): adjust batch size for layout and formula detection · 49d140c5

myhloli authored Jan 21, 2025

- Reduce YOLO_LAYOUT_BASE_BATCH_SIZE from 4 to 1
- Simplify batch ratio calculation for formula detection
- Remove unused conditional logic in batch ratio determination

49d140c5

16 Jan, 2025 1 commit

refactor(model): update batch analyze logic for rapid table model · 452a9c0b

myhloli authored Jan 16, 2025

- Modify the batch analyze process to handle the rapid table model's output
- Add logic_points variable to capture additional output from rapid table prediction

452a9c0b

15 Jan, 2025 1 commit

feat(model): improve batch analysis logic and support npu · f3502226

myhloli authored Jan 15, 2025

- Add support for NPU (Neural Processing Unit) when available
- Implement batch analysis for GPU and NPU devices
- Optimize memory usage and improve performance
- Update logging and error handling

f3502226

14 Jan, 2025 1 commit
- refactor(BatchAnalyze): comment out image rotation logic in doclayout_yolo · 902dcd2c
  myhloli authored Jan 14, 2025
  
  902dcd2c
26 Dec, 2024 1 commit

refactor(device): optimize memory cleaning and device selection · 50f48417

myhloli authored Dec 26, 2024

- Update clean_memory function to support both CUDA and NPU devices
- Implement get_device function to centralize device selection logic
- Modify model initialization and memory cleaning to use the selected device
- Update RapidTableModel to support both RapidOCR and PaddleOCR engines

50f48417

18 Dec, 2024 1 commit
- refactor: refactor code · b2887ca0
  icecraft authored Dec 18, 2024
  
  b2887ca0
13 Dec, 2024 2 commits
- feat: add logging for detection time in BatchAnalyze when OCR is not applied · be010394
  Suven authored Dec 13, 2024
  
  be010394
- feat: enhance batch processing in BatchAnalyze with layout and OCR timing logs · 49bfdf07
  Suven authored Dec 13, 2024
  
  49bfdf07
12 Dec, 2024 2 commits
- fix: batch methods in DocLayoutYOLO and YOLOv8 models · 4fd1e41e
  Suven authored Dec 12, 2024
  
  4fd1e41e
- feat: add batch prediction methods for YOLOv8 and Unimernet models · 7ce9edc6
  Suven authored Dec 12, 2024
  
  7ce9edc6