"mmdet3d/core/utils/gaussian.py" did not exist on "79a8299cf8a939f69929db1e543443252fbb6629"
- 21 Jan, 2025 4 commits
-
-
myhloli authored
refactor(magic_pdf): adjust VRAM allocation and MFR batch size- Update VRAM allocation logic to use 'VIRTUAL_VRAM_SIZE' environment variable - Reduce MFR (Math Formula Recognition) batch size from 64 to 32
-
myhloli authored
- Update GPU memory check and batch ratio calculation logic - Add support for virtual VRAM size environment variable - Improve logging for GPU memory and batch ratio
-
myhloli authored
- Reduce YOLO_LAYOUT_BASE_BATCH_SIZE from 4 to 1 - Simplify batch ratio calculation for formula detection - Remove unused conditional logic in batch ratio determination
-
myhloli authored
- Update model path from 'unimernet_small' to 'unimernet_small_2501' in multiple scripts and configuration files - This change affects download_models.py, download_models_hf.py, and model_configs.yaml
-
- 20 Jan, 2025 3 commits
-
-
myhloli authored
- Add key length validation for ONNX model initialization - Move import statements to the top of the file - Wrap model initialization in a try-except block for better error handling - Refactor code to improve readability and maintainability
-
myhloli authored
- Add remove_tilted_line function to filter out lines with angles between 2 and 88 degrees - Integrate the new function into the text extraction process - Improve the accuracy of text block processing by removing non-horizontal/vertical lines
-
陆逊 authored
-
- 17 Jan, 2025 3 commits
-
-
myhloli authored
- Added instructions for checking the reasonability of heading levels - Included guidelines for making fine adjustments based on context and logic - Emphasized the importance of aligning the final result with the document's actual structure
-
myhloli authored
- Commented out the original batch ratio calculation - Set a fixed batch ratio of 2 for GPUs with less than 8 GB memory - Increased batch ratio to 4 for GPUs with 8 GB or more memory
-
myhloli authored
- Import get_device function from magic_pdf.libs.config_reader- Update RapidTableModel initialization to include device parameter for Unitable model
-
- 16 Jan, 2025 3 commits
-
-
myhloli authored
- Modify the batch analyze process to handle the rapid table model's output - Add logic_points variable to capture additional output from rapid table prediction
-
myhloli authored
- Update RapidTable dependency to version 1.0.3 - Add support for sub-models in RapidTable - Update magic-pdf configuration to include table sub-model - Modify table model initialization to support sub-models - Update table prediction logic to handle new output format
-
myhloli authored
- Adjust end_page_id calculation to prevent IndexError when accessing pages - Enhance error handling in LLM post-processing by specifically catching JSONDecodeError
-
- 15 Jan, 2025 5 commits
-
-
myhloli authored
- Rename and update merge_title_blocks function - Implement merge_two_bbox helper function - Refactor merging logic to preserve original block structure- Update function calls and integrate with existing pipeline
-
myhloli authored
- Add support for NPU (Neural Processing Unit) when available - Implement batch analysis for GPU and NPU devices - Optimize memory usage and improve performance - Update logging and error handling
-
myhloli authored
- Add `remove_invalid_surrogates` function to filter out invalid UTF-16 surrogate pairs - Integrate the new function into the `detect_lang` workflow - Include a test case with UTF-16 surrogates to verify the fix
-
myhloli authored
- Clarify the expected format for the optimized title list JSON output- Emphasize the need to return only the title levels in the specified format
-
myhloli authored
- Modified the IOU threshold in ocr_span_list_modify.py from 0.9 to 0.35 - This change aims to improve the detection of overlapping characters in OCR processed PDFs
-
- 14 Jan, 2025 4 commits
-
-
myhloli authored
- Add average line height calculation for title blocks - Include page number in title dictionary - Improve title optimization prompt for better hierarchy- Implement retry mechanism for JSON decoding errors - Add error logging for title count mismatch
-
myhloli authored
-
myhloli authored
- Merge title blocks that are close to each other horizontally - Adjust line insertion logic for title blocks- Increase image size and decrease confidence threshold for layout detection - Update DocLayoutYOLO model weights - Refactor drawing of bounding boxes for different block types
-
Xiaomeng Zhao authored
-
- 10 Jan, 2025 4 commits
-
-
myhloli authored
-
myhloli authored
- Add enable flag check for formula, text, and title optimizations
-
myhloli authored
-
myhloli authored
- Add MPS support for Apple Silicon devices - Implement empty_cache() for MPS devices - Set PYTORCH_ENABLE_MPS_FALLBACK environment variable - Adjust MFR model device allocation for MPS
-
- 09 Jan, 2025 5 commits
-
-
myhloli authored
- Improve language detection by removing newline characters from the input text - Add error handling and fallback mechanism to deal with text containing control characters
-
myhloli authored
- Remove conditional logic for OCR engine selection - Always use RapidOCR as the OCR engine - Simplify the __init__ method by removing unused code
-
myhloli authored
- Remove YOLO v11 language detection model from model_configs.yaml - Update language detection utils to use a fixed model path instead of dynamic configuration - Remove unused model weight parameter for YOLO v11 language detection
-
myhloli authored
- Implement block sorting within image and table blocks - Ensure correct order of captions and footnotes within blocks - Improve overall document structure and parsing accuracy
-
myhloli authored
- Remove LangDetectMode and related conditional logic - Use a single model weight for language detection - Add logging for language detection results - Update model initialization and prediction methods
-
- 08 Jan, 2025 3 commits
-
-
myhloli authored
- Add language detection model initialization and integration - Update model list to include language detection - Refactor language detection utils for better model management
-
myhloli authored
- Add separate models for Chinese/Japanese and English/French/German detection - Implement mode-based detection to use appropriate models for different languages - Update language detection process to use higher DPI for better accuracy - Modify model initialization and prediction logic to support new language-specific models
-
myhloli authored
- Add logic to set any negative values in block['bbox'] to 0 - This prevents potential errors when processing PDF blocks
-
- 07 Jan, 2025 1 commit
-
-
myhloli authored
- Remove DropMode and MakeMode imports from user code - Set default drop_mode to DropMode.NONE in get_markdown and get_content_list methods - Remove md_make_mode parameter from get_content_list method - Add dump_middle_json method to PipeResult - Update examples in API documentation and demo script
-
- 06 Jan, 2025 3 commits
-
-
Xiaomeng Zhao authored
-
myhloli authored
- Add check for empty OCR result when using PaddleOCR model - Assign None to ocr_result if no text is detected, preventing further errors
-
icecraft authored
-
- 05 Jan, 2025 2 commits
-
-
myhloli authored
- Add `draw_char_bbox` function to `draw_bbox.py` for drawing character bounding boxes - Integrate `draw_char_bbox` into `common.py` for use in PDF processing pipeline - Include option to draw character bounding boxes in debug mode
-
myhloli authored
style(pdf_parse_union_core_v2): remove unnecessary spaces and improve code formatting- Remove extra space in conditional statement for character spacing logic - Adjust spacing in trigonometric checks for line direction- Improve overall code readability and consistency
-