- 08 Jan, 2025 2 commits
-
-
myhloli authored
- Add separate models for Chinese/Japanese and English/French/German detection - Implement mode-based detection to use appropriate models for different languages - Update language detection process to use higher DPI for better accuracy - Modify model initialization and prediction logic to support new language-specific models
-
myhloli authored
- Add logic to set any negative values in block['bbox'] to 0 - This prevents potential errors when processing PDF blocks
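The clamping described in this commit can be illustrated with a minimal sketch. The helper name `clamp_bbox` and the block layout (a dict with a four-element `bbox` list) are assumptions for illustration, not the project's actual code.

```python
def clamp_bbox(block):
    """Return a copy of `block` whose negative bbox coordinates are set to 0.

    Hypothetical helper: assumes block["bbox"] is [x0, y0, x1, y1].
    """
    clamped = dict(block)
    clamped["bbox"] = [max(0, v) for v in block["bbox"]]
    return clamped


block = {"type": "text", "bbox": [-1.5, 10, 200, -0.2]}
print(clamp_bbox(block)["bbox"])
```

Returning a copy rather than mutating in place keeps the sketch side-effect free; the commit itself may well modify the block directly.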
-
- 07 Jan, 2025 1 commit
-
-
myhloli authored
- Remove DropMode and MakeMode imports from user code - Set default drop_mode to DropMode.NONE in get_markdown and get_content_list methods - Remove md_make_mode parameter from get_content_list method - Add dump_middle_json method to PipeResult - Update examples in API documentation and demo script
-
- 06 Jan, 2025 2 commits
- 05 Jan, 2025 3 commits
-
-
myhloli authored
- Add `draw_char_bbox` function to `draw_bbox.py` for drawing character bounding boxes - Integrate `draw_char_bbox` into `common.py` for use in PDF processing pipeline - Include option to draw character bounding boxes in debug mode
-
myhloli authored
style(pdf_parse_union_core_v2): remove unnecessary spaces and improve code formatting - Remove extra space in conditional statement for character spacing logic - Adjust spacing in trigonometric checks for line direction - Improve overall code readability and consistency
-
myhloli authored
- Add missing 'else' statement in OCR model selection logic - Ensure consistent formatting of 'if' statements for better readability - Remove unnecessary empty line in the 'app.py' file
-
- 03 Jan, 2025 2 commits
-
-
myhloli authored
- Remove logger.info() call for additional_ocr_params to reduce log verbosity
-
myhloli authored
- Implement ONNXModelSingleton to manage ONNX models - Modify ModifiedPaddleOCR to use ONNX models on ARM CPUs without CUDA - Update RapidTableModel to use RapidOCR with ONNXRuntime on CPU - Add rapidocr_onnxruntime dependency in setup.py
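A singleton that caches loaded models, as this commit describes, might look like the sketch below. The class name `ModelSingleton`, the `get_model` method, and the `loader` callable are hypothetical; the real `ONNXModelSingleton` may be organized differently, and no ONNX Runtime calls are shown here.

```python
class ModelSingleton:
    """Process-wide cache so each model is loaded at most once (illustrative)."""

    _instance = None
    _models = {}

    def __new__(cls):
        # Always hand back the same instance.
        if cls._instance is None:
            cls._instance = super().__new__(cls)
        return cls._instance

    def get_model(self, key, loader):
        # `loader` is any zero-argument callable that builds the model;
        # it runs only the first time `key` is requested.
        if key not in self._models:
            self._models[key] = loader()
        return self._models[key]


calls = []

def fake_loader():
    # Stand-in for an expensive model load.
    calls.append(1)
    return object()

first = ModelSingleton().get_model("ocr-det", fake_loader)
second = ModelSingleton().get_model("ocr-det", fake_loader)
print(first is second, len(calls))
```

Keying the cache lets one singleton hold several models (for example detection and recognition) without reloading any of them.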
-
- 02 Jan, 2025 2 commits
-
-
myhloli authored
- Update the logic for inserting spaces between characters - Consider the next character's position instead of the previous one - Adjust the spacing threshold to 25% of the average character width - Ignore spaces at the end of lines to prevent double spaces
-
myhloli authored
- Update the logic for inserting spaces between characters - Consider the next character's position instead of the previous one - Adjust the spacing threshold to 25% of the average character width - Ignore spaces at the end of lines to prevent double spaces
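The spacing rule in this commit message can be sketched as follows. The function name `join_chars`, the character layout (dicts with `c` and a `bbox` of `[x0, y0, x1, y1]`), and the exact way the gap is measured are assumptions; only the 25% threshold, the look-ahead to the next character, and the end-of-line handling come from the message.

```python
def join_chars(chars, ratio=0.25):
    """Join characters of one line, inserting a space before a distant next char.

    Illustrative only: each char is {"c": str, "bbox": [x0, y0, x1, y1]}.
    """
    chars = list(chars)
    # Ignore spaces at the end of the line.
    while chars and chars[-1]["c"].isspace():
        chars.pop()
    if not chars:
        return ""

    avg_width = sum(ch["bbox"][2] - ch["bbox"][0] for ch in chars) / len(chars)
    threshold = ratio * avg_width

    out = []
    for i, ch in enumerate(chars):
        out.append(ch["c"])
        if i + 1 < len(chars):
            nxt = chars[i + 1]
            # Gap between this char's right edge and the next char's left edge.
            gap = nxt["bbox"][0] - ch["bbox"][2]
            if gap > threshold and not ch["c"].isspace() and not nxt["c"].isspace():
                out.append(" ")
    return "".join(out)


line = [
    {"c": "a", "bbox": [0, 0, 10, 10]},
    {"c": "b", "bbox": [10.5, 0, 20.5, 10]},
    {"c": "c", "bbox": [25, 0, 35, 10]},
    {"c": " ", "bbox": [35, 0, 40, 10]},
]
print(repr(join_chars(line)))
```

Here the average width is 10, so the threshold is 2.5: the 0.5 gap between `a` and `b` adds nothing, while the 4.5 gap before `c` yields a space.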
-
- 30 Dec, 2024 2 commits
-
-
myhloli authored
- Remove use_npu variable initialization - Comment out device assignment and npu check - Comment out use_npu parameter in ModifiedPaddleOCR constructor
-
myhloli authored
- Update `clean_memory.py` to use `torch_npu.npu` instead of `torch.npu` - Update `model_utils.py` to use `torch_npu.npu` instead of `torch.npu` - Simplify NPU availability check and bfloat16 support in `pdf_parse_union_core_v2.py`
-
- 27 Dec, 2024 1 commit
-
-
icecraft authored
-
- 26 Dec, 2024 2 commits
-
-
myhloli authored
- Update clean_memory function to support both CUDA and NPU devices - Implement get_device function to centralize device selection logic - Modify model initialization and memory cleaning to use the selected device - Update RapidTableModel to support both RapidOCR and PaddleOCR engines
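Centralized device selection of the kind described here could look roughly like the sketch below. It is an assumption-laden illustration, not the project's `get_device` or `clean_memory`: the fallback order (CUDA, then NPU, then CPU) is a guess, and `torch` and `torch_npu` are treated as optional so the sketch also runs where they are not installed.

```python
import gc


def get_device():
    """Pick "cuda", "npu", or "cpu" (illustrative fallback order)."""
    try:
        import torch
    except ImportError:
        return "cpu"
    if torch.cuda.is_available():
        return "cuda"
    try:
        import torch_npu  # Ascend NPU plugin; optional
        if torch_npu.npu.is_available():
            return "npu"
    except Exception:
        # Plugin missing or failed to initialize: fall through to CPU.
        pass
    return "cpu"


def clean_memory(device):
    """Release cached accelerator memory for the selected device, then run GC."""
    if device == "cuda":
        import torch
        torch.cuda.empty_cache()
    elif device == "npu":
        import torch_npu
        torch_npu.npu.empty_cache()
    gc.collect()


device = get_device()
clean_memory(device)
print(device)
```

Routing every caller through one selection function means model initialization and memory cleanup agree on the device without each repeating the availability checks.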
-
myhloli authored
- Add NPU support for memory cleaning and model initialization - Optimize table model initialization and prediction process - Update memory utils to support NPU - Add language parameter for table model
-
- 25 Dec, 2024 2 commits
-
-
myhloli authored
- Comment out logging statements for title list, title completion, and length comparison - Improve code readability and reduce clutter by removing unused debug information
-
myhloli authored
- Implement llm_aided_title function to optimize document titles using LLM - Update pdf_parse_union_core_v2.py to include title optimization - Modify ocr_mkcontent.py to use optimized title levels - Add openai SDK dependency in setup.py
-
- 24 Dec, 2024 1 commit
-
-
myhloli authored
- Add LLM-aided formula and text correction functionality - Update config reader to include LLM-aided settings - Create new LLM-aided processing module - Update main processing script to incorporate LLM-aided corrections - Modify download scripts to check for new config version
-
- 20 Dec, 2024 1 commit
-
-
myhloli authored
- Remove remove_overlaps_chars function - Add check_chars_is_overlap_in_span function - Update span processing logic to handle character overlaps - Improve efficiency and readability of overlap detection
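One way to detect overlapping characters within a span is a pairwise bounding-box check, sketched below. The function name `chars_overlap_in_span`, the 0.5 overlap ratio, and the character layout are hypothetical; the commit does not say how `check_chars_is_overlap_in_span` is implemented.

```python
from itertools import combinations


def overlap_ratio(a, b):
    """Intersection area divided by the smaller box's area (0.0 if disjoint)."""
    w = min(a[2], b[2]) - max(a[0], b[0])
    h = min(a[3], b[3]) - max(a[1], b[1])
    if w <= 0 or h <= 0:
        return 0.0
    smaller = min((a[2] - a[0]) * (a[3] - a[1]), (b[2] - b[0]) * (b[3] - b[1]))
    return (w * h) / smaller if smaller > 0 else 0.0


def chars_overlap_in_span(chars, min_ratio=0.5):
    """Return True if any two chars overlap by more than `min_ratio` (assumed value)."""
    return any(
        overlap_ratio(x["bbox"], y["bbox"]) > min_ratio
        for x, y in combinations(chars, 2)
    )


clean = [{"c": "a", "bbox": [0, 0, 10, 10]}, {"c": "b", "bbox": [10, 0, 20, 10]}]
doubled = [{"c": "a", "bbox": [0, 0, 10, 10]}, {"c": "a", "bbox": [1, 0, 11, 10]}]
print(chars_overlap_in_span(clean), chars_overlap_in_span(doubled))
```

The pairwise scan is quadratic in the number of characters, which is acceptable for short spans; a real implementation might only compare neighbors.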
-
- 19 Dec, 2024 1 commit
-
-
myhloli authored
- Implement remove_overlaps_chars function to detect and remove overlapping characters within spans - Integrate remove_overlaps_chars function into the PDF parsing process - Improve character-level processing and reduce redundancy in OCR results
-
- 18 Dec, 2024 7 commits
-
-
pangguosheng authored
-
pangguosheng authored
-
myhloli authored
- Add automatic conversion of uploaded files to PDF - Update max page slider range and default value - Prevent interaction with PDF preview to avoid errors - Increase Markdown rendering height for better visibility - Update file change event handling for PDF conversion - Modify supported image suffixes for file upload
-
myhloli authored
- Move __use_inside_model__ and __model_mode__ from operators/__init__.py to model/__init__.py - These variables are more appropriately located in the model module since they relate to model configuration
-
icecraft authored
-
xu rui authored
-
icecraft authored
-
- 17 Dec, 2024 3 commits
- 16 Dec, 2024 1 commit
-
-
myhloli authored
- Remove YOLO_VERBOSE environment variable from multiple files - Set verbose=False in YOLOv8 prediction method to suppress logger output
-
- 13 Dec, 2024 3 commits
- 12 Dec, 2024 4 commits