- 24 May, 2025 1 commit
-
-
myhloli authored
-
- 23 May, 2025 2 commits
-
-
myhloli authored
-
myhloli authored
- Add PPHGNetV2_B4 backbone to the list of supported backbones - Introduce new OCR model configuration for PP-OCRv5 with PPHGNetV2_B4 - Update existing model configurations to use the new backbone - Modify RNN neck to support input with H > 1 - Adjust batch size for inference
-
- 22 May, 2025 1 commit
-
-
myhloli authored
- Add new PP-OCRv5 detection and recognition models - Update arch_config.yaml with new model architectures - Modify models_config.yml to include PP-OCRv5 models for ch_lite configuration- Change dictionary file for ch_lite to ppocrv5_dict.txt
-
- 09 May, 2025 2 commits
- 08 May, 2025 1 commit
-
-
myhloli authored
-
- 06 May, 2025 1 commit
-
-
myhloli authored
-
- 29 Apr, 2025 2 commits
-
-
myhloli authored
- Adjust the threshold for considering tables inside other tables from2 to 3 - Add support for custom formula delimiters through user configuration - Pin pdfminer.six to version 20250324 to prevent parsing failures
-
myhloli authored
- Add regex patterns for replacing LaTeX symbols \fint and \up with their Unicode equivalents
-
- 27 Apr, 2025 3 commits
-
-
myhloli authored
- Add \textunderscore to the list of LaTeX patterns - This allows the model to properly render underscore characters
-
myhloli authored
- Improve \left and \right command handling in LaTeX formulas - Enhance environment type matching for array, matrix, and other structures - Refactor code for better readability and maintainability
-
myhloli authored
- Refactor LaTeX left/right pair fixing logic for better balance - Add environment detection and correction for common math environments - Implement more robust whitespace handling and command substitution - Optimize regex patterns for improved performance and readability
-
- 25 Apr, 2025 2 commits
-
-
myhloli authored
- Add functions to fix LaTeX left and right commands - Implement brace matching and repair in LaTeX formulas - Remove unnecessary whitespace and repair LaTeX code - Replace specific LaTeX commands with appropriate alternatives - Add logging for debugging purposes
-
myhloli authored
- Add functions to fix LaTeX left and right commands - Implement brace matching and repair in LaTeX formulas - Remove unnecessary whitespace and repair LaTeX code - Replace specific LaTeX commands with appropriate alternatives - Add logging for debugging purposes
-
- 24 Apr, 2025 1 commit
-
-
myhloli authored
- Preserve "\ " sequences during whitespace removal - Add temporary substitution to prevent incorrect processing of "\ " sequences - Restore "\ " sequences after removing unnecessary whitespace
-
- 23 Apr, 2025 2 commits
-
-
myhloli authored
- Replace get_device() function call with direct 'device' variable usage - Simplify device configuration in OCR model initialization
-
myhloli authored
- Add new Chinese OCR model (ch_PP-OCRv4_rec_server_doc_infer) for server-side use - Update language support in app.py to include new Chinese model - Modify models_config.yml to add new model configuration
-
- 22 Apr, 2025 1 commit
-
-
myhloli authored
- Automatically change to ch_lite model when using CPU for Chinese OCR - This modification improves performance on CPU devices
-
- 15 Apr, 2025 1 commit
-
-
myhloli authored
- Add text region handling in get_res_list_from_layout_res function - Implement remove_overlaps_min_blocks function to handle overlapping blocks - Update OCR region handling to include text regions - Improve overlap resolution for all regions in layout results
-
- 09 Apr, 2025 5 commits
-
-
myhloli authored
- Comment out the line that updates det_count in batch_analyze.py - Add a new OCR model configuration for Chinese (ch_lite) in models_config.yml- Update the Chinese OCR model configuration to use a different recognition model
-
myhloli authored
- Simplify aspect ratio calculation using direct coordinate subtraction - Remove unnecessary list append operation - Improve code readability and performance in table rotation detection
-
myhloli authored
- Implement table orientation detection to identify if a table is in portrait mode - Add rotation logic to turn portrait tables 90 degrees clockwise before OCR - Update OCR processing to work with potentially rotated images - Improve text box analysis to determine if a table is rotated
-
myhloli authored
- Update predict_rec.py to check for NaN values in recognition results - Replace NaN scores with 0.0 to ensure stability and consistency
-
myhloli authored
- Add functions to calculate IoU, check if tables are inside each other, and merge tables - Implement table merging for high IoU tables - Add filtering to remove nested tables that don't overlap but cover a large area - Update table_res_list and layout_res to reflect these changes
-
- 08 Apr, 2025 1 commit
-
-
myhloli authored
- Import os and pathlib modules to handle file paths - Define the path to the slanet-plus model - Update RapidTableInput initialization to include the model path
-
- 07 Apr, 2025 1 commit
-
-
myhloli authored
- Refactor VRAM detection logic for better readability and efficiency - Add fallback mechanism for unknown VRAM sizes - Improve device checking in get_vram function
-
- 03 Apr, 2025 2 commits
-
-
myhloli authored
- Update table recognition logic to process each table individually - Refactor layout detection to use tqdm for progress tracking - Optimize OCR recognition by using a single tqdm wrapper - Improve MFR prediction with a more accurate progress bar - Simplify MFD prediction by removing unnecessary total calculation
-
myhloli authored
- Add tqdm progress bar to batch prediction loops in multiple model modules - Improve logging and error handling in batch analysis script - Update table model initialization to use default sub-model if none specified - Add tqdm dependency to requirements.txt
-
- 02 Apr, 2025 9 commits
-
-
myhloli authored
- Replace ch_PP-OCRv4_det_infer.pth with ch_PP-OCRv3_det_infer.pth in models_config.yml - Add new ch_PP-OCRv3_det_infer model configuration in arch_config.yaml
-
myhloli authored
- Remove unnecessary GPU checks and cuda() calls - Consolidate tensor device placement using .to(self.device) - Add warning suppression for cleaner output - Refactor conditional logic for better readability
-
myhloli authored
- Remove unnecessary imports and code in batch_demo.py - Update demo.py to use relative paths and improve code structure - Adjust output directory structure in both scripts - Remove redundant code and simplify functions
-
myhloli authored
- Update PyMuPDF to version <1.25.0 - Update pydantic to version <2.11 - Update transformers to version < 5.0.0 - Remove always_apply parameter from alb.ToGray in image processing
-
myhloli authored
- Update the default configuration path in pytorchocr_utility.py - Add required dependencies for paddleocr2pytorch in setup.py: - shapely - pyclipper - omegaconf
-
myhloli authored
- Remove unused UniMERNet and LayoutLMv3 model configurations - Update OCR model path and dictionary path for PaddleOCR - Modify README to update system requirements and installation instructions - Update setup.py to include new package data
-
myhloli authored
- Add newline at the beginning of arabic_dict.txt - Change mode of multiple dictionary files
-
myhloli authored
- Remove OCR utils, modified PaddleOCR, and StructEqTable model - Delete related import statements and model definitions - Update dependencies in setup.py to remove paddlepaddle and related OCR packages
-
myhloli authored
- Comment out print statements in base_ocr_v20.py and pytorch_paddle.py - Update table model initialization to use lang parameter instead of ocr_engine - Remove unused RapidOCR initialization in rapid_table.py
-
- 01 Apr, 2025 2 commits
-
-
myhloli authored
- Remove unused OCR dictionaries for Arabic, Belarusian, Bulgarian and Armenian languages - Update model configurations in arch_config.yaml: - Comment out 'out_channels' for various language models - Rename Arabic, Korean, Japanese, Tamil and Devanagari model configurations to use 'v3' instead of 'v4' - Delete ar_dict.txt, be_dict.txt and bg_dict.txt files - Update arabic_dict.txt to remove blank line at the start
-
myhloli authored
- Remove unused imports and code - Simplify model architecture by removing unnecessary components - Update initialization and forward pass logic - Rename variables for consistency
-