- 09 Jan, 2025 25 commits
-
-
Xiaomeng Zhao authored
docs: reorganize inference acceleration sections in README
-
Xiaomeng Zhao authored
docs: reorganize inference acceleration sections in README
-
myhloli authored
Reordered sections related to inference acceleration using CUDA and CANN to provide a clearer structure in both English and Chinese README files.
-
Xiaomeng Zhao authored
fix(language): enhance language detection and text processing
-
Xiaomeng Zhao authored
fix(language): enhance language detection and text processing
-
myhloli authored
-
myhloli authored
- Improve language detection by removing newline characters from the input text - Add error handling and fallback mechanism to deal with text containing control characters
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
Dev
-
Xiaomeng Zhao authored
feat(README): update for 1.0.0 release and add automatic language identification feature
-
myhloli authored
- Update README.md and README_zh-CN.md for1.0.0 release - Add information about new automatic language identification feature - Include link to Ascend NPU Acceleration documentation - Highlight improvements in API interface and compatibility
-
myhloli authored
- Change fast-langdetect dependency from version 0.2.0 to >=0.2.3
-
Xiaomeng Zhao authored
Dev
-
myhloli authored
- Change the default value of the language dropdown from an empty string to 'auto' - This ensures a language is always selected by default, improving
-
myhloli authored
- Remove conditional logic for OCR engine selection - Always use RapidOCR as the OCR engine - Simplify the __init__ method by removing unused code
-
myhloli authored
- Update clear_bu.add() to exclude 'language' input
-
Xiaomeng Zhao authored
Dev
-
Xiaomeng Zhao authored
Dev
-
myhloli authored
- Remove YOLO v11 language detection model from model_configs.yaml - Update language detection utils to use a fixed model path instead of dynamic configuration - Remove unused model weight parameter for YOLO v11 language detection
-
myhloli authored
-
myhloli authored
- Implement block sorting within image and table blocks - Ensure correct order of captions and footnotes within blocks - Improve overall document structure and parsing accuracy
-
myhloli authored
- Update rapid_table dependency to version 0.3.0 in setup.py
-
Xiaomeng Zhao authored
refactor(langdetect): simplify language detection model
-
Xiaomeng Zhao authored
refactor(langdetect): simplify language detection model
-
myhloli authored
- Remove LangDetectMode and related conditional logic - Use a single model weight for language detection - Add logging for language detection results - Update model initialization and prediction methods
-
- 08 Jan, 2025 11 commits
-
-
myhloli authored
- Add language detection model initialization and integration - Update model list to include language detection - Refactor language detection utils for better model management
-
myhloli authored
- Add separate models for Chinese/Japanese and English/French/German detection - Implement mode-based detection to use appropriate models for different languages - Update language detection process to use higher DPI for better accuracy - Modify model initialization and prediction logic to support new language-specific models
-
Xiaomeng Zhao authored
refactor(docs): consolidate MS Office document conversion guides
-
Xiaomeng Zhao authored
refactor(docs): consolidate MS Office document conversion guides
-
myhloli authored
-
Xiaomeng Zhao authored
docs/update_docs
-
icecraft authored
-
Xiaomeng Zhao authored
fix(pdf_parse): ensure block bounding boxes do not have negative values
-
Xiaomeng Zhao authored
fix(pdf_parse): ensure block bounding boxes do not have negative values
-
myhloli authored
- Add logic to set any negative values in block['bbox'] to 0 - This prevents potential errors when processing PDF blocks
-
Xiaomeng Zhao authored
add test case
-
- 07 Jan, 2025 4 commits