- 09 Jan, 2025 14 commits
-
-
Xiaomeng Zhao authored
feat(README): update for 1.0.0 release and add automatic language identification feature
-
myhloli authored
- Update README.md and README_zh-CN.md for 1.0.0 release - Add information about new automatic language identification feature - Include link to Ascend NPU Acceleration documentation - Highlight improvements in API interface and compatibility
-
myhloli authored
- Change fast-langdetect dependency from version 0.2.0 to >=0.2.3
-
Xiaomeng Zhao authored
Dev
-
myhloli authored
- Change the default value of the language dropdown from an empty string to 'auto' - This ensures a language is always selected by default, improving
-
myhloli authored
- Remove conditional logic for OCR engine selection - Always use RapidOCR as the OCR engine - Simplify the __init__ method by removing unused code
-
myhloli authored
- Update clear_bu.add() to exclude 'language' input
-
Xiaomeng Zhao authored
Dev
-
myhloli authored
- Remove YOLO v11 language detection model from model_configs.yaml - Update language detection utils to use a fixed model path instead of dynamic configuration - Remove unused model weight parameter for YOLO v11 language detection
-
myhloli authored
-
myhloli authored
- Implement block sorting within image and table blocks - Ensure correct order of captions and footnotes within blocks - Improve overall document structure and parsing accuracy
-
myhloli authored
- Update rapid_table dependency to version 0.3.0 in setup.py
-
Xiaomeng Zhao authored
refactor(langdetect): simplify language detection model
-
myhloli authored
- Remove LangDetectMode and related conditional logic - Use a single model weight for language detection - Add logging for language detection results - Update model initialization and prediction methods
-
- 08 Jan, 2025 9 commits
-
-
myhloli authored
- Add language detection model initialization and integration - Update model list to include language detection - Refactor language detection utils for better model management
-
myhloli authored
- Add separate models for Chinese/Japanese and English/French/German detection - Implement mode-based detection to use appropriate models for different languages - Update language detection process to use higher DPI for better accuracy - Modify model initialization and prediction logic to support new language-specific models
-
Xiaomeng Zhao authored
refactor(docs): consolidate MS Office document conversion guides
-
myhloli authored
-
Xiaomeng Zhao authored
docs/update_docs
-
icecraft authored
-
Xiaomeng Zhao authored
fix(pdf_parse): ensure block bounding boxes do not have negative values
-
myhloli authored
- Add logic to set any negative values in block['bbox'] to 0 - This prevents potential errors when processing PDF blocks
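The clamping described in this commit can be sketched as follows. This is a minimal illustration, not the project's actual code: the helper name `clamp_bbox` is hypothetical, and the only thing assumed from the commit message is that a block is a dict with a `bbox` list of coordinates.

```python
def clamp_bbox(block):
    """Return a copy of `block` whose negative bbox coordinates are set to 0.

    `clamp_bbox` is a hypothetical helper written for illustration; the
    commit only states that negative values in block['bbox'] become 0.
    """
    clamped = dict(block)  # shallow copy so the caller's dict is left untouched
    clamped["bbox"] = [max(0, value) for value in block["bbox"]]
    return clamped


if __name__ == "__main__":
    # A block whose left and bottom coordinates fell slightly outside the page.
    block = {"type": "text", "bbox": [-3.5, 12.0, 200.0, -0.1]}
    print(clamp_bbox(block)["bbox"])
```

Returning a copy rather than mutating in place is a design choice of this sketch; the commit message suggests the original change updates `block['bbox']` directly.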
-
Xiaomeng Zhao authored
add test case
-
- 07 Jan, 2025 12 commits
-
-
myhloli authored
-
myhloli authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
Xiaomeng Zhao authored
fix(clear_bu): remove unused input from clear button
-
myhloli authored
Remove 'table_enable' input from the clear button's function call. This change ensures that only necessary inputs are included in the clear operation, improving code efficiency and maintainability.
-
Xiaomeng Zhao authored
feat(api): simplify markdown and content list generation
-
myhloli authored
- Remove DropMode and MakeMode imports from user code - Set default drop_mode to DropMode.NONE in get_markdown and get_content_list methods - Remove md_make_mode parameter from get_content_list method - Add dump_middle_json method to PipeResult - Update examples in API documentation and demo script
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
-
- 06 Jan, 2025 5 commits
-
-
Xiaomeng Zhao authored
docs(Ascend): update the known issues notes
-
myhloli authored
- Revise the description of paddlepaddle using the embedded onnx model to make clear that only Chinese and English OCR is supported
-
Xiaomeng Zhao authored
docs: update README files for v0.10.0 release
-
myhloli authored
- Update README.md and README_zh-CN.md to reflect the latest features and improvements - Highlight new hybrid OCR text extraction capabilities and performance enhancements - Emphasize optimized compatibility for ARM architecture Linux systems - Mention integration with Huawei Ascend NPU acceleration
-
Xiaomeng Zhao authored
docs(ascend): update the documentation and add the environment requirements to meet before running docker
-