- 22 May, 2025 2 commits
-
-
Xiaomeng Zhao authored
feat(ocr): add PP-OCRv5 models and update configurations
-
myhloli authored
  - Add new PP-OCRv5 detection and recognition models
  - Update arch_config.yaml with new model architectures
  - Modify models_config.yml to include PP-OCRv5 models for ch_lite configuration
  - Change dictionary file for ch_lite to ppocrv5_dict.txt
-
- 19 May, 2025 2 commits
-
-
Xiaomeng Zhao authored
fix(ocr_mkcontent): improve image handling and footnote integration in markdown output
-
myhloli authored
-
- 14 May, 2025 7 commits
-
-
Xiaomeng Zhao authored
master->dev
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.11
-
Xiaomeng Zhao authored
docs(changelog): remove pdfminer.six version pinning from release notes
-
Xiaomeng Zhao authored
docs(changelog): remove pdfminer.six version pinning from release notes
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.11
-
- 09 May, 2025 13 commits
-
-
Xiaomeng Zhao authored
fix(modeling): escape backslashes in LaTeX command descriptions
-
myhloli authored
-
myhloli authored
-
Xiaomeng Zhao authored
fix(requirements): update pdfminer.six version and restrict torch version upper limit
-
myhloli authored
-
Xiaomeng Zhao authored
fix(setup): update python_requires to support Python 3.10 to 3.13
-
myhloli authored
-
myhloli authored
-
Xiaomeng Zhao authored
fix(Dockerfile): update modelscope installation command to use mirror
-
myhloli authored
-
myhloli authored
-
Xiaomeng Zhao authored
docs(installation): update Python version and CUDA installation instructions
-
myhloli authored
-
- 08 May, 2025 5 commits
-
-
Xiaomeng Zhao authored
docs(README): reorder installation commands for clarity
-
myhloli authored
-
myhloli authored
-
Xiaomeng Zhao authored
feat(modeling): add regex patterns for LaTeX symbol replacements
-
myhloli authored
-
- 06 May, 2025 2 commits
-
-
Xiaomeng Zhao authored
feat(modeling): add 'protect' command to removal patterns
-
myhloli authored
-
- 30 Apr, 2025 5 commits
-
-
Xiaomeng Zhao authored
Update app.py: Fix parameter parsing in /file_parse endpoint
-
github-actions[bot] authored
-
Wang Yubo authored
I have updated the `/file_parse` endpoint in `app.py` to correctly handle boolean and string parameters when they are sent via `multipart/form-data` requests (commonly used for file uploads). Previously, these parameters were not being properly parsed because FastAPI expects them to be passed as query or JSON body parameters by default.

### Changes Made:
- Added `Form(...)` to all non-file parameters (`parse_method`, `is_json_md_dump`, `output_dir`, and return flags like `return_layout`, etc.).
- This ensures that FastAPI correctly reads these fields from form-data, allowing clients to send both files and structured configuration options in the same request.

### Why This Change Was Needed:
- When using `requests.post(..., data=data, files=files)`, the `data` dictionary is sent as form-encoded data.
- Without explicitly declaring these fields with `Form(...)`, FastAPI does not bind them correctly, leading to default values always being used (e.g., `False` for boolean flags).
- This change allows the API to accurately reflect the client's intent and enables features like `return_layout`, `return_images`, etc., to work as expected.

This update improves compatibility with HTTP clients that rely on standard form-based file upload mechanisms while preserving the existing behavior of the API.
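To illustrate why the flags have to be read from the form body, here is a minimal standard-library sketch of what a `multipart/form-data` upload looks like on the wire. It is not the code in `app.py`: the boundary string and the helpers `build_multipart`, `parse_form_fields`, and `to_bool` are hypothetical names invented for this example, and the truthy spellings accepted by `to_bool` are an assumption rather than FastAPI's actual coercion rules. The sketch shows only that non-file parameters travel as text parts next to the file, so a server that looks elsewhere (for example the query string) never sees them and falls back to its defaults.

```python
from email import message_from_bytes

BOUNDARY = "example-boundary"  # hypothetical boundary, chosen for this sketch


def build_multipart(fields, file_name, file_bytes):
    """Encode text fields plus one file as a multipart/form-data body."""
    parts = []
    for name, value in fields.items():
        # Each non-file parameter becomes its own plain-text part.
        parts.append(
            (
                f"--{BOUNDARY}\r\n"
                f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
                f"{value}\r\n"
            ).encode()
        )
    # The uploaded file is one more part, marked by a filename.
    parts.append(
        (
            f"--{BOUNDARY}\r\n"
            f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
            "Content-Type: application/octet-stream\r\n\r\n"
        ).encode()
        + file_bytes
        + b"\r\n"
    )
    parts.append(f"--{BOUNDARY}--\r\n".encode())
    return b"".join(parts)


def parse_form_fields(body):
    """Return the non-file fields of a multipart body as a dict of strings."""
    header = f"Content-Type: multipart/form-data; boundary={BOUNDARY}\r\n\r\n"
    message = message_from_bytes(header.encode() + body)
    fields = {}
    for part in message.get_payload():
        if part.get_filename() is None:  # skip the file part
            name = part.get_param("name", header="content-disposition")
            fields[name] = part.get_payload(decode=True).decode()
    return fields


def to_bool(text):
    """Coerce a form string to bool (assumed truthy spellings, see lead-in)."""
    return text.strip().lower() in {"true", "1", "yes", "on"}


body = build_multipart(
    {"parse_method": "auto", "return_layout": "true"},
    "demo.pdf",
    b"%PDF-1.4",
)
fields = parse_form_fields(body)
print(fields["parse_method"], to_bool(fields["return_layout"]))
```

Every value recovered from the body is a string, which is why boolean flags such as `return_layout` need both an explicit form-field declaration and a string-to-bool conversion on the server side.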
-
Xiaomeng Zhao authored
feat(model): add logging for batch image processing
-
myhloli authored
  - Add logger info for each batch processed
  - Include batch number and page count in log message
-
- 29 Apr, 2025 4 commits
-
-
Xiaomeng Zhao authored
update version
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.10
-
Xiaomeng Zhao authored
feat(model_utils): adjust table detection threshold and add features
-