- 04 Jul, 2025 5 commits
-
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
-
Xiaomeng Zhao authored
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
-
Xiaomeng Zhao authored
-
myhloli authored
-
- 30 Jun, 2025 1 commit
-
-
myhloli authored
-
- 18 Jun, 2025 3 commits
- 13 Jun, 2025 3 commits
- 11 Jun, 2025 2 commits
- 10 Jun, 2025 2 commits
- 09 Jun, 2025 1 commit
-
-
AdrianWang authored
Update the project version number to 1.0.0 and update the installation command in the README to reflect the new version.
-
- 05 Jun, 2025 1 commit
-
-
AdrianWang authored
-
- 27 May, 2025 1 commit
-
-
myhloli authored
-
- 08 May, 2025 1 commit
-
-
myhloli authored
-
- 30 Apr, 2025 1 commit
-
-
Wang Yubo authored
I have updated the `/file_parse` endpoint in `app.py` to correctly handle boolean and string parameters when they are sent via `multipart/form-data` requests (commonly used for file uploads). Previously, these parameters were not being properly parsed because FastAPI expects them to be passed as query or JSON body parameters by default.

### Changes Made:
- Added `Form(...)` to all non-file parameters (`parse_method`, `is_json_md_dump`, `output_dir`, and return flags like `return_layout`, etc.).
- This ensures that FastAPI correctly reads these fields from form-data, allowing clients to send both files and structured configuration options in the same request.

### Why This Change Was Needed:
- When using `requests.post(..., data=data, files=files)`, the `data` dictionary is sent as form-encoded data.
- Without explicitly declaring these fields with `Form(...)`, FastAPI does not bind them correctly, leading to default values always being used (e.g., `False` for boolean flags).
- This change allows the API to accurately reflect the client's intent and enables features like `return_layout`, `return_images`, etc., to work as expected.

This update improves compatibility with HTTP clients that rely on standard form-based file upload mechanisms while preserving the existing behavior of the API.
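The reason the fields have to be read from the request body can be illustrated with a small standard-library sketch. It builds a `multipart/form-data` body and shows that the configuration fields travel inside the body while the URL query string stays empty. The helpers `encode_multipart` and `parse_form_fields`, the URL, the file name, and the value `"auto"` are hypothetical and exist only for illustration; this is neither the project's code nor how FastAPI parses requests internally.

```python
import uuid
from email.parser import BytesParser
from urllib.parse import parse_qs, urlsplit


def encode_multipart(fields, files):
    """Build a multipart/form-data body, roughly as an HTTP client would."""
    boundary = uuid.uuid4().hex
    chunks = []
    for name, value in fields.items():
        chunks += [
            f"--{boundary}".encode(),
            f'Content-Disposition: form-data; name="{name}"'.encode(),
            b"",
            str(value).encode(),  # every form value travels as text
        ]
    for name, (filename, content) in files.items():
        chunks += [
            f"--{boundary}".encode(),
            f'Content-Disposition: form-data; name="{name}"; '
            f'filename="{filename}"'.encode(),
            b"Content-Type: application/octet-stream",
            b"",
            content,
        ]
    chunks += [f"--{boundary}--".encode(), b""]
    return b"\r\n".join(chunks), f"multipart/form-data; boundary={boundary}"


def parse_form_fields(body, content_type):
    """Return the non-file parts of a multipart body as a name -> text dict."""
    header = f"Content-Type: {content_type}\r\n\r\n".encode()
    message = BytesParser().parsebytes(header + body)
    fields = {}
    for part in message.get_payload():
        if part.get_filename() is None:  # skip uploaded files
            name = part.get_param("name", header="content-disposition")
            fields[name] = part.get_payload(decode=True).decode()
    return fields


body, content_type = encode_multipart(
    {"parse_method": "auto", "return_layout": True},  # illustrative values
    {"file": ("demo.pdf", b"%PDF-1.4 placeholder")},
)
url = "http://localhost:8888/file_parse"  # hypothetical address
query_fields = parse_qs(urlsplit(url).query)  # nothing arrives in the query
form_fields = parse_form_fields(body, content_type)  # fields are in the body
```

Here `query_fields` is empty and `form_fields` holds `parse_method` and `return_layout` as strings, so a handler that only looks at query parameters falls back to its defaults. According to the commit message, declaring the parameters with `Form(...)` is what makes FastAPI read them from the body instead.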
-
- 28 Apr, 2025 1 commit
-
-
myhloli authored
- Add support for \(\) and \[\] delimiters in addition to $$ and $$
- Make LaTeX delimiter configuration more flexible and user-defined
- Update configuration file to include LaTeX delimiter settings
- Modify OCR content generation to use configurable delimiters
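A minimal sketch of what configurable delimiters could look like. The dictionary layout, the key names (`display`, `inline`, `left`, `right`), the default values, and the function `wrap_latex` are all assumptions made for illustration, not the project's actual configuration schema or code.

```python
# Hypothetical delimiter settings; the real configuration keys may differ.
DEFAULT_DELIMITERS = {
    "display": {"left": "$$", "right": "$$"},
    "inline": {"left": "$", "right": "$"},
}

# A user-defined alternative using the \(\) and \[\] style.
BACKSLASH_DELIMITERS = {
    "display": {"left": "\\[", "right": "\\]"},
    "inline": {"left": "\\(", "right": "\\)"},
}


def wrap_latex(latex, kind, delimiters=DEFAULT_DELIMITERS):
    """Wrap a LaTeX string in the configured delimiters for 'inline' or 'display'."""
    pair = delimiters[kind]
    return f"{pair['left']}{latex}{pair['right']}"


inline_default = wrap_latex("x^2", "inline")
inline_custom = wrap_latex("x^2", "inline", BACKSLASH_DELIMITERS)
display_custom = wrap_latex("E = mc^2", "display", BACKSLASH_DELIMITERS)
```

Keeping the delimiters in data rather than hard-coding them in the content generator means a different Markdown renderer can be supported by editing configuration only.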
-
- 23 Apr, 2025 1 commit
-
-
myhloli authored
- Add new Chinese OCR model (ch_PP-OCRv4_rec_server_doc_infer) for server-side use
- Update language support in app.py to include new Chinese model
- Modify models_config.yml to add new model configuration
-
- 21 Apr, 2025 1 commit
-
-
myhloli authored
- Add '.jpeg' to the list of supported image extensions in app.py and read_api.py
- Update projects READMEs to indicate that web_demo is deprecated
-
- 10 Apr, 2025 1 commit
-
-
myhloli authored
- Remove unused 'layoutlmv3' model option
- Update language options to include new 'add_lang' list
- Set default language to 'ch' (Chinese)
- Comment out old 'all_lang' definition for future reference
-
- 03 Apr, 2025 1 commit
-
-
myhloli authored
- Comment out PaddlePaddle GPU installation in Dockerfile
- Add OCR model download URL in download_models.py
- Update config version in magic-pdf.json
- Remove outdated information and simplify README.md
- Remove volume creation for PaddleOCR models in Dockerfile
-
- 21 Mar, 2025 1 commit
-
-
myhloli authored
- Comment out LayoutLMv3, TableMaster, and StructEqTable models
- Update MFR model path to unimernet_hf_small_2503
- Remove unused import in Unimernet.py
-
- 14 Mar, 2025 2 commits
-
-
JesseChen1031 authored
-
JesseChen1031 authored
-
- 22 Feb, 2025 1 commit
-
-
sayThQ199 authored
Check whether file names with a .pdf extension may contain special characters or Chinese characters.
-
- 14 Feb, 2025 8 commits
- 22 Jan, 2025 1 commit
-
-
myhloli authored
- Change MinerU homepage link from 'https://mineru.org.cn/home?source=online' to 'https://mineru.net/home?source=online'
- Change MinerU client download link from 'https://mineru.org.cn/client?source=online' to 'https://mineru.net/client?source=online'
-
- 21 Jan, 2025 1 commit
-
-
myhloli authored
- Update model path from 'unimernet_small' to 'unimernet_small_2501' in multiple scripts and configuration files
- This change affects download_models.py, download_models_hf.py, and model_configs.yaml
-