- 20 Mar, 2025 1 commit
-
-
myhloli authored
- Add support for Apple M1 chips (mps device) - Refactor image processing for better performance and compatibility - Update model loading and inference for various devices - Adjust batch processing and memory management
-
- 13 Feb, 2025 2 commits
- 16 Jan, 2025 1 commit
-
-
myhloli authored
- Update RapidTable dependency to version 1.0.3 - Add support for sub-models in RapidTable - Update magic-pdf configuration to include table sub-model - Modify table model initialization to support sub-models - Update table prediction logic to handle new output format
-
- 14 Jan, 2025 1 commit
-
-
myhloli authored
- Merge title blocks that are close to each other horizontally - Adjust line insertion logic for title blocks- Increase image size and decrease confidence threshold for layout detection - Update DocLayoutYOLO model weights - Refactor drawing of bounding boxes for different block types
-
- 09 Jan, 2025 1 commit
-
-
myhloli authored
- Update rapid_table dependency to version 0.3.0 in setup.py
-
- 03 Jan, 2025 1 commit
-
-
myhloli authored
- Implement ONNXModelSingleton to manage ONNX models - Modify ModifiedPaddleOCR to use ONNX models on ARM CPUs without CUDA - Update RapidTableModel to use RapidOCR with ONNXRuntime on CPU - Add rapidocr_onnxruntime dependency in setup.py
-
- 26 Dec, 2024 1 commit
-
-
myhloli authored
- Update unimernet from 0.2.2 to 0.2.3 in requirements-docker.txt and setup.py - Remove torchtext/eva-decord dependency
-
- 25 Dec, 2024 1 commit
-
-
myhloli authored
- Implement llm_aided_title function to optimize document titles using LLM - Update pdf_parse_union_core_v2.py to include title optimization - Modify ocr_mkcontent.py to use optimized title levels- Add openai SDK dependency in setup.py
-
- 11 Dec, 2024 1 commit
-
-
myhloli authored
- Specify torch==2.3.1 and torchvision==0.18.1 for Windows CUDA installation - Add torch and torchvision version constraints in setup.py: - torch>=2.2.2,<=2.3.1 - torchvision>=0.17.2,<=0.18.1 - Update installation instructions in both English and Chinese README files
-
- 09 Dec, 2024 2 commits
- 06 Dec, 2024 1 commit
-
-
myhloli authored
- Update `ultralytics` dependency to version >= 8.3.43 - This change ensures compatibility with yolov8 for formula detection
-
- 18 Nov, 2024 1 commit
-
-
myhloli authored
- Add albumentations package with version <=1.4.20 for old_linux - This version is compatible with Linux systems from 2019 and earlier - Version 1.4.21 and above introduced simsimd which is not supported on older Linux systems
-
- 15 Nov, 2024 1 commit
-
-
myhloli authored
-
- 08 Nov, 2024 2 commits
-
-
myhloli authored
- Integrate RapidOCR with RapidTable model for table recognition - Improve memory management for devices with <= 8GB VRAM - Update table recognition process to use RapidOCR for RapidTable - Add rapidocr-paddle dependency in setup.py
-
myhloli authored
- Add RapidTable model support for table recognition - Update table model configuration and initialization - Modify table recognition process to use RapidTable when specified - Add RapidTable dependency to setup.py
-
- 04 Nov, 2024 1 commit
-
-
myhloli authored
- Update StructTableModel to use the latest struct-eqtable library - Add support for HTML table extraction in PDF Extract Kit - Improve error handling and model initialization - Update dependencies in setup.py for struct-eqtable
-
- 23 Oct, 2024 1 commit
-
-
myhloli authored
- Add doclayout_yolo==0.0.2 to the list of dependencies in setup.py
-
- 10 Sep, 2024 2 commits
-
-
Xiaomeng Zhao authored
update UniMERNet to 0.2.1
-
myhloli authored
Update the paths to model weights and configuration files for the UniMERNet architecture in both the demo.yaml and model_configs.yaml files. Adjust the mfr_model_init function toreflect the new weight and configuration paths. The changes include specifying more detailed paths to the unimernet_base directory and changing the weight file extension to .pth.
-
- 04 Aug, 2024 2 commits
-
-
myhloli authored
The restriction on the matplotlib version has been updated to only apply on Windows platforms, where precompiled packages are not available starting from version 3.9.1. This change enables users on Linux and macOS to install newer versions of matplotlib, addressing compatibility issues with recent bug fixes.
-
myhloli authored
fix(dependencies): remove unnecessary pypandoc and struct-eqtable packages;fix matplotlib>=3.9.1 not support Windows system without compilation environment.
-
- 01 Aug, 2024 1 commit
-
-
icecraft authored
* feat: refractor cli command * feat: add docs to describe the output files of cli * feat: resove review comments * feat: updat docs about middle.json --------- Co-authored-by:shenguanlin <shenguanlin@pjlab.org.cn>
-
- 30 Jul, 2024 1 commit
-
-
myhloli authored
-
- 28 Jul, 2024 1 commit
-
-
myhloli authored
-
- 23 Jul, 2024 1 commit
-
-
myhloli authored
Refactor the `extras_require` section in `setup.py` to simplify and clarify the available options. Consolidate CPU and GPU requirements into single "lite" and "full" options to streamline installation for users.
-
- 12 Jul, 2024 2 commits
- 11 Jul, 2024 1 commit
-
-
myhloli authored
Introduce a new feature that allows users to choose between a "lite" and a "full" model mode for PDF document analysis. The "lite" mode uses a faster, less accurate model, while the "full" mode employs a higher-precision model at the cost of speed. This selection can be made through the CLI or API, providing flexibility for different use cases.
-
- 08 Jul, 2024 1 commit
-
-
myhloli authored
-
- 25 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 20 Jun, 2024 2 commits
- 18 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 05 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 04 Jun, 2024 2 commits
- 03 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 30 May, 2024 1 commit
-
-
赵小蒙 authored
-