- 09 Dec, 2024 1 commit
-
-
myhloli authored
- Add environment variables to disable albumentations and yolo updates - Import torchtext and disable deprecation warnings - Update unimernet to 0.2.2 - Specify ultralytics version as >=8.3.48 - Remove upper version limit for torch
-
- 13 Nov, 2024 1 commit
-
-
myhloli authored
- Update the URL for downloading the model setup script in Dockerfile - Upgrade struct-eqtable to version 0.3.2 and remove pypandoc - Add new dependencies: einops, accelerate, doclayout_yolo, rapidocr-paddle, and rapid_table
-
- 23 Oct, 2024 1 commit
-
-
myhloli authored
- Add doclayout-yolo==0.0.2 to requirements-docker.txt
-
- 17 Oct, 2024 1 commit
-
-
myhloli authored
- Remove unused functions such as split_long_words, ocr_mk_mm_markdown_with_para, etc. - Simplify ocr_mk_markdown_with_para_core_v2 by removing unnecessary language detection and word splitting logic- Remove wordninja dependency from requirements - Update ocr_model_init to include additional parameters for OCR model configuration
-
- 10 Sep, 2024 2 commits
-
-
Xiaomeng Zhao authored
update UniMERNet to 0.2.1
-
myhloli authored
Update the paths to model weights and configuration files for the UniMERNet architecture in both the demo.yaml and model_configs.yaml files. Adjust the mfr_model_init function toreflect the new weight and configuration paths. The changes include specifying more detailed paths to the unimernet_base directory and changing the weight file extension to .pth.
-
- 30 Aug, 2024 1 commit
-
-
icecraft authored
* Create requirements-docker.txt * feat: update deps to support rag * feat: add support to rag, add rag_data_reader api for rag integration * feat: let user retrieve the filename of the processed file * feat: add projects demo for rag integrations --------- Co-authored-by:
Xiaomeng Zhao <moe@myhloli.com> Co-authored-by:
icecraft <xurui1@pjlab.org.cn>
-
- 22 Aug, 2024 1 commit
-
-
Xiaomeng Zhao authored
* build(docker): update base image to Ubuntu 22.04 and install PaddlePaddleUpgrade the Docker base image from ubuntu:latest to ubuntu:22.04 for improved performance and stability. Additionally, integrate PaddlePaddle GPU version 3.0.0b1 into the Docker build for enhanced AI capabilities. The MinIO configuration file has also been updated to the latest version. * build(dockerfile): Updated the Dockerfile * build(Dockerfile): update Dockerfile * docs(docker): add instructions for quick deployment with Docker Include Docker-based deployment instructions in the README for both English and Chinese locales. This update provides users a quick-start guide to using Docker for deployment, with notes on GPU VRAM requirements and default acceleration features. * build(docker): Layer the installation of dependencies, downloading the model, and the setup of the program itself. * build(docker): Layer the installation of dependencies, downloading the model, and the setup of the program itself.
-
- 05 Aug, 2024 1 commit
-
-
liukaiwen authored
-
- 04 Aug, 2024 1 commit
-
-
myhloli authored
fix(dependencies): remove unnecessary pypandoc and struct-eqtable packages;fix matplotlib>=3.9.1 not support Windows system without compilation environment.
-
- 02 Aug, 2024 1 commit
-
-
Kaiwen Liu authored
* # add table recognition using struct-eqtable ## Changelog 31/07/20204 - Support table recognition. Table images will be converted into html. ### how to use the new feature: set the attribute 'table-mode' to 'true' in magic-pdf.json ### caution: it takes 200s to 500s to convert a single table image using cpu * # add table recognition using struct-eqtable ## Changelog 31/07/20204 - Support table recognition. Table images will be converted into LaTex. ### how to use the new feature: set the attribute 'table-mode' to 'true' in magic-pdf.json ### caution: it takes 200s to 500s to convert a single table image using cpu * # feat(model inference): add table recognition and convertion to LaTeX # What's Changed ### New Features - Add table content recognition, we use weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table image to LaTex. ### Instruction - pip install pypandoc struct-eqtable==0.1.0 - Download [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec ) and put it under models/ directory. - Edit 'table-mode' value to turn on table recognition function which is turned off by default. - If you did not download any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md)。 * add table recognition and convertion to LaTeX * add table recognition and conversion to LaTeX * add table recognition and conversion to LaTeX * add table recognition and conversion to LaTeX --------- Co-authored-by:
liukaiwen <liukaiwen@pjlab.org.cn>
-
- 01 Aug, 2024 1 commit
-
-
liukaiwen authored
# What's Changed ### New Features - Add table content recognition, we use weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table image to LaTex. ### Instruction - pip install pypandoc struct-eqtable==0.1.0 - Download [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec) and put it under models/ directory. - Edit 'table-mode' value to turn on table recognition function which is turned off by default. - If you did not download any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md)。
-
- 28 Jul, 2024 2 commits
- 23 Jul, 2024 1 commit
-
-
myhloli authored
-
- 10 Jul, 2024 1 commit
-
-
zhaoxiaomeng authored
-
- 08 Jul, 2024 3 commits
- 07 Jul, 2024 1 commit
-
-
myhloli authored
-
- 28 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 27 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 26 Jun, 2024 1 commit
-
-
quyuan authored
-
- 25 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 19 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 18 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 17 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 14 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 13 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 12 Jun, 2024 1 commit
-
-
赵小蒙 authored
-
- 27 May, 2024 1 commit
-
-
赵小蒙 authored
-
- 24 May, 2024 1 commit
-
-
赵小蒙 authored
-
- 11 May, 2024 1 commit
-
-
赵小蒙 authored
-
- 26 Apr, 2024 2 commits
- 25 Apr, 2024 1 commit
-
-
quyuan authored
-
- 15 Apr, 2024 1 commit
-
-
赵小蒙 authored
-
- 11 Apr, 2024 2 commits
- 08 Apr, 2024 1 commit
-
-
Shuimo authored
-