  1. 02 Apr, 2025 2 commits
  2. 20 Mar, 2025 2 commits
  3. 13 Feb, 2025 1 commit
  4. 16 Jan, 2025 1 commit
    • build(docker): update rapid-table dependency · 61f75b30
      myhloli authored
      - Update rapid-table version from ==0.3.0 to >=1.0.3,<2.0.0 in multiple requirements files
      - This change affects Ascend NPU, China, and Global Docker configurations
  5. 15 Jan, 2025 1 commit
  6. 14 Jan, 2025 2 commits
    • feat(layout): improve title block handling and layout detection · c20e9a1e
      myhloli authored
      - Merge title blocks that are close to each other horizontally
      - Adjust line insertion logic for title blocks
      - Increase image size and decrease confidence threshold for layout detection
      - Update DocLayoutYOLO model weights
      - Refactor drawing of bounding boxes for different block types
    • build(deps): add upper version limit for PyMuPDF · ee9340ea
      myhloli authored
      - Set PyMuPDF version to <= 1.24.14 in all requirements files
      - Prevent potential compatibility issues with future versions
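The version pins recorded in these commits (`<= 1.24.14` for PyMuPDF, `>=1.0.3,<2.0.0` for rapid-table) can be checked by hand with a small sketch like the one below. It is a simplified illustration that handles only plain numeric versions such as `1.24.14`. It is not how pip resolves requirements, and the helper names `parse_version` and `satisfies` are hypothetical.

```python
import operator

# Comparison operators, two-character symbols first so "<=" matches before "<".
OPERATORS = [
    ("<=", operator.le),
    (">=", operator.ge),
    ("==", operator.eq),
    ("<", operator.lt),
    (">", operator.gt),
]


def parse_version(text):
    """Turn '1.24.14' into (1, 24, 14). Only plain numeric releases are handled."""
    return tuple(int(part) for part in text.strip().split("."))


def satisfies(version, spec):
    """Return True if `version` meets every comma-separated clause in `spec`."""
    candidate = parse_version(version)
    for clause in spec.split(","):
        clause = clause.strip()
        for symbol, compare in OPERATORS:
            if clause.startswith(symbol):
                bound = parse_version(clause[len(symbol):])
                # Pad with zeros so that 1.0 and 1.0.0 compare as equal.
                width = max(len(candidate), len(bound))
                left = candidate + (0,) * (width - len(candidate))
                right = bound + (0,) * (width - len(bound))
                if not compare(left, right):
                    return False
                break
        else:
            raise ValueError(f"unsupported clause: {clause!r}")
    return True


if __name__ == "__main__":
    print(satisfies("1.24.14", "<=1.24.14"))
    print(satisfies("0.3.0", ">=1.0.3,<2.0.0"))
```

Under these assumptions, the previously pinned rapid-table `0.3.0` falls outside the new range, which is why the requirements files had to change together.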
  7. 10 Jan, 2025 1 commit
  8. 06 Jan, 2025 1 commit
    • build(docker): add Dockerfiles for global and Huawei NPU setups · ad099808
      myhloli authored
      - Add Dockerfile for global setup with Ubuntu base image
      - Add Dockerfile for Huawei NPU setup with Ascend base image
      - Update requirements file structure:
        - Rename requirements-docker.txt to docker/china/requirements.txt
        - Add new requirements files for global and Huawei NPU setups
      - Install necessary packages and dependencies in both Dockerfiles
      - Set up virtual environment and install Python packages
      - Download models and configure magic-pdf for both setups
  9. 27 Dec, 2024 1 commit
  10. 26 Dec, 2024 1 commit
  11. 11 Dec, 2024 1 commit
  12. 09 Dec, 2024 1 commit
  13. 13 Nov, 2024 1 commit
  14. 23 Oct, 2024 1 commit
  15. 17 Oct, 2024 1 commit
  16. 10 Sep, 2024 2 commits
  17. 30 Aug, 2024 1 commit
    • Feat/support rag (#510) · b72d4ebd
      icecraft authored
      
      
      * Create requirements-docker.txt
      
      * feat: update deps to support rag
      
      * feat: add support to rag, add rag_data_reader api for rag integration
      
      * feat: let user retrieve the filename of the processed file
      
      * feat: add projects demo for rag integrations
      
      ---------
      Co-authored-by: Xiaomeng Zhao <moe@myhloli.com>
      Co-authored-by: icecraft <xurui1@pjlab.org.cn>
  18. 22 Aug, 2024 1 commit
    • build(docker): update docker build step (#471) · 1fc0b76d
      Xiaomeng Zhao authored
      * build(docker): update base image to Ubuntu 22.04 and install PaddlePaddle

      Upgrade the Docker base image from ubuntu:latest to ubuntu:22.04 for improved
      performance and stability.
      
      Additionally, integrate PaddlePaddle GPU version 3.0.0b1
      into the Docker build for enhanced AI capabilities. The MinIO configuration file has
      also been updated to the latest version.
      
      * build(dockerfile): Updated the Dockerfile
      
      * build(Dockerfile): update Dockerfile
      
      * docs(docker): add instructions for quick deployment with Docker
      
      Include Docker-based deployment instructions in the README for both English and
      Chinese locales. This update provides users a quick-start guide to using Docker for
      deployment, with notes on GPU VRAM requirements and default acceleration features.
      
      * build(docker): Layer the installation of dependencies, downloading the model, and the setup of the program itself.
  19. 05 Aug, 2024 1 commit
  20. 04 Aug, 2024 1 commit
  21. 02 Aug, 2024 1 commit
    • feat(model inference): add table recognition and conversion to LaTeX (#284) · 37925f36
      Kaiwen Liu authored
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/2024
      - Support table recognition. Table images will be converted into HTML.
      
      ### How to use the new feature:
      Set the attribute 'table-mode' to 'true' in magic-pdf.json.
      
      ### Caution:
      Converting a single table image on CPU takes 200 s to 500 s.
      
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/2024
      - Support table recognition. Table images will be converted into LaTeX.
      
      ### How to use the new feature:
      Set the attribute 'table-mode' to 'true' in magic-pdf.json.
      
      ### Caution:
      Converting a single table image on CPU takes 200 s to 500 s.
      
      * # feat(model inference): add table recognition and conversion to LaTeX
      
      # What's Changed
      
      ### New Features
      
      - Add table content recognition, using the weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table images to LaTeX.
      
      ### Instruction
      
      - pip install pypandoc struct-eqtable==0.1.0
      - Download the [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec) and put them under the models/ directory.
      - Set the 'table-mode' value to turn on table recognition, which is off by default.
      - If you have not downloaded any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md).
      
      * add table recognition and conversion to LaTeX
      
      ---------
      Co-authored-by: liukaiwen <liukaiwen@pjlab.org.cn>
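The commit above says table recognition is switched on by setting 'table-mode' in magic-pdf.json. A minimal sketch of doing that programmatically follows. It assumes that 'table-mode' is a top-level key, that it takes a JSON boolean, and that the caller knows where the config file lives; none of these details are confirmed by the commit message, and `enable_table_mode` is a hypothetical helper rather than part of magic-pdf.

```python
import json
from pathlib import Path


def enable_table_mode(config_path):
    """Set 'table-mode' to true in the given JSON config and return the new config.

    Assumption: 'table-mode' is a top-level boolean key in magic-pdf.json.
    """
    path = Path(config_path)
    # Load the existing settings so that unrelated keys are preserved.
    config = json.loads(path.read_text(encoding="utf-8"))
    config["table-mode"] = True
    # Write the file back with stable, human-readable formatting.
    path.write_text(
        json.dumps(config, indent=4, ensure_ascii=False) + "\n",
        encoding="utf-8",
    )
    return config
```

Given the CPU timing noted in the commit (200 s to 500 s per table image), leaving the flag off by default and enabling it only when needed seems a reasonable choice.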
  22. 01 Aug, 2024 1 commit
  23. 28 Jul, 2024 2 commits
  24. 23 Jul, 2024 1 commit
  25. 10 Jul, 2024 1 commit
  26. 08 Jul, 2024 3 commits
  27. 07 Jul, 2024 1 commit
  28. 28 Jun, 2024 1 commit
  29. 27 Jun, 2024 1 commit
  30. 26 Jun, 2024 1 commit
  31. 25 Jun, 2024 1 commit
  32. 19 Jun, 2024 1 commit
  33. 18 Jun, 2024 1 commit