1. 02 Aug, 2024 3 commits
    • myhloli's avatar
      docs: update model download instructions and CUDA acceleration setup · 3ef4d054
      myhloli authored
      Update the documentation to reflect the latest model download procedures, emphasis on
      model file integrity checks, and expanded instructions for setting up CUDA accelerationon Ubuntu and Windows environments. The README files for various OS have been
      enhanced with additional details to assist users in configuring and verifying their
      environments for optimal performance.
      3ef4d054
    • Kaiwen Liu's avatar
      feat(model inference): add table recognition and conversion to LaTeX (#284) · 37925f36
      Kaiwen Liu authored
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into html.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into LaTex.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      
      * # feat(model inference): add table recognition and convertion to LaTeX
      
      # What's Changed
      
      ### New Features
      
      - Add table content recognition, we use weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table image to LaTex.
      
      ### Instruction
      
      - pip install pypandoc struct-eqtable==0.1.0
      - Download [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec
      
      ) and put it under models/ directory.
      - Edit 'table-mode' value to turn on table recognition function which is turned off by default.
      - If you did not download any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md)。
      
      * add table recognition and convertion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      ---------
      Co-authored-by: default avatarliukaiwen <liukaiwen@pjlab.org.cn>
      37925f36
    • myhloli's avatar
      docs(output-file): correct poly coordinate format and update table descriptions · 41737adf
      myhloli authored
      - Fix the description of the 'poly' coordinate format in the output file documentation to correctly reflect the order of coordinates: left-top, right-top, right-bottom,
        left-bottom.
      - Update various table-related descriptions for clarity and consistency, including
        field names and their corresponding explanations.
      - Add version name field description in 'middle.json' structure to document the
        version of the magic-pdf used in the parsing process.
      - Refactor the block and line description tables to improve readability and alignment
        with the rest of the documentation.
      41737adf
  2. 01 Aug, 2024 8 commits
    • myhloli's avatar
      docs: update README for Ubuntu CUDA Acceleration · 15125623
      myhloli authored
      - Adjust command installation format for PaddlePaddle GPU.
      - Clarify instruction numbering for testing OCR acceleration.
      15125623
    • myhloli's avatar
      docs(zh_CN): update Ubuntu CUDA setup guide for accuracy · a09291ad
      myhloli authored
      Update the Ubuntu CUDA Acceleration setup guide to reflect the correct user directory
      path and improve the clarity of instructions. Remove references to Windows and macOS
      as they are out of scope for this document. Ensure the configuration file copying
      command is correctly represented for Linux users.
      a09291ad
    • myhloli's avatar
      fix(docs): correct link to magic-pdf.template.json in README · 51a0bf4a
      myhloli authored
      Update the link to the magic-pdf.template.json configuration template file in the
      README_Ubuntu_CUDA_Acceleration_zh_CN.md document. The file path was previously
      incorrect and has been amended to point to the correct location.
      51a0bf4a
    • myhloli's avatar
      docs(magic-pdf): update model directory reference in configuration · 866e47a0
      myhloli authored
      Update the instruction in README_Ubuntu_CUDA_Acceleration_zh_CN.md to reference
      the correct section number for downloading the model weights. This change ensures
      that users are directed to the correct location in the document for setting up the
      model directory in the magic-pdf.json configuration.
      866e47a0
    • myhloli's avatar
      docs: update Ubuntu CUDA acceleration guide for version 0.6.2- Add steps for... · fc18a5cf
      myhloli authored
      docs: update Ubuntu CUDA acceleration guide for version 0.6.2- Add steps for Ubuntu 22.04 LTS installation.
      - Detail the process of checking, installing, and configuring NVIDIA drivers.
      - Include instructions for installing Anaconda and creating a specific environment.
      - Provide guidance on installing magic-pdf and its dependencies.
      - Add a note to verify magic-pdf version and report issues if necessary.
      - Describe the process of downloading models and configuring the application.
      - Include a sample command to run the application with CUDA acceleration.
      - Add a note for enabling OCR CUDA acceleration with specific GPU requirements.
      
      This update ensures users have the latest information for setting up CUDA accelerationwith magic-pdf on Ubuntu 22.04 LTS, specifically for version 0.6.2, and provides clearer
      instructions on the installation and configuration process.
      fc18a5cf
    • myhloli's avatar
      docs: restructure download guide and add ModelScope options · b4b2a099
      myhloli authored
      Restructured the how-to download models document for better clarity and
      added sections on downloading models from ModelScope, including SDK and
      Git download methods. Provided detailed steps for installing Git LFS and
      checking model integrity after download. Also included recommendations
      for moving the models to an SSD for better performance.
      b4b2a099
    • icecraft's avatar
      Feat/impl cli (#264) · 40e0827e
      icecraft authored
      
      
      * feat: refractor cli command
      
      * feat: add docs to describe the output files of cli
      
      * feat: resove review comments
      
      * feat: updat docs about middle.json
      
      ---------
      Co-authored-by: default avatarshenguanlin <shenguanlin@pjlab.org.cn>
      40e0827e
    • Richard Li's avatar
      Update how_to_download_models_en.md · c30a1abd
      Richard Li authored
      c30a1abd
  3. 31 Jul, 2024 2 commits
  4. 30 Jul, 2024 1 commit
  5. 29 Jul, 2024 1 commit
  6. 26 Jul, 2024 2 commits
  7. 24 Jul, 2024 2 commits
  8. 23 Jul, 2024 2 commits
  9. 19 Jul, 2024 1 commit
  10. 17 Jul, 2024 5 commits
  11. 15 Jul, 2024 1 commit
  12. 13 Jul, 2024 1 commit
  13. 12 Jul, 2024 2 commits
  14. 05 Jul, 2024 1 commit
  15. 28 Jun, 2024 1 commit
  16. 27 Jun, 2024 4 commits
  17. 26 Jun, 2024 2 commits