1. 24 Dec, 2024 1 commit
    • myhloli's avatar
      feat(llm): add LLM-aided formula and text correction · c660fdc8
      myhloli authored
      - Add LLM-aided formula and text correction functionality
      - Update config reader to include LLM-aided settings
      - Create new LLM-aided processing module
      - Update main processing script to incorporate LLM-aided corrections
      - Modify download scripts to check for new config version
      c660fdc8
  2. 11 Nov, 2024 1 commit
    • myhloli's avatar
      refactor(model download script) · 9496c6c4
      myhloli authored
      - Remove the download script from the `docs` directory and unify it in the `scripts` directory.
      - Update the JSON template URL in the script to the master branch.
      - Modify the script download link provided in the documentation.
      - Update the table recognition model in the configuration template to `rapid_table`.
      9496c6c4
  3. 09 Nov, 2024 1 commit
  4. 08 Nov, 2024 1 commit
  5. 23 Oct, 2024 2 commits
    • myhloli's avatar
      feat(model): add support for DocLayout-YOLO model · 1279f2cd
      myhloli authored
      - Add new layout model option: DocLayout-YOLO
      - Implement model initialization and prediction for DocLayout-YOLO
      - Update configuration options to include new model- Modify existing code to support both LayoutLMv3 and DocLayout-YOLO models
      - Update Gradio app to support more Custom Switch
      1279f2cd
    • myhloli's avatar
      update:update config json · 790691d6
      myhloli authored
      790691d6
  6. 09 Oct, 2024 1 commit
    • myhloli's avatar
      refactor(docs): update model download instructions and configuration process · 044e84e4
      myhloli authored
      - Simplify model download instructions, include initial download and updates
      - Recommend using Hugging Face or ModelScope for downloading models
      - Remove detailed steps for Git LFS download due to reliability issues
      - Update configuration file name from template to final version
      - Clarify location and automatic generation of the configuration file
      - Update READMEs across different platforms to reflect these changes
      044e84e4
  7. 08 Oct, 2024 1 commit
  8. 02 Sep, 2024 2 commits
  9. 02 Aug, 2024 1 commit
    • Kaiwen Liu's avatar
      feat(model inference): add table recognition and conversion to LaTeX (#284) · 37925f36
      Kaiwen Liu authored
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into html.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into LaTex.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      
      * # feat(model inference): add table recognition and convertion to LaTeX
      
      # What's Changed
      
      ### New Features
      
      - Add table content recognition, we use weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table image to LaTex.
      
      ### Instruction
      
      - pip install pypandoc struct-eqtable==0.1.0
      - Download [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec
      
      ) and put it under models/ directory.
      - Edit 'table-mode' value to turn on table recognition function which is turned off by default.
      - If you did not download any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md)。
      
      * add table recognition and convertion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      ---------
      Co-authored-by: default avatarliukaiwen <liukaiwen@pjlab.org.cn>
      37925f36
  10. 01 Aug, 2024 2 commits
  11. 31 Jul, 2024 1 commit
    • liukaiwen's avatar
      # add table recognition using struct-eqtable · b29badc1
      liukaiwen authored
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into html.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      b29badc1
  12. 12 Jul, 2024 1 commit
    • myhloli's avatar
      feat(config-reader): add models-dir and device-mode configurations · 695b3579
      myhloli authored
      Add new configuration options for custom model directories and device modeselection. This allows users to specify the directory where models are stored
      and choose between CPU and GPU modes for model inference. The configurations
      are read from a JSON file and can be easily extended to support additional
      options in the future.
      695b3579
  13. 07 Apr, 2024 1 commit