1. 24 Oct, 2024 1 commit
  2. 23 Oct, 2024 1 commit
    • myhloli's avatar
      feat(model): add support for DocLayout-YOLO model · 1279f2cd
      myhloli authored
      - Add new layout model option: DocLayout-YOLO
      - Implement model initialization and prediction for DocLayout-YOLO
      - Update configuration options to include new model- Modify existing code to support both LayoutLMv3 and DocLayout-YOLO models
      - Update Gradio app to support more Custom Switch
      1279f2cd
  3. 08 Oct, 2024 1 commit
  4. 05 Aug, 2024 1 commit
  5. 04 Aug, 2024 1 commit
    • myhloli's avatar
      fix(pdf-extract): ensure table recognition config defaults to disabled · 52156eae
      myhloli authored
      If 'table-config' is not present in the configuration file, the table recognition
      feature will default to being disabled to ensure consistent behavior. This change
      adds a warning log and sets a default configuration for table recognition when the
      expected config is missing.
      52156eae
  6. 02 Aug, 2024 1 commit
    • Kaiwen Liu's avatar
      feat(model inference): add table recognition and conversion to LaTeX (#284) · 37925f36
      Kaiwen Liu authored
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into html.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into LaTex.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      
      * # feat(model inference): add table recognition and convertion to LaTeX
      
      # What's Changed
      
      ### New Features
      
      - Add table content recognition, we use weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table image to LaTex.
      
      ### Instruction
      
      - pip install pypandoc struct-eqtable==0.1.0
      - Download [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec
      
      ) and put it under models/ directory.
      - Edit 'table-mode' value to turn on table recognition function which is turned off by default.
      - If you did not download any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md)。
      
      * add table recognition and convertion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      ---------
      Co-authored-by: default avatarliukaiwen <liukaiwen@pjlab.org.cn>
      37925f36
  7. 01 Aug, 2024 2 commits
  8. 31 Jul, 2024 1 commit
    • liukaiwen's avatar
      # add table recognition using struct-eqtable · b29badc1
      liukaiwen authored
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into html.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      b29badc1
  9. 24 Jul, 2024 3 commits
  10. 12 Jul, 2024 1 commit
    • myhloli's avatar
      feat(config-reader): add models-dir and device-mode configurations · 695b3579
      myhloli authored
      Add new configuration options for custom model directories and device modeselection. This allows users to specify the directory where models are stored
      and choose between CPU and GPU modes for model inference. The configurations
      are read from a JSON file and can be easily extended to support additional
      options in the future.
      695b3579
  11. 15 Apr, 2024 1 commit
  12. 11 Apr, 2024 1 commit
  13. 09 Apr, 2024 2 commits
  14. 08 Apr, 2024 2 commits
  15. 07 Apr, 2024 1 commit