1. 03 Sep, 2024 1 commit
  2. 02 Sep, 2024 1 commit
  3. 20 Aug, 2024 1 commit
  4. 09 Aug, 2024 2 commits
  5. 07 Aug, 2024 1 commit
  6. 06 Aug, 2024 1 commit
  7. 05 Aug, 2024 1 commit
  8. 04 Aug, 2024 1 commit
    • fix(pdf-extract): ensure table recognition config defaults to disabled · 52156eae
      myhloli authored
      If 'table-config' is not present in the configuration file, the table recognition
      feature will default to being disabled to ensure consistent behavior. This change
      adds a warning log and sets a default configuration for table recognition when the
      expected config is missing.
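
The fallback described in this commit message can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function name `get_table_config`, the `DEFAULT_TABLE_CONFIG` constant, and the `enabled` key are all hypothetical; only the `'table-config'` key comes from the commit message.

```python
import logging

logger = logging.getLogger(__name__)

# Hypothetical default: the commit only says recognition defaults to disabled;
# the real default structure and key names are not shown in the commit message.
DEFAULT_TABLE_CONFIG = {"enabled": False}


def get_table_config(config: dict) -> dict:
    """Return the table recognition settings, falling back to a disabled default."""
    table_config = config.get("table-config")
    if table_config is None:
        # Missing key: warn and return a copy of the disabled default
        logger.warning(
            "'table-config' not found in configuration; "
            "table recognition is disabled by default"
        )
        return dict(DEFAULT_TABLE_CONFIG)
    return table_config
```

Returning a copy of the default keeps later callers from mutating the shared constant.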
  9. 02 Aug, 2024 1 commit
    • feat(model inference): add table recognition and conversion to LaTeX (#284) · 37925f36
      Kaiwen Liu authored
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/2024
      - Support table recognition. Table images will be converted into HTML.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/2024
      - Support table recognition. Table images will be converted into LaTeX.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      
      * # feat(model inference): add table recognition and conversion to LaTeX
      
      # What's Changed
      
      ### New Features
      
      - Add table content recognition, using the [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) weights to convert table images to LaTeX.
      
      ### Instruction
      
      - pip install pypandoc struct-eqtable==0.1.0
      - Download the [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec) and put them under the models/ directory.
      - Set the 'table-mode' value to turn on table recognition, which is off by default.
      - If you have not downloaded any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md).
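
The config step above could be scripted along these lines. This is a sketch under assumptions: the helper name `enable_table_mode` is hypothetical, the location of magic-pdf.json is passed in by the caller because the commit does not state it, and 'table-mode' is written as a JSON boolean, which is a guess at the expected value type.

```python
import json
from pathlib import Path


def enable_table_mode(config_path):
    """Set 'table-mode' to true in the given magic-pdf.json and return the new config."""
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))
    # Assumption: a JSON boolean is accepted; the commit only says to set it to 'true'
    config["table-mode"] = True
    path.write_text(
        json.dumps(config, indent=4, ensure_ascii=False), encoding="utf-8"
    )
    return config
```

All other keys in the file are preserved; only 'table-mode' is added or overwritten.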
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      ---------
      Co-authored-by: liukaiwen <liukaiwen@pjlab.org.cn>
  10. 01 Aug, 2024 2 commits
  11. 31 Jul, 2024 2 commits
  12. 24 Jul, 2024 3 commits
  13. 23 Jul, 2024 2 commits
  14. 19 Jul, 2024 1 commit
  15. 13 Jul, 2024 1 commit
  16. 12 Jul, 2024 2 commits
  17. 08 Jul, 2024 1 commit
  18. 07 Jul, 2024 1 commit
  19. 28 Jun, 2024 3 commits
  20. 26 Jun, 2024 1 commit
  21. 25 Jun, 2024 1 commit
  22. 20 Jun, 2024 4 commits
  23. 19 Jun, 2024 3 commits
  24. 18 Jun, 2024 1 commit
  25. 17 Jun, 2024 2 commits