1. 05 Aug, 2024 1 commit
  2. 04 Aug, 2024 1 commit
    • myhloli's avatar
      fix(ocr_mkcontent): add spaces around inline equation in content · 0998d22a
      myhloli authored
      Ensure proper formatting of inline equations by adding spaces outside the equation delimitersto prevent markdown from interpreting the equation content as part of a link. This addresses
      the issue where inline OCR equations appear without the correct markdown formatting.
      0998d22a
  3. 02 Aug, 2024 1 commit
    • Kaiwen Liu's avatar
      feat(model inference): add table recognition and conversion to LaTeX (#284) · 37925f36
      Kaiwen Liu authored
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into html.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      
      * # add table recognition using struct-eqtable
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into LaTex.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      
      * # feat(model inference): add table recognition and convertion to LaTeX
      
      # What's Changed
      
      ### New Features
      
      - Add table content recognition, we use weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table image to LaTex.
      
      ### Instruction
      
      - pip install pypandoc struct-eqtable==0.1.0
      - Download [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec
      
      ) and put it under models/ directory.
      - Edit 'table-mode' value to turn on table recognition function which is turned off by default.
      - If you did not download any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md)。
      
      * add table recognition and convertion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      * add table recognition and conversion to LaTeX
      
      ---------
      Co-authored-by: default avatarliukaiwen <liukaiwen@pjlab.org.cn>
      37925f36
  4. 01 Aug, 2024 3 commits
  5. 31 Jul, 2024 2 commits
    • liukaiwen's avatar
      # add table recognition using struct-eqtable · d6c58ecc
      liukaiwen authored
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into LaTex.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      d6c58ecc
    • liukaiwen's avatar
      # add table recognition using struct-eqtable · b29badc1
      liukaiwen authored
      ## Changelog
      31/07/20204
      - Support table recognition. Table images will be converted into html.
      
      ### how to use the new feature:
      set the attribute 'table-mode' to 'true' in magic-pdf.json
      
      ### caution:
      it takes 200s to 500s to convert a single table image using cpu
      b29badc1
  6. 30 Jul, 2024 1 commit
  7. 13 Jul, 2024 1 commit
  8. 19 Jun, 2024 1 commit
  9. 30 Apr, 2024 1 commit
  10. 29 Apr, 2024 4 commits
  11. 25 Apr, 2024 2 commits
  12. 23 Apr, 2024 1 commit
  13. 22 Apr, 2024 3 commits
  14. 16 Apr, 2024 1 commit
  15. 15 Apr, 2024 2 commits
  16. 11 Apr, 2024 1 commit
  17. 10 Apr, 2024 1 commit
  18. 08 Apr, 2024 3 commits
  19. 07 Apr, 2024 1 commit
  20. 29 Mar, 2024 2 commits
  21. 26 Mar, 2024 1 commit
  22. 25 Mar, 2024 1 commit
  23. 24 Mar, 2024 2 commits
  24. 22 Mar, 2024 3 commits