1. 08 Apr, 2025 1 commit
  2. 07 Apr, 2025 1 commit
  3. 02 Apr, 2025 2 commits
    • myhloli's avatar
      refactor(demo): simplify batch_demo.py and update demo.py · b0e220c5
      myhloli authored
      - Remove unnecessary imports and code in batch_demo.py
      - Update demo.py to use relative paths and improve code structure
      - Adjust output directory structure in both scripts
      - Remove redundant code and simplify functions
      b0e220c5
    • myhloli's avatar
      docs(gpu): update CUDA acceleration documentation · a778645b
      myhloli authored
      - Update CUDA version requirements to12.4
      - Recommend nvidia-driver-570-server for Ubuntu
      - Remove Python version specification for conda environment
      - Update magic-pdf version requirement to 1.3.0
      - Simplify CUDA acceleration testing instructions
      - Remove OCR acceleration with paddlepaddle-gpu
      - Update torch and torchvision installation instructions for Windows
      a778645b
  4. 18 Feb, 2025 1 commit
  5. 11 Dec, 2024 1 commit
    • myhloli's avatar
      build(deps): update torch and torchvision version requirements · 9a96362d
      myhloli authored
      - Specify torch==2.3.1 and torchvision==0.18.1 for Windows CUDA installation
      - Add torch and torchvision version constraints in setup.py:
        - torch>=2.2.2,<=2.3.1
        - torchvision>=0.17.2,<=0.18.1
      - Update installation instructions in both English and Chinese README files
      9a96362d
  6. 09 Dec, 2024 1 commit
    • myhloli's avatar
      docs(windows): update CUDA installation guide · ede7d361
      myhloli authored
      - Remove specific version requirements for torch and torchvision
      - Simplify installation command in both English and Chinese guides
      - Delete important note about version compatibility
      ede7d361
  7. 08 Nov, 2024 2 commits
  8. 06 Nov, 2024 1 commit
    • myhloli's avatar
      docs: enhance documentation with important notices and tips · 928d123c
      myhloli authored
      - Add important notice about git lfs download issues in model download docs
      - Include warning about 0.9.x version changes in model update section
      - Add tip for finding user directory in config file location
      - Improve readability of TODO list in README files
      - Standardize important notices and tips across multiple language versions
      928d123c
  9. 02 Nov, 2024 1 commit
  10. 29 Oct, 2024 1 commit
    • myhloli's avatar
      (docs&build): switch to Aliyun PyPI mirror · 4c412b28
      myhloli authored
      - Update PyPI mirror from Tsinghua to Aliyun in multiple Dockerfiles and installation scripts
      - This change may improve package download speed and reliability for users in China
      4c412b28
  11. 28 Oct, 2024 1 commit
  12. 27 Oct, 2024 1 commit
  13. 14 Oct, 2024 1 commit
  14. 09 Oct, 2024 1 commit
    • myhloli's avatar
      refactor(docs): update model download instructions and configuration process · 044e84e4
      myhloli authored
      - Simplify model download instructions, include initial download and updates
      - Recommend using Hugging Face or ModelScope for downloading models
      - Remove detailed steps for Git LFS download due to reliability issues
      - Update configuration file name from template to final version
      - Clarify location and automatic generation of the configuration file
      - Update READMEs across different platforms to reflect these changes
      044e84e4
  15. 08 Oct, 2024 1 commit
    • myhloli's avatar
      docs: update CUDA acceleration guides and README content · 2fb3869e
      myhloli authored
      - Update GPU hardware support information in README.md and README_zh-CN.md
      - Enhance CUDA acceleration guides for Ubuntu and Windows
      - Modify README_zh-CN.md to reflect changes in GPU requirements and configurations
      - Update TODO list to mark semantic reading order as completed
      2fb3869e
  16. 02 Sep, 2024 2 commits
  17. 20 Aug, 2024 1 commit
    • Xiaomeng Zhao's avatar
      fix(ocr_mkcontent): revise table caption output (#397) · dd19f59e
      Xiaomeng Zhao authored
      
      
      * fix(ocr_mkcontent): revise table caption output
      
      - Ensuring that
        table captions are properly included in the output.
      - Remove the redundant `table_caption` variable。
      
      * Update cla.yml
      
      * Update bug_report.yml
      
      * feat(cli): add debug option for detailed error handling
      
      Enable users to invoke the CLI command with a new debug flag to get detailed debugging information.
      
      * fix(pdf-extract-kit): adjust crop_paste parameters for better accuracyThe crop_paste_x and crop_paste_y values in the pdf_extract_kit.py have been modified
      to improve the accuracy and consistency of OCR processing. The new values are set to 25
      to ensure more precise image cropping and pasting which leads to better OCR recognition
      results.
      
      * Update README_zh-CN.md (#404)
      
      correct FAQ url
      
      * Update README_zh-CN.md (#404) (#409) (#410)
      
      correct FAQ url
      Co-authored-by: default avatarsfk <18810651050@163.com>
      
      * Update FAQ_zh_cn.md
      
      add new issue
      
      * Update FAQ_en_us.md
      
      * Update README_Windows_CUDA_Acceleration_zh_CN.md
      
      * Update README_zh-CN.md
      
      * @Thepathakarpit has signed the CLA in opendatalab/MinerU#418
      
      * fix(pdf-extract-kit): increase crop_paste margin for OCR processingDouble the crop_paste margin from25 to 50 to ensure better OCR accuracy and
      handling of border cases. This change will help in improving the overall quality of
      OCR'ed text by providing more context around the detected text areas.
      
      * fix(common): deep copy model list before drawing model bbox
      
      Use a deep copy of the original model list in `drow_model_bbox` to avoid potential
      modifications to the source data. This ensures the integrity of the original models
      is maintained while generating the model bounding boxes visualization.
      
      ---------
      Co-authored-by: default avatarsfk <18810651050@163.com>
      Co-authored-by: default avatardrunkpig <60862764+drunkpig@users.noreply.github.com>
      Co-authored-by: default avatargithub-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
      dd19f59e
  18. 13 Aug, 2024 1 commit
  19. 09 Aug, 2024 1 commit
  20. 08 Aug, 2024 1 commit
  21. 06 Aug, 2024 1 commit
    • myhloli's avatar
      docs: update URLs to gitee for Windows CUDA acceleration guides · d3e42e08
      myhloli authored
      Update the URLs for downloading the `magic-pdf.template.json` and `small_ocr.pdf`
      files in the Windows CUDA acceleration guides. The links now point to the giteerepository instead of GitHub, ensuring users have access to the necessary files
      from the correct source.
      d3e42e08
  22. 05 Aug, 2024 1 commit
    • myhloli's avatar
      mirror(conda): use tuna mirror for Anaconda download · 29e48c73
      myhloli authored
      Update the download links for Anaconda in both Ubuntu and Windows CUDA
      Acceleration documents to use the Tuna mirror. This change helps ensure that
      users in China have faster access to the Anaconda distribution.
      29e48c73
  23. 02 Aug, 2024 4 commits
    • myhloli's avatar
      docs: specify absolute path for model weights configuration · 9778a461
      myhloli authored
      Update the README documents to clarify that the "models-dir" in the
      configuration should be an absolute path. Also, provide additional guidance
      for Windows users on how to correctly format the path to avoid common issues
      with path escaping in JSON files.
      9778a461
    • myhloli's avatar
      docs: add wget command for Ubuntu and powershell script for Windows · 44a2dc37
      myhloli authored
      Add instructions to download the magic-pdf.template.json file using wget on
      Ubuntu and a PowerShell script on Windows in the respective README files.
      This is to facilitate the setup process by providing direct download options,
      replacing manual file transfers.
      44a2dc37
    • myhloli's avatar
      fix(docs): pin Magic-PDF version to 0.6.2b1 in install commands · a0c62b26
      myhloli authored
      Update the install commands in both Ubuntu and Windows CUDA Acceleration
      guides to specify Magic-PDF version 0.6.2b1, ensuring consistency andavoiding potential version mismatches.
      a0c62b26
    • myhloli's avatar
      docs: update model download instructions and CUDA acceleration setup · 3ef4d054
      myhloli authored
      Update the documentation to reflect the latest model download procedures, emphasis on
      model file integrity checks, and expanded instructions for setting up CUDA accelerationon Ubuntu and Windows environments. The README files for various OS have been
      enhanced with additional details to assist users in configuring and verifying their
      environments for optimal performance.
      3ef4d054
  24. 30 Jul, 2024 1 commit
  25. 01 Mar, 2024 3 commits
  26. 29 Feb, 2024 1 commit