- 15 Nov, 2024 3 commits
- 11 Nov, 2024 6 commits
-
-
xu rui authored
-
xu rui authored
-
xu rui authored
-
xu rui authored
-
myhloli authored
- Update table recognition model to "rapid_table" in README.md and README_zh-CN.md - Update documentation warning in README_ja-JP.md - Improve clarity of table recognition configuration options
-
myhloli authored
- Add DocLayout-YOLO repository link - Add RapidTable repository link
-
- 08 Nov, 2024 2 commits
- 06 Nov, 2024 5 commits
-
-
myhloli authored
- Change heading from "!IMPORTANT" to "!WARNING" for pre-installation notice - This update improves clarity and consistency in the documentation
-
myhloli authored
- Add important notice about git lfs download issues in model download docs - Include warning about 0.9.x version changes in model update section - Add tip for finding user directory in config file location - Improve readability of TODO list in README files - Standardize important notices and tips across multiple language versions
-
myhloli authored
- Update version number from 0.9.1 to 0.9.2 in both English and Chinese README files - Enhance documentation formatting: - Add important notices and tips using Markdown admonitions - Improve readability with better headings and lists - Update screenshots and code examples
-
myhloli authored
docs(README): update changelog for v0.9.1 release- Add entry for 0.9.1 release on 2024/11/06- Update changelog in both English (README.md) and Chinese (README_zh-CN.md) - Include integration of StructTable-InternVL2-1B model for table recognition
-
myhloli authored
- Replace the incorrect arXiv paper link with the correct one in both README.md and README_zh-CN.md - Update the badge image link from 'pdf' to 'abs' for the correct paper URL
-
- 05 Nov, 2024 1 commit
-
-
myhloli authored
- Update the Colab demo link in the README files to the new version - Add a note in the Japanese README indicating that the document is outdated
-
- 28 Oct, 2024 9 commits
-
-
myhloli authored
- Update README.md and README_zh-CN.md to include new model download instructions - Provide detailed steps on how to download models after PDF-Extract-Kit 1.0 repository change - Emphasize the need to re-download models due to repository change
-
myhloli authored
- Update README.md and README_zh-CN.md to include new model download instructions - Provide detailed steps on how to download models after PDF-Extract-Kit 1.0 repository change - Emphasize the need to re-download models due to repository change
-
myhloli authored
- Remove import and usage of StructTableModel- Add support for TableMaster model- Update table model initialization logic to support TableMaster - Log error and exit if StructEqTable is selected, as it's under upgrade - Update README files to reflect changes in table parsing capabilities
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
-
myhloli authored
- Update image path in README.md and README_zh-CN.md - Update chemical formula recognition link in README.md and README_zh-CN.md
-
myhloli authored
- Change the logo path from 'docs/images/MinerU-logo.png' to 'old_docs/images/MinerU-logo.png' in both README.md and README_zh-CN.md- This update ensures that the correct logo is displayed in the project's README files
-
myhloli authored
- Delete unnecessary empty line in the table-config JSON example- Improve readability and formatting consistency in the configuration example
-
myhloli authored
- Add changelog for v0.9.0 release with major refactoring and improvements - Update key features list to include new functionalities - Modify system requirements and hardware support information - Add section for deploying derived projects - Update known issues and TODO list
-
- 16 Oct, 2024 1 commit
-
-
myhloli authored
- Improve reading order with model-based sorting- Add list recognition within text - Implement table of contents recognition - Support table recognition - Enhance code block and geometric shape recognition - Address known issues in both English and Chinese READMEs
-
- 08 Oct, 2024 2 commits
-
-
myhloli authored
- Update GPU hardware support information in README.md and README_zh-CN.md - Enhance CUDA acceleration guides for Ubuntu and Windows - Modify README_zh-CN.md to reflect changes in GPU requirements and configurations - Update TODO list to mark semantic reading order as completed
-
myhloli authored
Added a link to the layoutreader repository in the Related Projects sections of both the README.md and README_zh-CN.md files. This addition helps to provide users with more resources and tools related to document layout analysis and processing.
-
- 30 Sep, 2024 3 commits
- 27 Sep, 2024 4 commits
-
-
sfk authored
update backlog
-
sfk authored
update backlog
-
Xiaomeng Zhao authored
-
sfk authored
update backlog
-
- 26 Sep, 2024 1 commit
-
-
unknown authored
-
- 12 Sep, 2024 1 commit
-
-
myhloli authored
-
- 10 Sep, 2024 2 commits
-
-
Xiaomeng Zhao authored
-
drunkpig authored
* release: release 0.7.1 version (#526) * Update README_zh-CN.md (#404) (#409) correct FAQ url Co-authored-by:
sfk <18810651050@163.com> * add dockerfile (#189) Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> * Update cla.yml * Update cla.yml * feat<table model>: add tablemaster with paddleocr to detect and recognize table (#493) * Update cla.yml * Update bug_report.yml * Update README_zh-CN.md (#404) correct FAQ url * Update README_zh-CN.md (#404) (#409) (#410) correct FAQ url Co-authored-by:
sfk <18810651050@163.com> * Update FAQ_zh_cn.md add new issue * Update FAQ_en_us.md * Update README_Windows_CUDA_Acceleration_zh_CN.md * Update README_zh-CN.md * @Thepathakarpit has signed the CLA in opendatalab/MinerU#418 * Update cla.yml * feat: add tablemaster_paddle (#463) * Update README_zh-CN.md (#404) (#409) correct FAQ url Co-authored-by:
sfk <18810651050@163.com> * add dockerfile (#189) Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> * Update cla.yml * Update cla.yml --------- Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> Co-authored-by:
sfk <18810651050@163.com> Co-authored-by:
Aoyang Fang <222010547@link.cuhk.edu.cn> Co-authored-by:
Xiaomeng Zhao <moe@myhloli.com> * <fix>(para_split_v2): index out of range issue of span_text first char (#396) Co-authored-by:
liukaiwen <liukaiwen@pjlab.org.cn> * @Matthijz98 has signed the CLA in opendatalab/MinerU#467 * Create download_models.py * Create requirements-docker.txt * feat<table model>: add tablemaster with paddleocr to detect and recognize table * @strongerfly has signed the CLA in opendatalab/MinerU#487 * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table --------- Co-authored-by:
Xiaomeng Zhao <moe@myhloli.com> Co-authored-by:
sfk <18810651050@163.com> Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> Co-authored-by:
github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by:
Aoyang Fang <222010547@link.cuhk.edu.cn> Co-authored-by:
liukaiwen <liukaiwen@pjlab.org.cn> * feat<table model>: add tablemaster with paddleocr to detect and recognize table (#508) * Update cla.yml * Update bug_report.yml * Update README_zh-CN.md (#404) correct FAQ url * Update README_zh-CN.md (#404) (#409) (#410) correct FAQ url Co-authored-by:
sfk <18810651050@163.com> * Update FAQ_zh_cn.md add new issue * Update FAQ_en_us.md * Update README_Windows_CUDA_Acceleration_zh_CN.md * Update README_zh-CN.md * @Thepathakarpit has signed the CLA in opendatalab/MinerU#418 * Update cla.yml * feat: add tablemaster_paddle (#463) * Update README_zh-CN.md (#404) (#409) correct FAQ url Co-authored-by:
sfk <18810651050@163.com> * add dockerfile (#189) Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> * Update cla.yml * Update cla.yml --------- Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> Co-authored-by:
sfk <18810651050@163.com> Co-authored-by:
Aoyang Fang <222010547@link.cuhk.edu.cn> Co-authored-by:
Xiaomeng Zhao <moe@myhloli.com> * <fix>(para_split_v2): index out of range issue of span_text first char (#396) Co-authored-by:
liukaiwen <liukaiwen@pjlab.org.cn> * @Matthijz98 has signed the CLA in opendatalab/MinerU#467 * Create download_models.py * Create requirements-docker.txt * feat<table model>: add tablemaster with paddleocr to detect and recognize table * @strongerfly has signed the CLA in opendatalab/MinerU#487 * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table * Update cla.yml * Delete .github/workflows/gpu-ci.yml * Update Huggingface and ModelScope links to organization account * feat<table model>: add tablemaster with paddleocr to detect and recognize table --------- Co-authored-by:
Xiaomeng Zhao <moe@myhloli.com> Co-authored-by:
sfk <18810651050@163.com> Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> Co-authored-by:
github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by:
Aoyang Fang <222010547@link.cuhk.edu.cn> Co-authored-by:
liukaiwen <liukaiwen@pjlab.org.cn> Co-authored-by:
yyy <102640628+dt-yy@users.noreply.github.com> Co-authored-by:
wangbinDL <wangbin_research@163.com> * feat<table model>: add tablemaster with paddleocr to detect and recognize table (#511) * Update cla.yml * Update bug_report.yml * Update README_zh-CN.md (#404) correct FAQ url * Update README_zh-CN.md (#404) (#409) (#410) correct FAQ url Co-authored-by:
sfk <18810651050@163.com> * Update FAQ_zh_cn.md add new issue * Update FAQ_en_us.md * Update README_Windows_CUDA_Acceleration_zh_CN.md * Update README_zh-CN.md * @Thepathakarpit has signed the CLA in opendatalab/MinerU#418 * Update cla.yml * feat: add tablemaster_paddle (#463) * Update README_zh-CN.md (#404) (#409) correct FAQ url Co-authored-by:
sfk <18810651050@163.com> * add dockerfile (#189) Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> * Update cla.yml * Update cla.yml --------- Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> Co-authored-by:
sfk <18810651050@163.com> Co-authored-by:
Aoyang Fang <222010547@link.cuhk.edu.cn> Co-authored-by:
Xiaomeng Zhao <moe@myhloli.com> * <fix>(para_split_v2): index out of range issue of span_text first char (#396) Co-authored-by:
liukaiwen <liukaiwen@pjlab.org.cn> * @Matthijz98 has signed the CLA in opendatalab/MinerU#467 * Create download_models.py * Create requirements-docker.txt * feat<table model>: add tablemaster with paddleocr to detect and recognize table * @strongerfly has signed the CLA in opendatalab/MinerU#487 * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table * Update cla.yml * Delete .github/workflows/gpu-ci.yml * Update Huggingface and ModelScope links to organization account * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table * feat<table model>: add tablemaster with paddleocr to detect and recognize table --------- Co-authored-by:
Xiaomeng Zhao <moe@myhloli.com> Co-authored-by:
sfk <18810651050@163.com> Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> Co-authored-by:
github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by:
Aoyang Fang <222010547@link.cuhk.edu.cn> Co-authored-by:
liukaiwen <liukaiwen@pjlab.org.cn> Co-authored-by:
yyy <102640628+dt-yy@users.noreply.github.com> Co-authored-by:
wangbinDL <wangbin_research@163.com> --------- Co-authored-by:
drunkpig <60862764+drunkpig@users.noreply.github.com> Co-authored-by:
sfk <18810651050@163.com> Co-authored-by:
Aoyang Fang <222010547@link.cuhk.edu.cn> Co-authored-by:
Xiaomeng Zhao <moe@myhloli.com> Co-authored-by:
Kaiwen Liu <lkw_buaa@163.com> Co-authored-by:
github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by:
liukaiwen <liukaiwen@pjlab.org.cn> Co-authored-by:
wangbinDL <wangbin_research@163.com> * Hotfix readme 0.7.1 (#528) * Update README.md * Update README_zh-CN.md * Update README_zh-CN.md * Update README.md * Update README_zh-CN.md * Update README_zh-CN.md add HF、modelscope、colab url * Update README.md * Update README.md * Update README.md * Update README.md * Update README_zh-CN.md * Rename README.md to README_zh-CN.md * Create readme.md * Rename readme.md to README.md * Rename README.md to README_zh-CN.md * Update README_zh-CN.md * Create README.md * Update README.md * Update README.md * Update README.md * Update README_zh-CN.md * Create download_models_hf.py * Update README.md * Update README_zh-CN.md * Update README_zh-CN.md * Update README.md * Update README_zh-CN.md * Update FAQ_zh_cn.md * Update FAQ_en_us.md * Update FAQ_zh_cn.md * fix: resolve inaccuracy of drawing layout box caused by paragraphs combination #384 (#573) * fix: resolve inaccuracy of drawing layout box caused by paragraphs combination * fix: resolve inaccuracy of drawing layout box caused by paragraphs combination #384 * fix: resolve inaccuracy of drawing layout box caused by paragraphs combination #384 * fix: resolve inaccuracy of drawing layout box caused by paragraphs combination #384 * fix: resolve inaccuracy of drawing layout box caused by paragraphs combination #384 * Update README_zh-CN.md * Update README.md * Update README.md * Update README.md * Update README_zh-CN.md * add rag data api * Update README_zh-CN.md update rag api image * Update README.md docs: remove RAG related release notes * Update README_zh-CN.md docs: remove RAG related release notes * Update README_zh-CN.md update 更新记录 --------- Co-authored-by:
yyy <102640628+dt-yy@users.noreply.github.com> Co-authored-by:
sfk <18810651050@163.com> Co-authored-by:
Aoyang Fang <222010547@link.cuhk.edu.cn> Co-authored-by:
Xiaomeng Zhao <moe@myhloli.com> Co-authored-by:
Kaiwen Liu <lkw_buaa@163.com> Co-authored-by:
github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by:
liukaiwen <liukaiwen@pjlab.org.cn> Co-authored-by:
wangbinDL <wangbin_research@163.com>
-