Commit 784c61a2 (unverified), authored by Xiaomeng Zhao, committed by GitHub

Merge pull request #906 from icecraft/feat/add_en_docs

Feat/add en docs
parents 5f794533 91f8cbe2
Known Issues
============
- Reading order is determined by the model based on the spatial
distribution of readable content, and may be out of order in some
areas under extremely complex layouts.
- Vertical text is not supported.
- Tables of contents and lists are recognized through rules, and some
uncommon list formats may not be recognized.
- Only one level of headings is supported; hierarchical headings are
not currently supported.
- Code blocks are not yet supported in the layout model.
- Comic books, art albums, primary school textbooks, and exercises
cannot be parsed well.
- Table recognition may result in row/column recognition errors in
complex tables.
- OCR recognition may produce inaccurate characters in PDFs of
lesser-known languages (e.g., diacritical marks in Latin script,
easily confused characters in Arabic script).
- Some formulas may not render correctly in Markdown.
\ No newline at end of file
@@ -46,20 +46,29 @@ the relevant PDF**.
 Key Features
 ------------
-- Removes elements such as headers, footers, footnotes, and page
-numbers while maintaining semantic continuity
-- Outputs text in a human-readable order from multi-column documents
-- Retains the original structure of the document, including titles,
-paragraphs, and lists
-- Extracts images, image captions, tables, and table captions
-- Automatically recognizes formulas in the document and converts them
-to LaTeX
-- Automatically recognizes tables in the document and converts them to
-LaTeX
-- Automatically detects and enables OCR for corrupted PDFs
-- Supports both CPU and GPU environments
-- Supports Windows, Linux, and Mac platforms
+- Remove headers, footers, footnotes, page numbers, etc., to ensure
+semantic coherence.
+- Output text in human-readable order, suitable for single-column,
+multi-column, and complex layouts.
+- Preserve the structure of the original document, including headings,
+paragraphs, lists, etc.
+- Extract images, image descriptions, tables, table titles, and
+footnotes.
+- Automatically recognize and convert formulas in the document to LaTeX
+format.
+- Automatically recognize and convert tables in the document to LaTeX
+or HTML format.
+- Automatically detect scanned PDFs and garbled PDFs and enable OCR
+functionality.
+- OCR supports detection and recognition of 84 languages.
+- Supports multiple output formats, such as multimodal and NLP
+Markdown, JSON sorted by reading order, and rich intermediate
+formats.
+- Supports various visualization results, including layout
+visualization and span visualization, for efficient confirmation of
+output quality.
+- Supports both CPU and GPU environments.
+- Compatible with Windows, Linux, and Mac platforms.
 User Guide
 -------------
@@ -91,14 +100,6 @@ Additional Notes
 additional_notes/known_issues
 additional_notes/faq
additional_notes/changelog
 additional_notes/glossary
Projects
---------
.. toctree::
:maxdepth: 1
:caption: Projects
projects
\ No newline at end of file
@@ -137,7 +137,7 @@ Download a sample file from the repository and test it.
 .. code:: sh
 wget https://github.com/opendatalab/MinerU/raw/master/demo/small_ocr.pdf
-magic-pdf -p small_ocr.pdf
+magic-pdf -p small_ocr.pdf -o ./output
 9. Test CUDA Acceleration
 ~~~~~~~~~~~~~~~~~~~~~~~~~
@@ -145,10 +145,6 @@ Download a sample file from the repository and test it.
 If your graphics card has at least **8GB** of VRAM, follow these steps
 to test CUDA acceleration:
-❗ Due to the extremely limited nature of 8GB VRAM for running this
-application, you need to close all other programs using VRAM to
-ensure that 8GB of VRAM is available when running this application.
 1. Modify the value of ``"device-mode"`` in the ``magic-pdf.json``
 configuration file located in your home directory.
@@ -162,7 +158,7 @@ to test CUDA acceleration:
 .. code:: sh
-magic-pdf -p small_ocr.pdf
+magic-pdf -p small_ocr.pdf -o ./output
 10. Enable CUDA Acceleration for OCR
 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
@@ -178,7 +174,9 @@ to test CUDA acceleration:
 .. code:: sh
-magic-pdf -p small_ocr.pdf
+magic-pdf -p small_ocr.pdf -o ./output
 .. _windows_10_or_11_section:
@@ -252,7 +250,7 @@ Download a sample file from the repository and test it.
 .. code:: powershell
 wget https://github.com/opendatalab/MinerU/raw/master/demo/small_ocr.pdf -O small_ocr.pdf
-magic-pdf -p small_ocr.pdf
+magic-pdf -p small_ocr.pdf -o ./output
 8. Test CUDA Acceleration
 ~~~~~~~~~~~~~~~~~~~~~~~~~
@@ -260,10 +258,6 @@ Download a sample file from the repository and test it.
 If your graphics card has at least 8GB of VRAM, follow these steps to
 test CUDA-accelerated parsing performance.
-❗ Due to the extremely limited nature of 8GB VRAM for running this
-application, you need to close all other programs using VRAM to
-ensure that 8GB of VRAM is available when running this application.
 1. **Overwrite the installation of torch and torchvision** supporting
 CUDA.
@@ -295,7 +289,7 @@ test CUDA-accelerated parsing performance.
 ::
-magic-pdf -p small_ocr.pdf
+magic-pdf -p small_ocr.pdf -o ./output
 9. Enable CUDA Acceleration for OCR
 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
@@ -311,5 +305,4 @@ test CUDA-accelerated parsing performance.
 ::
-magic-pdf -p small_ocr.pdf
+magic-pdf -p small_ocr.pdf -o ./output
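Reviewer note: the CUDA steps touched by this diff (edit ``"device-mode"`` in ``magic-pdf.json``, then rerun the sample with ``-o ./output``) could be scripted roughly as below. This is only a sketch under stated assumptions: the helper names ``set_device_mode`` and ``build_parse_command`` are hypothetical, and the value ``"cuda"`` in the usage comment is assumed, since the diff does not show which value the docs prescribe. Only the ``"device-mode"`` key, the ``magic-pdf.json`` file name, and the ``-p``/``-o`` flags come from the changed docs.

```python
import json
from pathlib import Path


def set_device_mode(config_path, mode):
    """Set the "device-mode" key in a magic-pdf.json-style config file.

    All other keys in the file are preserved. Returns the updated config.
    """
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))
    config["device-mode"] = mode
    path.write_text(json.dumps(config, indent=4) + "\n", encoding="utf-8")
    return config


def build_parse_command(pdf_path, output_dir):
    """Build the argv list for the invocation shown in the updated docs."""
    return ["magic-pdf", "-p", str(pdf_path), "-o", str(output_dir)]


# Hypothetical usage (not executed here; "cuda" is an assumed value):
#   import subprocess
#   set_device_mode(Path.home() / "magic-pdf.json", "cuda")
#   subprocess.run(build_parse_command("small_ocr.pdf", "./output"), check=True)
```

Building the command as a list rather than a shell string sidesteps quoting problems if the PDF path contains spaces.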