index.md 4.76 KB
Newer Older
Sidney233's avatar
Sidney233 committed
1
2
# Using MinerU

3
4
## Quick Model Source Configuration
MinerU uses `huggingface` as the default model source. If users cannot access `huggingface` due to network restrictions, they can conveniently switch the model source to `modelscope` through environment variables:
Sidney233's avatar
Sidney233 committed
5
```bash
6
export MINERU_MODEL_SOURCE=modelscope
Sidney233's avatar
Sidney233 committed
7
```
8
For more information about model source configuration and custom local model paths, please refer to the [Model Source Documentation](./model_source.md) in the documentation.
Sidney233's avatar
Sidney233 committed
9
10
11

---

12
13
## Quick Usage via Command Line
MinerU has built-in command line tools that allow users to quickly use MinerU for PDF parsing through the command line:
Sidney233's avatar
Sidney233 committed
14
```bash
15
# Default parsing using pipeline backend
Sidney233's avatar
Sidney233 committed
16
17
mineru -p <input_path> -o <output_path>
```
18
19
- `<input_path>`: Local PDF/image file or directory
- `<output_path>`: Output directory
Sidney233's avatar
Sidney233 committed
20

21
22
> [!NOTE]
> The command line tool will automatically attempt cuda/mps acceleration on Linux and macOS systems. Windows users who need cuda acceleration should visit the [PyTorch official website](https://pytorch.org/get-started/locally/) to select the appropriate command for their cuda version to install acceleration-enabled `torch` and `torchvision`.
Sidney233's avatar
Sidney233 committed
23

24
25
> [!TIP]
> For more information about output files, please refer to [Output File Documentation](./output_file.md).
Sidney233's avatar
Sidney233 committed
26
27

```bash
28
29
# Or specify vlm backend for parsing
mineru -p <input_path> -o <output_path> -b vlm-transformers
Sidney233's avatar
Sidney233 committed
30
```
31
32
> [!TIP]
> The vlm backend additionally supports `sglang` acceleration. Compared to the `transformers` backend, `sglang` can achieve 20-30x speedup. You can check the installation method for the complete package supporting `sglang` acceleration in the [Extension Modules Installation Guide](../quick_start/extension_modules.md).
Sidney233's avatar
Sidney233 committed
33

34
If you need to adjust parsing options through custom parameters, you can also check the more detailed [Command Line Tools Usage Instructions](./cli_tools.md) in the documentation.
Sidney233's avatar
Sidney233 committed
35
36
37

---

38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
## Advanced Usage via API, WebUI, sglang-client/server

- Direct Python API calls: [Python Usage Example](https://github.com/opendatalab/MinerU/blob/master/demo/demo.py)
- FastAPI calls:
  ```bash
  mineru-api --host 127.0.0.1 --port 8000
  ```
  Access http://127.0.0.1:8000/docs in your browser to view the API documentation.
- Start Gradio WebUI visual frontend:
  ```bash
  # Using pipeline/vlm-transformers/vlm-sglang-client backends
  mineru-gradio --server-name 127.0.0.1 --server-port 7860
  # Or using vlm-sglang-engine/pipeline backends (requires sglang environment)
  mineru-gradio --server-name 127.0.0.1 --server-port 7860 --enable-sglang-engine true
  ```
  Access http://127.0.0.1:7860 in your browser to use Gradio WebUI or access http://127.0.0.1:7860/?view=api to use the Gradio API.
- Using `sglang-client/server` method:
  ```bash
  # Start sglang server (requires sglang environment)
  mineru-sglang-server --port 30000
  # In another terminal, connect to sglang server via sglang client (only requires CPU and network, no sglang environment needed)
  mineru -p <input_path> -o <output_path> -b vlm-sglang-client -u http://127.0.0.1:30000
  ``` 
Sidney233's avatar
Sidney233 committed
61
> [!TIP]
62
63
64
65
> All officially supported sglang parameters can be passed to MinerU through command line arguments, including the following commands: `mineru`, `mineru-sglang-server`, `mineru-gradio`, `mineru-api`.
> We have compiled some commonly used parameters and usage methods for `sglang`, which can be found in the documentation [Advanced Command Line Parameters](./advanced_cli_parameters.md).

## Extending MinerU Functionality with Configuration Files
Sidney233's avatar
Sidney233 committed
66

67
68
69
70
71
72
- MinerU is now ready to use out of the box, but also supports extending functionality through configuration files. You can create a `mineru.json` file in your user directory to add custom configurations.
- The `mineru.json` file will be automatically generated when you use the built-in model download command `mineru-models-download`, or you can create it by copying the [configuration template file](https://github.com/opendatalab/MinerU/blob/master/mineru.template.json) to your user directory and renaming it to `mineru.json`.
- Here are some available configuration options:
  - `latex-delimiter-config`: Used to configure LaTeX formula delimiters, defaults to `$` symbol, can be modified to other symbols or strings as needed.
  - `llm-aided-config`: Used to configure parameters for LLM-assisted title hierarchy, compatible with all LLM models supporting `openai protocol`, defaults to using Alibaba Cloud Bailian's `qwen2.5-32b-instruct` model. You need to configure your own API key and set `enable` to `true` to enable this feature.
  - `models-dir`: Used to specify local model storage directory, please specify model directories for `pipeline` and `vlm` backends separately. After specifying the directory, you can use local models by configuring the environment variable `export MINERU_MODEL_SOURCE=local`.