index.md 2.18 KB
Newer Older
sfk's avatar
sfk committed
1
2
# Frequently Asked Questions

3
4
5
6
If your question is not listed, you can also use [DeepWiki](https://deepwiki.com/opendatalab/MinerU) to communicate with the AI assistant, which can solve most common problems.

If you still cannot resolve the issue, you can join the community through [Discord](https://discord.gg/Tdedn9GTXq) or [WeChat](http://mineru.space/s/V85Yl) to communicate with other users and developers.

Sidney233's avatar
Sidney233 committed
7
## 1. Encountered the error `ImportError: libGL.so.1: cannot open shared object file: No such file or directory` in Ubuntu 22.04 on WSL2
8
9
10
11
12
13
14
15

The `libgl` library is missing in Ubuntu 22.04 on WSL2. You can install the `libgl` library with the following command to resolve the issue:

```bash
sudo apt-get install libgl1-mesa-glx
```

Reference: https://github.com/opendatalab/MinerU/issues/388
16

drunkpig's avatar
drunkpig committed
17

Sidney233's avatar
Sidney233 committed
18
## 2. Error when installing MinerU on CentOS 7 or Ubuntu 18: `ERROR: Failed building wheel for simsimd`
Xiaomeng Zhao's avatar
Xiaomeng Zhao committed
19
20
21

The new version of albumentations (1.4.21) introduces a dependency on simsimd. Since the pre-built package of simsimd for Linux requires a glibc version greater than or equal to 2.28, this causes installation issues on some Linux distributions released before 2019. You can resolve this issue by using the following command:
```
22
23
24
conda create -n mineru python=3.11 -y
conda activate mineru
pip install -U "mineru[pipeline_old_linux]"
Xiaomeng Zhao's avatar
Xiaomeng Zhao committed
25
26
27
```

Reference: https://github.com/opendatalab/MinerU/issues/1004
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42


## 3. Missing text information in parsing results when installing and using on Linux systems.

MinerU uses `pypdfium2` instead of `pymupdf` as the PDF page rendering engine in versions >=2.0 to resolve AGPLv3 license issues. On some Linux distributions, due to missing CJK fonts, some text may be lost during the process of rendering PDFs to images.
To solve this problem, you can install the noto font package with the following commands, which are effective on Ubuntu/Debian systems:
```bash
sudo apt update
sudo apt install fonts-noto-core
sudo apt install fonts-noto-cjk
fc-cache -fv
```
You can also directly use our [Docker deployment](../quick_start/docker_deployment.md) method to build the image, which includes the above font packages by default.

Reference: https://github.com/opendatalab/MinerU/issues/2915