wangsen / MinerU
Commit e422bdc0, authored Jun 12, 2025 by myhloli
fix: add checks for empty overlay PDFs and log warnings for invalid pages
parent 4f6d8d7c
Showing 1 changed file with 15 additions and 2 deletions
mineru/utils/draw_bbox.py (+15, -2)
 import json
 from io import BytesIO
+from loguru import logger
 from pypdf import PdfReader, PdfWriter
 from reportlab.pdfgen import canvas
 ...
@@ -182,7 +183,13 @@ def draw_layout_bbox(pdf_info, pdf_bytes, out_path, filename):
         packet.seek(0)
         overlay_pdf = PdfReader(packet)
-        page.merge_page(overlay_pdf.pages[0])
+        # Check that overlay_pdf.pages is not empty
+        if len(overlay_pdf.pages) > 0:
+            page.merge_page(overlay_pdf.pages[0])
+        else:
+            # Log a warning and continue with the next page
+            logger.warning(f"layout.pdf: page {i + 1} did not produce a valid overlay PDF")
         output_pdf.add_page(page)
     # Save the result
 ...
@@ -290,7 +297,13 @@ def draw_span_bbox(pdf_info, pdf_bytes, out_path, filename):
         packet.seek(0)
         overlay_pdf = PdfReader(packet)
-        page.merge_page(overlay_pdf.pages[0])
+        # Check that overlay_pdf.pages is not empty
+        if len(overlay_pdf.pages) > 0:
+            page.merge_page(overlay_pdf.pages[0])
+        else:
+            # Log a warning and continue with the next page
+            logger.warning(f"span.pdf: page {i + 1} did not produce a valid overlay PDF")
         output_pdf.add_page(page)
     # Save the PDF
 ...
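Both hunks apply the same guard, so the check could be factored into one helper. The following is a minimal sketch of that idea and is not part of this commit: the name `merge_overlay_if_present` and its parameters are hypothetical, and the standard-library `logging` module stands in for `loguru` so the snippet has no third-party dependencies. It only assumes that `page` exposes `merge_page(...)` and that `overlay_pages` supports `len()` and indexing, as `PdfReader(packet).pages` does in pypdf.

```python
import logging
from typing import Any, Sequence

# Stand-in for the loguru logger used in draw_bbox.py (assumption for this sketch).
logger = logging.getLogger("draw_bbox_sketch")


def merge_overlay_if_present(
    page: Any,
    overlay_pages: Sequence[Any],
    page_index: int,
    label: str,
) -> bool:
    """Merge the first overlay page onto `page` when the overlay has pages.

    Returns True if an overlay page was merged, False if the overlay was
    empty. In the empty case a warning is logged and `page` is left
    unchanged, so the caller can still add it to the output document.
    """
    if len(overlay_pages) > 0:
        page.merge_page(overlay_pages[0])
        return True
    # page_index is zero-based; report a one-based page number to the reader.
    logger.warning(
        "%s: page %d did not produce a valid overlay PDF", label, page_index + 1
    )
    return False
```

Inside the loops of `draw_layout_bbox` and `draw_span_bbox`, the call would look like `merge_overlay_if_present(page, overlay_pdf.pages, i, "layout.pdf")`, followed by the existing `output_pdf.add_page(page)`. As in the diff, a page with an empty overlay is still written to the output, just without the drawn boxes.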