Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
wangsen
MinerU
Commits
4e2a5622
Commit
4e2a5622
authored
Jun 15, 2025
by
myhloli
Browse files
fix: simplify formula enable handling by removing redundant function call
parent
1383787b
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
1 deletion
+2
-1
mineru/backend/pipeline/model_json_to_middle_json.py
mineru/backend/pipeline/model_json_to_middle_json.py
+2
-1
No files found.
mineru/backend/pipeline/model_json_to_middle_json.py
View file @
4e2a5622
...
@@ -78,7 +78,7 @@ def page_model_info_to_page_info(page_model_info, image_dict, page, image_writer
...
@@ -78,7 +78,7 @@ def page_model_info_to_page_info(page_model_info, image_dict, page, image_writer
"""将所有区块的bbox整理到一起"""
"""将所有区块的bbox整理到一起"""
if
get_formula_enable
(
formula_enabled
)
:
if
formula_enabled
:
interline_equation_blocks
=
[]
interline_equation_blocks
=
[]
if
len
(
interline_equation_blocks
)
>
0
:
if
len
(
interline_equation_blocks
)
>
0
:
...
@@ -165,6 +165,7 @@ def page_model_info_to_page_info(page_model_info, image_dict, page, image_writer
...
@@ -165,6 +165,7 @@ def page_model_info_to_page_info(page_model_info, image_dict, page, image_writer
def
result_to_middle_json
(
model_list
,
images_list
,
pdf_doc
,
image_writer
,
lang
=
None
,
ocr_enable
=
False
,
formula_enabled
=
True
):
def
result_to_middle_json
(
model_list
,
images_list
,
pdf_doc
,
image_writer
,
lang
=
None
,
ocr_enable
=
False
,
formula_enabled
=
True
):
middle_json
=
{
"pdf_info"
:
[],
"_backend"
:
"pipeline"
,
"_version_name"
:
__version__
}
middle_json
=
{
"pdf_info"
:
[],
"_backend"
:
"pipeline"
,
"_version_name"
:
__version__
}
formula_enabled
=
get_formula_enable
(
formula_enabled
)
for
page_index
,
page_model_info
in
tqdm
(
enumerate
(
model_list
),
total
=
len
(
model_list
),
desc
=
"Processing pages"
):
for
page_index
,
page_model_info
in
tqdm
(
enumerate
(
model_list
),
total
=
len
(
model_list
),
desc
=
"Processing pages"
):
page
=
pdf_doc
[
page_index
]
page
=
pdf_doc
[
page_index
]
image_dict
=
images_list
[
page_index
]
image_dict
=
images_list
[
page_index
]
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment