Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
Menu
Open sidebar
wangsen
MinerU
Commits
be99753f
"git@developer.sourcefind.cn:OpenDAS/megatron-lm.git" did not exist on "ba2264abb7fe939c0ad30a4bfb6ac21e9938ae46"
Commit
be99753f
authored
Jun 13, 2025
by
myhloli
Browse files
feat: add progress bar to page processing in result_to_middle_json function
parent
59d8f105
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
2 additions
and
1 deletion
+2
-1
mineru/backend/pipeline/model_json_to_middle_json.py
mineru/backend/pipeline/model_json_to_middle_json.py
+2
-1
No files found.
mineru/backend/pipeline/model_json_to_middle_json.py
View file @
be99753f
...
...
@@ -2,6 +2,7 @@
import
time
from
loguru
import
logger
from
tqdm
import
tqdm
from
mineru.utils.config_reader
import
get_device
,
get_llm_aided_config
from
mineru.backend.pipeline.model_init
import
AtomModelSingleton
...
...
@@ -164,7 +165,7 @@ def page_model_info_to_page_info(page_model_info, image_dict, page, image_writer
def
result_to_middle_json
(
model_list
,
images_list
,
pdf_doc
,
image_writer
,
lang
=
None
,
ocr_enable
=
False
,
formula_enabled
=
True
):
middle_json
=
{
"pdf_info"
:
[],
"_backend"
:
"pipeline"
,
"_version_name"
:
__version__
}
for
page_index
,
page_model_info
in
enumerate
(
model_list
):
for
page_index
,
page_model_info
in
tqdm
(
enumerate
(
model_list
)
,
total
=
len
(
model_list
),
desc
=
"Processing pages"
)
:
page
=
pdf_doc
[
page_index
]
image_dict
=
images_list
[
page_index
]
page_info
=
page_model_info_to_page_info
(
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment