wangsen / MinerU
Commit a5342950, authored Feb 09, 2025 by myhloli
fix(filter): toggle invalid character detection method
parent f35a6c08
Showing 1 changed file with 2 additions and 2 deletions
magic_pdf/filter/pdf_meta_scan.py (+2 -2)
...
@@ -323,8 +323,8 @@ def get_language(doc: fitz.Document):
 def check_invalid_chars(pdf_bytes):
     """乱码检测."""
-    return detect_invalid_chars_by_pymupdf(pdf_bytes)
-    # return detect_invalid_chars(pdf_bytes)
+    # return detect_invalid_chars_by_pymupdf(pdf_bytes)
+    return detect_invalid_chars(pdf_bytes)
 def pdf_meta_scan(pdf_bytes: bytes):
...
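The change above switches detectors by commenting one `return` out and the other in. As a rough sketch of an alternative, the same toggle could be expressed as a parameter. Everything here is an assumption for illustration: the two detector bodies are stand-ins (the real functions and their return values are not shown in this diff), and the `method` parameter, the `DETECTORS` registry, and its keys are hypothetical names that do not exist in the repository.

```python
from typing import Callable, Dict


# Hypothetical stand-ins: the real detectors are defined elsewhere in the
# project, and this diff does not show what they return.
def detect_invalid_chars(pdf_bytes: bytes) -> str:
    return "detect_invalid_chars"


def detect_invalid_chars_by_pymupdf(pdf_bytes: bytes) -> str:
    return "detect_invalid_chars_by_pymupdf"


# Illustrative registry mapping a method name to a detector function.
DETECTORS: Dict[str, Callable[[bytes], str]] = {
    "default": detect_invalid_chars,
    "pymupdf": detect_invalid_chars_by_pymupdf,
}


def check_invalid_chars(pdf_bytes: bytes, method: str = "default") -> str:
    """Garbled-text detection with a selectable backend (illustrative only)."""
    try:
        detector = DETECTORS[method]
    except KeyError:
        raise ValueError(f"unknown detection method: {method!r}") from None
    return detector(pdf_bytes)


if __name__ == "__main__":
    # The default matches the post-commit behaviour: detect_invalid_chars.
    print(check_invalid_chars(b"%PDF-1.7"))
    print(check_invalid_chars(b"%PDF-1.7", method="pymupdf"))
```

With this shape, switching methods would be a call-site or configuration change rather than a source edit, at the cost of one extra parameter to document.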