".github/vscode:/vscode.git/clone" did not exist on "e1aaa79ac9954c705f839e8304d29eac452ce04b"
- 09 Dec, 2024 3 commits
-
-
icecraft authored
-
Xiaomeng Zhao authored
fix: add parse_pdf_type and version
-
icecraft authored
-
- 07 Dec, 2024 4 commits
-
-
Xiaomeng Zhao authored
-
sawmice authored
-
Xiaomeng Zhao authored
fix(dict2md): add space for inline equations in CJK contexts
-
myhloli authored
- In Chinese, Japanese, and Korean (CJK) languages, no space is needed for line breaks within paragraphs. - However, if an inline equation is at the end of a line, a space should be added to separate it from the following text. - This change improves the formatting of documents containing both CJK text and inline equations.
-
- 06 Dec, 2024 31 commits
-
-
Xiaomeng Zhao authored
Refactor/add user api
-
Xiaomeng Zhao authored
refactor(magic-pdf): optimize model initialization and concurrency control
-
myhloli authored
- Remove concurrency limit logic from app.py - Update model initialization process in various modules - Remove unused VRAM check for concurrency limit - Refactor OCR model initialization in pdf_extract_kit.py - Update txt_spans_extract_v2 function to use lang parameter instead of ocr_model
-
Xiaomeng Zhao authored
refactor(ocr): replace AtomModelSingleton with ocr_model_init for OCR model instantiation
-
myhloli authored
- Remove usage of AtomModelSingleton for OCR model creation - Add ocr_model_init function to initialize OCR model - Update OCR model initialization in pdf_extract_kit.py and pdf_parse_union_core_v2.py - Modify txt_spans_extract_v2 function to accept ocr_model as a parameter - Update parse_page_core function to use ocr_model instead of lang for OCR processing
-
Xiaomeng Zhao authored
refactor(model): implement thread-safe OCR model initialization
-
myhloli authored
- Add threading support for OCR model initialization - Modify AtomModelSingleton to handle thread-specific instances - Update PDFExtractKit and PDFParseUnionCoreV2 to use new thread-safe OCR initialization
-
Xiaomeng Zhao authored
build(deps): specify minimum version for ultralytics
-
myhloli authored
- Update `ultralytics` dependency to version >= 8.3.43 - This change ensures compatibility with yolov8 for formula detection
-
Xiaomeng Zhao authored
refactor(magic_pdf): remove unused threading lock and model initialization code
-
myhloli authored
- Remove threading.Lock import and usage - Delete unused model initialization comments and code- Simplify OCR model initialization in both pdf_extract_kit.py and pdf_parse_union_core_v2.py
-
Xiaomeng Zhao authored
feat: update test case
-
dt-yy authored
-
Xiaomeng Zhao authored
fix(multi-threading ):Enable multi-threading support for PaddleOCR.
-
myhloli authored
- Remove usage of AtomModelSingleton for OCR model initialization- Use ocr_model_init function for creating OCR model instance - Update import statement to include ocr_model_init- Comment out old OCR model initialization code
-
myhloli authored
- Remove usage of AtomModelSingleton for OCR model initialization - Add import of ocr_model_init from model_init module - Update OCR model initialization process to use ocr_model_init function - Remove lock for OCR processing as it's no longer needed
-
myhloli authored
- Remove usage of ModelSingleton class - Initialize model directly using custom_model_init function - Add self._lock attribute to PDFExtractKit class for thread safety- Replace local lock with self._lock for OCR processing
-
Xiaomeng Zhao authored
fix(model): simplify model initialization logic
-
myhloli authored
-
Xiaomeng Zhao authored
fix: update notify
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
赵小蒙 authored
- Remove unnecessary threading.Lock in AtomModelSingleton - Add threading.Lock to CustomPEKModel for OCR processing - Simplify model initialization logic in AtomModelSingleton
-
Xiaomeng Zhao authored
perf(model): optimize model initialization
-
myhloli authored
- Add condition to return existing model if already initialized - Improve efficiency by avoiding redundant model creation
-
- 05 Dec, 2024 2 commits
-
-
Xiaomeng Zhao authored
perf(model): add threading lock for OCR model initialization
-
myhloli authored
- Introduce a lock to synchronize access to OCR model initialization- This change improves thread safety when multiple threads access the OCR model concurrently - The lock ensures that the OCR model is initialized only once, even in multi-threaded scenarios
-