- 06 Dec, 2024 22 commits
-
-
Xiaomeng Zhao authored
refactor(magic_pdf): remove unused threading lock and model initialization code
-
myhloli authored
- Remove threading.Lock import and usage
- Delete unused model initialization comments and code
- Simplify OCR model initialization in both pdf_extract_kit.py and pdf_parse_union_core_v2.py
-
Xiaomeng Zhao authored
feat: update test case
-
dt-yy authored
-
Xiaomeng Zhao authored
fix(multi-threading): enable multi-threading support for PaddleOCR
-
myhloli authored
- Remove usage of AtomModelSingleton for OCR model initialization
- Use ocr_model_init function for creating OCR model instance
- Update import statement to include ocr_model_init
- Comment out old OCR model initialization code
-
myhloli authored
- Remove usage of AtomModelSingleton for OCR model initialization
- Add import of ocr_model_init from model_init module
- Update OCR model initialization process to use ocr_model_init function
- Remove lock for OCR processing as it's no longer needed
-
myhloli authored
- Remove usage of ModelSingleton class
- Initialize model directly using custom_model_init function
- Add self._lock attribute to PDFExtractKit class for thread safety
- Replace local lock with self._lock for OCR processing
-
Xiaomeng Zhao authored
fix(model): simplify model initialization logic
-
myhloli authored
-
Xiaomeng Zhao authored
fix: update notify
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
dt-yy authored
-
赵小蒙 authored
- Remove unnecessary threading.Lock in AtomModelSingleton
- Add threading.Lock to CustomPEKModel for OCR processing
- Simplify model initialization logic in AtomModelSingleton
-
Xiaomeng Zhao authored
perf(model): optimize model initialization
-
myhloli authored
- Add condition to return existing model if already initialized
- Improve efficiency by avoiding redundant model creation
-
- 05 Dec, 2024 2 commits
-
-
Xiaomeng Zhao authored
perf(model): add threading lock for OCR model initialization
-
myhloli authored
- Introduce a lock to synchronize access to OCR model initialization
- This change improves thread safety when multiple threads access the OCR model concurrently
- The lock ensures that the OCR model is initialized only once, even in multi-threaded scenarios
-
- 03 Dec, 2024 3 commits
-
-
Xiaomeng Zhao authored
feat(gradio_app): implement dynamic concurrency limit based on VRAM
-
myhloli authored
- Update VRAM checking logic in app.py and model_utils.py
- Add None and type checks for VRAM values
- Adjust concurrency limit calculation in app.py
- Modify clean_vram function to handle cases with no VRAM information
-
myhloli authored
- Add get_concurrency_limit function to calculate concurrency limit based on VRAM
- Update clean_vram function and rename to get_vram for better clarity
- Apply concurrency limit to the to_markdown function in the Gradio app
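A VRAM-based limit of this kind might look roughly like the sketch below. The function name comes from the commit message, but the signature, the 8 GB-per-worker budget, and the fallback of one worker are assumptions made for illustration; the actual thresholds in app.py are not shown in this log.

```python
import math
from typing import Optional


def get_concurrency_limit(vram_gb: Optional[float], gb_per_worker: float = 8.0) -> int:
    """Map total VRAM in GB to a worker count (the 8 GB budget is an assumption)."""
    # Fall back to a single worker when VRAM is unknown, not numeric, or not finite.
    if isinstance(vram_gb, bool) or not isinstance(vram_gb, (int, float)):
        return 1
    if not math.isfinite(vram_gb):
        return 1
    # Always allow at least one worker, even on small or CPU-only machines.
    return max(1, int(vram_gb // gb_per_worker))
```

In a Gradio 4 app the result would typically be passed as the `concurrency_limit` argument of the event listener that runs the conversion.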
-
- 02 Dec, 2024 8 commits
-
-
Xiaomeng Zhao authored
master->dev
-
myhloli authored
-
Xiaomeng Zhao authored
Release 0.10.5
-
Xiaomeng Zhao authored
Dev -> 0.10.5
-
Xiaomeng Zhao authored
fix(pre_proc): prevent errors when imageWriter is None
-
Xiaomeng Zhao authored
refactor(para): adjust line height multiplier for block splitting, fix(pre_proc): prevent errors when imageWriter is None
-
myhloli authored
- Decrease the maximum width and height from 9000 to 4500 pixels
- This change aims to prevent excessive resource usage when rendering PDFs
-
myhloli authored
- Updated cut_image.py to check for NoneType imageWriter
- Prevents AttributeError when imageWriter is not provided
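The guard described here can be sketched as follows. This is an illustration only: `ImageWriter`, `save_cropped_image`, and the `write(path, data)` method are hypothetical names, and the real interface in cut_image.py may differ.

```python
from typing import Optional, Protocol


class ImageWriter(Protocol):
    """Hypothetical writer interface assumed for this sketch."""

    def write(self, path: str, data: bytes) -> None: ...


def save_cropped_image(
    image_writer: Optional[ImageWriter], path: str, data: bytes
) -> Optional[str]:
    """Write the crop if a writer is available; otherwise skip and return None."""
    if image_writer is None:
        # Without this check, image_writer.write(...) raises AttributeError.
        return None
    image_writer.write(path, data)
    return path
```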
-
- 30 Nov, 2024 4 commits
-
-
Xiaomeng Zhao authored
refactor(para): adjust line height multiplier for block splitting
-
myhloli authored
- Decrease the line height multiplier from 0.8 to 0.7 for both left and right sides
- This modification aims to improve the accuracy of paragraph splitting
-
Xiaomeng Zhao authored
-
houlinfeng authored
-
- 29 Nov, 2024 1 commit
-
-
myhloli authored
-