perf(inference): adjust batch ratio for GPU memory sizes

- Simplify batch ratio logic for GPU memory >= 16GB - Remove unnecessary conditions for 20GB and 40GB memory

perf(inference): adjust batch ratio for GPU memory sizes
- Simplify batch ratio logic for GPU memory >= 16GB - Remove unnecessary conditions for 20GB and 40GB memory
0d3304d7 · myhloli · 59fc80d4 · 0d3304d7
Commit 0d3304d7 authored Mar 03, 2025 by myhloli
Hide whitespace changes
Inline Side-by-side

Showing with 1 addition and 5 deletions

magic_pdf/model/doc_analyze_by_custom_model.py magic_pdf/model/doc_analyze_by_custom_model.py +1 -5

No files found.
--- a/magic_pdf/model/doc_analyze_by_custom_model.py
+++ b/magic_pdf/model/doc_analyze_by_custom_model.py
@@ -170,11 +170,7 @@ def doc_analyze(
        gpu_memory = int(os.getenv("VIRTUAL_VRAM_SIZE", round(get_vram(device))))
        if gpu_memory is not None and gpu_memory >= 8:
-            if gpu_memory >= 40:
+            if gpu_memory >= 16:
-                batch_ratio = 32
-            elif gpu_memory >=20:
-                batch_ratio = 16
-            elif gpu_memory >= 16:
                batch_ratio = 8
            elif gpu_memory >= 10:
                batch_ratio = 4