refactor(ocr): improve text processing and span handling
- Remove unused language detection code
- Simplify text content processing logic
- Update span sorting and text extraction in pdf_parse_union_core_v2.py