feat(pdf_parse): improve text extraction for vertical spans
- Calculate median span height to identify vertical spans - Use PyMuPDF's 'dict' output to fill vertical spans with lines
Showing
Please register or sign in to comment
- Calculate median span height to identify vertical spans - Use PyMuPDF's 'dict' output to fill vertical spans with lines