refactor(pdf_parse): improve character spacing handling in PDF text extraction
- Update the logic for inserting spaces between characters- Consider the next character's position instead of the previous one - Adjust the spacing threshold to 25% of the average character width - Ignore spaces at the end of lines to prevent double spaces
Showing
Please register or sign in to comment