- 08 May, 2025 1 commit
-
-
myhloli authored
-
- 06 May, 2025 1 commit
-
-
myhloli authored
-
- 30 Apr, 2025 1 commit
-
-
myhloli authored
- Add logger info for each batch processed - Include batch number and page count in log message
-
- 29 Apr, 2025 10 commits
-
-
Xiaomeng Zhao authored
update version
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.10
-
Xiaomeng Zhao authored
feat(model_utils): adjust table detection threshold and add features
-
myhloli authored
-
myhloli authored
- Adjust the threshold for considering tables inside other tables from2 to 3 - Add support for custom formula delimiters through user configuration - Pin pdfminer.six to version 20250324 to prevent parsing failures
-
Xiaomeng Zhao authored
build(deps): pin pdfminer.six version to 20250324
-
myhloli authored
- Update pdfminer.six dependency from >=20250416 to ==20250324 - This change ensures compatibility with specific project requirements
-
myhloli authored
-
myhloli authored
- Add regex patterns for replacing LaTeX symbols \fint and \up with their Unicode equivalents
-
- 28 Apr, 2025 2 commits
-
-
Xiaomeng Zhao authored
feat(latex): enhance LaTeX delimiter support and configurability
-
myhloli authored
- Add support for \(\) and \[\] delimiters in addition to $$ and $$- Make LaTeX delimiter configuration more flexible and user-defined - Update configuration file to include LaTeX delimiter settings - Modify OCR content generation to use configurable delimiters
-
- 27 Apr, 2025 18 commits
-
-
Xiaomeng Zhao authored
fix(mfr): add underscore symbol to unimernet
-
myhloli authored
-
myhloli authored
- Add \textunderscore to the list of LaTeX patterns - This allows the model to properly render underscore characters
-
Xiaomeng Zhao authored
update version
-
myhloli authored
-
Xiaomeng Zhao authored
Release 1.3.9
-
Xiaomeng Zhao authored
Dev
-
Xiaomeng Zhao authored
docs: correct typo for Apple Silicon in install guide and README
-
myhloli authored
- Fix typo in install.rst and README_zh-CN.md - Change 'apple slicon' to 'Apple silicon'
-
Xiaomeng Zhao authored
docs(README): fix typo
-
Xiaomeng Zhao authored
docs(README): fix typo
-
Xiaomeng Zhao authored
update readme
-
Xiaomeng Zhao authored
feat(pdf): optimize formula parsing and update pdfminer.six
-
myhloli authored
- Improve formula parsing success rate for better formula rendering - Upgrade pdfminer.six to the latest version to fix PDF parsing issues- Update changelog in both English and Chinese README files
-
Xiaomeng Zhao authored
build(deps): update pdfminer.six to latest version
-
myhloli authored
- Improve \left and \right command handling in LaTeX formulas - Enhance environment type matching for array, matrix, and other structures - Refactor code for better readability and maintainability
-
myhloli authored
- Refactor LaTeX left/right pair fixing logic for better balance - Add environment detection and correction for common math environments - Implement more robust whitespace handling and command substitution - Optimize regex patterns for improved performance and readability
-
myhloli authored
- Change pdfminer.six dependency from ==20231228 to >=20250416 - This update ensures compatibility with the latest version of pdfminer.six
-
- 25 Apr, 2025 3 commits
-
-
myhloli authored
- Add functions to fix LaTeX left and right commands - Implement brace matching and repair in LaTeX formulas - Remove unnecessary whitespace and repair LaTeX code - Replace specific LaTeX commands with appropriate alternatives - Add logging for debugging purposes
-
myhloli authored
- Add functions to fix LaTeX left and right commands - Implement brace matching and repair in LaTeX formulas - Remove unnecessary whitespace and repair LaTeX code - Replace specific LaTeX commands with appropriate alternatives - Add logging for debugging purposes
-
github-actions[bot] authored
-
- 24 Apr, 2025 3 commits
-
-
小林在忙毕业设计 authored
-
Xiaomeng Zhao authored
fix(mfr): improve LaTeX whitespace handling in unimernet model
-
myhloli authored
- Preserve "\ " sequences during whitespace removal - Add temporary substitution to prevent incorrect processing of "\ " sequences - Restore "\ " sequences after removing unnecessary whitespace
-
- 23 Apr, 2025 1 commit
-
-
Xiaomeng Zhao authored
master->dev
-