"docs/en_US/Nnictl.md" did not exist on "e6b3828cc98119e4ba9293f73fb9a0b5b0ce3740"
  • myhloli's avatar
    fix(ocr): improve image and table content extraction · b7e9d454
    myhloli authored
    - Update image content extraction to iterate through all spans in a block
    - Add support for extracting table content from spans within a block
    - Handle multiple content types within table spans (latex, html, image)
    - Refactor code to be more modular and easier to maintain
    b7e9d454
ocr_mkcontent.py 11.8 KB