pdfOCR is an iText 7 add-on that recognizes and extracts text in scanned documents and images. It can also convert them into fully ISO-compliant PDF or PDF/A-3u files that are accessible, searchable, and suitable for archiving.
pdf data image ocr recognition glyphs tesseract scan character spanish searchable ligatures hindi portuguese optical archival mandarin extractable iso-compliant diacritic
Updated Jan 11, 2025 - C#