ocr

Here are 142 public repositories matching this topic...

Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.

Updated Apr 8, 2025
HTML

kha-white / mokuro

Read Japanese manga inside browser with selectable text.

ocr japanese manga comics manga-reader comics-reader

Updated Jan 28, 2025
HTML

XMuli / SunnyCapturer

A simple and beautiful cross-platform screenshot tool that also supports offline OCR, image translation, stickers, and image pinning.

screenshot image ocr snapshot screen capture translate sunnycapturer

Updated Mar 3, 2025
HTML

pd3f / pd3f

🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based

python pdf machine-learning ocr pipeline text-extraction pdf-to-text language-model extract-text parsr pd3f

Updated Oct 13, 2023
HTML

victorqribeiro / ocr

Simple app to extract text from pictures using Tesseract

ocr tesseract text-extraction text-recognition image-recognition

Updated Jul 19, 2021
HTML

WZBSocialScienceCenter / pdf2xml-viewer

A simple viewer and inspection tool for text boxes in PDF documents

d3 pdf ocr xml viewer pdf-document

Updated Mar 7, 2022
HTML

BruceWind / Image-Anti-OCR

ocr anti-ocr

Updated Nov 8, 2022
HTML

bensonruan / Tesseract-OCR

Tesseract.js OCR

javascript machine-learning ocr computer-vision tesseract artificial-intelligence image-to-text

Updated Jun 19, 2023
HTML

gojiplus / abbyyR

R Client for the Abbyy Cloud OCR

cran ocr ocr-engine abbyy-cloud-ocr

Updated Jul 4, 2023
HTML

WangRongsheng / PaddleOCR-Flask-deploy

✅ Deploy PaddleOCR with Flask so it is easy to call

flask ocr paddleocr

Updated Jun 13, 2022
HTML

mbzuai-oryx / AIN

AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understanding across diverse domains.

ocr culture remote-sensing vqa vlm vision-and-language lmm multi-images