Tesseract with Python

Jupyter notebook code that uses Tesseract OCR, through pytesseract and Python, to extract text, box files and hOCR files from images.

Install pytesseract:

pip install pytesseract

Locate the tesseract executable:

sudo find / -name "tesseract"

Locate the tessdata directory:

sudo find / -name "tessdata"
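Once the two locations are known, pytesseract can be pointed at them. The sketch below is one possible way to do that and is not taken from the notebook: `locate_tesseract` and `configure` are hypothetical helper names, and the paths in the usage line are placeholders for whatever the find commands print. Depending on the Tesseract version, `TESSDATA_PREFIX` is expected to name either the tessdata directory itself or its parent, so check which one your install wants.

```python
import os
import shutil


def locate_tesseract():
    """Return the tesseract executable found on PATH, or None if there is none."""
    return shutil.which("tesseract")


def configure(tesseract_path=None, tessdata_dir=None):
    """Point pytesseract at a specific executable and/or tessdata directory."""
    if tessdata_dir:
        # Tesseract reads this environment variable to find its language data.
        os.environ["TESSDATA_PREFIX"] = str(tessdata_dir)
    if tesseract_path:
        # Imported here so the rest of the helper works before pytesseract is installed.
        import pytesseract

        pytesseract.pytesseract.tesseract_cmd = str(tesseract_path)
```

For example, `configure(tesseract_path="/usr/bin/tesseract", tessdata_dir="/usr/share/tesseract-ocr/tessdata")`, with both paths replaced by the ones found on your machine.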

Keep all the PNG files in one folder and replace path in the notebook with that folder's location.
Use "" to increase the DPI of the images.
For each input image, the notebook creates a text file, a box file and an hOCR file and saves them in the same directory as the image.
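The steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: `output_paths` and `ocr_folder` are hypothetical names, the `.txt`, `.box` and `.hocr` extensions are assumed, and `lang="eng"` is an assumed default. It relies on the pytesseract functions `image_to_string`, `image_to_boxes` and `image_to_pdf_or_hocr`, and on Pillow for opening the images.

```python
from pathlib import Path


def output_paths(image_path):
    """Return the .txt, .box and .hocr paths that sit next to the image."""
    image_path = Path(image_path)
    return {ext: image_path.with_suffix("." + ext) for ext in ("txt", "box", "hocr")}


def ocr_folder(folder, lang="eng"):
    """Write text, box and hOCR files for every PNG in the folder."""
    # Imported here so output_paths can be used without Tesseract installed.
    import pytesseract
    from PIL import Image

    for png in sorted(Path(folder).glob("*.png")):
        out = output_paths(png)
        with Image.open(png) as img:
            # Plain recognised text.
            out["txt"].write_text(
                pytesseract.image_to_string(img, lang=lang), encoding="utf-8"
            )
            # One line per character with its bounding box.
            out["box"].write_text(
                pytesseract.image_to_boxes(img, lang=lang), encoding="utf-8"
            )
            # hOCR output is returned as bytes.
            out["hocr"].write_bytes(
                pytesseract.image_to_pdf_or_hocr(img, lang=lang, extension="hocr")
            )
```

Calling `ocr_folder(path)` with the folder that holds the PNG files would then leave the three output files beside each image.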
