This is a proof-of-concept project that scans PDF files for text data, and outputs the result into console Requirements Windows 10 Tesseract installed with Russian language data, and recognised in PATH