VisionScript

A Python3 script to convert multiple images of scanned text into a single word document using the Google Vision API and python-docx.

Follow the steps below to download, install, and run this project.

Dependencies

Install these prerequisites:

Python: https://www.python.org/downloads/
Google Cloud Vision API Access: https://cloud.google.com/vision/

Step 1. Clone the project or download ZIP

git clone https://github.com/teraflik/VisionScript.git

Step 2. Install dependencies

Open PowerShell or Bash and type:

$ cd VisionScript
$ pip install -r requirements.txt

Step 3. Set up your Google Cloud API Key:

On Windows go to Environment Variables and add a new key. Set Variable Name to GOOGLE_APPLICATION_CREDENTIALS and Variable Value to the path where the your access key is stored.

Step 4. Store your images and run the script

Copy your images to the images\ folder alongside main.py and execute the script by double-clicking it or typing in console:

python main.py

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VisionScript

Dependencies

Step 1. Clone the project or download ZIP

Step 2. Install dependencies

Step 3. Set up your Google Cloud API Key:

Step 4. Store your images and run the script

Step 5. Output is stored in Word.docx

About

Releases

Packages

Languages

teraflik/VisionScript

Folders and files

Latest commit

History

Repository files navigation

VisionScript

Dependencies

Step 1. Clone the project or download ZIP

Step 2. Install dependencies

Step 3. Set up your Google Cloud API Key:

Step 4. Store your images and run the script

Step 5. Output is stored in Word.docx

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages