
A Robotic Process Automation (RPA) bot that automatically extracts and filters important data from PDF files using scraping techniques, then writes the filtered data to an Excel workbook.

AlvinOctaH/PDF-scraper-bot


PDF-scraper-bot



1. Introduction

This repository documents the build process and results of a Robotic Process Automation (RPA) bot that automatically extracts and filters important data from PDF files using scraping techniques.

2. Build Process

This RPA bot was built with UiPath and uses Adobe Acrobat to open PDF documents. Using scraping techniques, the bot retrieves data from the PDF and enters it into a predefined Excel workbook (ScrapedInvoiceOutput.xlsx). The fields retrieved from the PDF are Invoice Number, Invoice Date, Customer Name, Total Amount, and Customer ID. The construction of the workflow is shown in the following image:

[Image: Build Process Robotic Process Automation (RPA)]
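The bot itself is a UiPath workflow, so the repository has no source code to quote here. For readers who want to see the field-extraction idea in text form, the following is a minimal Python sketch of pulling the same five fields out of text that has already been read from an invoice PDF. It is not the UiPath implementation. The label patterns, the sample invoice text, and the names `FIELD_PATTERNS`, `extract_invoice_fields`, and `to_row` are all assumptions made for illustration, since the layout of the real invoices is not described in this README.

```python
import re

# Hypothetical label patterns: the real invoice layout is not documented,
# so these regular expressions are assumptions for illustration only.
FIELD_PATTERNS = {
    "Invoice Number": r"Invoice\s+Number\s*:\s*(\S+)",
    "Invoice Date": r"Invoice\s+Date\s*:\s*(\d{1,2}[/-]\d{1,2}[/-]\d{2,4})",
    "Customer Name": r"Customer\s+Name\s*:\s*(.+)",
    "Total Amount": r"Total\s+Amount\s*:\s*\$?\s*(\d[\d,]*(?:\.\d{2})?)",
    "Customer ID": r"Customer\s+ID\s*:\s*(\S+)",
}

# Column order assumed for one output row; the actual layout of
# ScrapedInvoiceOutput.xlsx is not specified in this README.
COLUMNS = list(FIELD_PATTERNS)


def extract_invoice_fields(text):
    """Return a dict of the five invoice fields; missing fields map to None."""
    fields = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = re.search(pattern, text, flags=re.IGNORECASE)
        fields[name] = match.group(1).strip() if match else None
    return fields


def to_row(fields):
    """Flatten the field dict into a list in COLUMNS order."""
    return [fields[name] for name in COLUMNS]


# Made-up sample text standing in for the text scraped from one PDF.
SAMPLE_TEXT = """\
Invoice Number: INV-1001
Invoice Date: 12/03/2023
Customer Name: Jane Doe
Customer ID: C-042
Total Amount: $1,250.00
"""

if __name__ == "__main__":
    print(to_row(extract_invoice_fields(SAMPLE_TEXT)))
```

Returning `None` for a field that is not found keeps a partially readable invoice from aborting the whole run; the empty cell can then be reviewed by hand in the output workbook.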

3. Results

The results of the data scraping are shown in the following figure:

[Image: Results of the data scraping]
