Uses spacy's named entity recognition and tesseract to cherry-pick important data from images of Zillow house listings.
-
Updated
Sep 1, 2020 - Python
Uses spacy's named entity recognition and tesseract to cherry-pick important data from images of Zillow house listings.
A sample dataset of over 1000 Zillow listings, extracted using the Bright Data API, ideal for boosting your brand and analyzing competitors.
This library scrapes zillow property website
This is an end-to-end AWS Cloud ETL project. This data pipeline orchestration uses Apache Airflow on AWS EC2 as well as AWS Lambda. It demonstrates how to build ETL data pipeline that would perform data transformation using Lambda function as well as loading into a Redshift cluster table. The data would then be visualized using Amazon QuickSight.
Add a description, image, and links to the zillow-house-listings topic page so that developers can more easily learn about it.
To associate your repository with the zillow-house-listings topic, visit your repo's landing page and select "manage topics."