Pizza-Sales-Data-Analytics---End-to-End-Azure-Data-Engineering-Production-level-project1

🔧 Analyzing Sales of Pizza sales Data🔌

On-prem DB to Azure Cloud Pipeline with Data Factory, Lake Storage, Spark, Databricks, Synapse, PowerBI

📝 Table of Contents

Project Overview
Key Insights
Project Architecture
3.1. Data Ingestion
3.2. Data Transformation
3.3. Data Loading
3.4. Data Reporting
Credits
Contact

🔬 Project Overview

This an end-to-end data engineering project on the Azure cloud. Where I did data ingestion from a on-premise SQL Server to Azure Data Lake using Data Factory to transformation using Databricks and Spark, loading to Synapse, and reporting using PowerBI.

💾 Dataset

Dataset link : https://drive.google.com/file/d/1i4aRieq_WDVJDGpqtZq8UW9CH8sCbaBd/view?pli=1

Business Requirement.

Project steps to follow:

In this project we are going to create an end to end data platform right from Data Ingestion, Data Transformation, Data Loading and Reporting.

The tools that are covered in this project are,

SQL server migration
Azure Data Factory
Azure Data Lake Storage Gen2
Azure Databricks
PYSPARK
SPARK SQL
Microsoft Power BI

The use case for this project is building an end to end solution by ingesting the tables from on-premise SQL Server database using Azure Data Factory and then store the data in Azure Data Lake. Then Azure databricks is used to transform the RAW data to the most cleanest form of data and finally using Microsoft Power BI to integrate with Azure synapse analytics to build an interactive dashboard.

🎯 Project Goals

Establish a connection between on-premise SQL server and Azure cloud.
Ingest tables into the Azure Data Lake.
Apply data cleaning and transformation using Azure Databricks.
Utilize Azure Synapse Analytics for loading clean data.
Create interactive data visualizations and reports with Microsoft Power BI.

🕵️ Key Insights

💸 Total Revenue by Product Category
🌍 Sales by Pizza Name and size
- N°1: The L size pizza generated the total revenue of 45%.
- N°2: The M size pizza generated the total revenue of 30.49%.
🚻 Sales by Pizza category
- 26.91% of the revenue is generated by Classic pizza category
- 23.96% of the revenue is generated by Chicken pizza category

This can be explained by males have more interest in doing outdoor activites with the different categories of Bikes than females.

📝 Project Architecture

You can find the detailed information on the diagram below:

📤 Data Ingestion

Connected the on-premise SQL Server with Azure using Microsoft Integration Runtime.
Setup the Resource group with needed services (Storage Account, Data Factory, Databricks, Synapse Analytics)
Migrated the tables from on-premise SQL Server to Azure Data Lake Storage Gen2.

⚙️ Data Transformation

Mounted Azure Blob Storage to Databricks to retrieve raw data from the Data Lake.
Used Spark Cluster in Azure Databricks to clean and refine the raw data And do some aggregations.
Saved the cleaned data in a PARQUET format; optimized for further analysis.

📥 Data Loading(only required for instant analytics)

Used Azure Synapse Analytics to load the refined data efficiently.
Created SQL database and connected it to the data lake.

📊 Data Reporting

Connected Microsoft Power BI to Azure Synapse, and used the Views of the DB to create interactive and insightful data visualizations.

🛠️ Technologies Used

Data Source: SQL Server
Orchestration: Azure Data Factory
Ingestion: Azure Data Lake Gen2
Storage: Azure Synapse Analytics(if required)
Data Visualization: PowerBI

📋 Credits

This Project is inspired by the video of the YouTube Channel "Learn by doing it"

📨 Contact Me

LinkedIn • Gmail •

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
Assets		Assets
Dataset		Dataset
Power BI Report		Power BI Report
Source code Notebook files		Source code Notebook files
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pizza-Sales-Data-Analytics---End-to-End-Azure-Data-Engineering-Production-level-project1

🔧 Analyzing Sales of Pizza sales Data🔌

📝 Table of Contents

🔬 Project Overview

💾 Dataset

Business Requirement.

Project steps to follow:

🎯 Project Goals

🕵️ Key Insights

📝 Project Architecture

📤 Data Ingestion

⚙️ Data Transformation

📥 Data Loading(only required for instant analytics)

📊 Data Reporting

🛠️ Technologies Used

📋 Credits

📨 Contact Me

About

Releases

Packages

Languages

zBalachandar/Pizza-Sales-Data-Analytics---End-to-End-Azure-Data-Engineering-Production-level-project-01

Folders and files

Latest commit

History

Repository files navigation

Pizza-Sales-Data-Analytics---End-to-End-Azure-Data-Engineering-Production-level-project1

🔧 Analyzing Sales of Pizza sales Data🔌

📝 Table of Contents

🔬 Project Overview

💾 Dataset

Business Requirement.

Project steps to follow:

🎯 Project Goals

🕵️ Key Insights

📝 Project Architecture

📤 Data Ingestion

⚙️ Data Transformation

📥 Data Loading(only required for instant analytics)

📊 Data Reporting

🛠️ Technologies Used

📋 Credits

📨 Contact Me

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages