Welcome to the Power Query Practice repository! This project is focused on various data cleaning techniques using Power Query in Power BI.
In this project, we perform the following tasks:
- Column Profiling: Understanding the data types and distribution of values in each column.
- Dealing with Missing Values: Identifying and handling empty or blank values.
- Encoding Nulls: Converting null values into a format suitable for analysis.
- Imputing Missing Values: Replacing missing values with appropriate substitutes.
- Working with Dates: Cleaning and formatting date values.
- Adding New Columns: Creating new columns based on existing data.
- Splitting / Extracting Data: Dividing columns into multiple parts or extracting specific data.
- Extracting First Item: Extracting the first item from lists or text strings.
- Text / Sentiment Analysis: Performing basic text analysis and sentiment evaluation.
- Filtering Unnecessary Data: Removing irrelevant rows or columns to streamline the dataset.
To follow along with this project, download the sample data from Kaggle:
First, clone the repository to your local machine:
git clone https://github.com/MasterMindRomii/Data-Cleaning-Using-Power-Query
cd Data-Cleaning-Using-Power-Query