Explore insights from 100k orders (2016-2018) across Brazilian marketplaces. From data loading to model deployment, this project covers EDA, preprocessing, modeling, NLP for customer satisfaction, and customer segmentation.
Two sample CSV files (EDA.csv, Clustering Sample.csv) are provided in the repository for EDA and clustering analysis testing.
- Introduction
- Data Loading
- Data Cleaning
- Merging Dataframes
- Handling Missing Values
- Drop Duplicates
- Feature Engineering
- Exploratory Data Analysis (EDA)
- Univariate Analysis
- Multivariate Analysis
- Data Preprocessing
- Data Encoding
- Feature Scaling
- Handle Imbalance
- Modeling
- Apply ML Models
- Hyperparameter Tuning
- Pipeline
- NLP for Customer Satisfaction
- Customer Segmentation
- RFM Analysis
- K-Means
- Model Deployment (Classification & Clustering)
- Wrap Up & Conclusion