Skip to content

This project involves predictive analysis of credit card default payments in Taiwan, utilizing demographic and financial variables from the UCI Machine Learning Repository. The objective is to build an accurate classification model to identify potential defaulters.

Notifications You must be signed in to change notification settings

rohanarora03/Credit-Card-Defaulters-Classification

Repository files navigation

Credit-Card-Defaulters-Classification

Description: This project involves predictive analysis of credit card default payments in Taiwan, utilizing demographic and financial variables from the UCI Machine Learning Repository. The objective is to build an accurate classification model to identify potential defaulters.

Project Overview: Credit card default prediction is crucial for financial institutions to manage credit risk and reduce financial losses. This project employs various machine learning algorithms to predict default payments, enhancing decision-making processes in the credit industry.

Key Features:

  • Data Processing and ETL: Efficient extraction, preprocessing, transformation, and feature engineering of the data to ensure high-quality inputs for the machine learning models.

  • Statistical Analysis and EDA: Thorough exploratory data analysis (EDA) to understand the data distribution, identify patterns, and detect anomalies. Statistical analysis helps in selecting the most accurate predictors of defaulters.

  • Correlation Analysis: Identifying significant relationships between variables to enhance the predictive power of the models.

  • Data Engineering and Modeling: Applying data engineering and data warehousing techniques to create a structured and optimized dataset for model training, employing data modeling techniques to build and refine machine learning models.

  • Model Development: Utilization of various classification & ensemble algorithms, including Logistic Regression, Decision Trees, Random Forest, XGBoost, and LightGBM, focusing on achieving high prediction accuracy. The LightGBM algorithm, in particular, achieved an impressive 82% accuracy.

  • Model Evaluation: Assessing model performance using data mining techniques such as ROC curve, AUC, and confusion matrix to ensure reliable predictions and identify key predictors of default payments.

Results: The LightGBM algorithm achieved the highest prediction accuracy of 82%. Key predictors of default payments were identified, providing valuable insights into the factors influencing credit card defaults.

Conclusion: This project demonstrates the effective use of machine learning techniques to predict credit card defaulters. The insights gained can be utilized by financial institutions to mitigate risks and make informed decisions.

About

This project involves predictive analysis of credit card default payments in Taiwan, utilizing demographic and financial variables from the UCI Machine Learning Repository. The objective is to build an accurate classification model to identify potential defaulters.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published