This project aims to detect fraudulent health and car insurance claims using machine learning models. The project is built using Streamlit for the user interface and employs Decision Tree Classifiers for the detection. The models are trained on separate datasets for health and car insurance claims.
- Real-time Fraud Detection: Detects fraudulent claims in real-time using trained models.
- Interactive Interface: User-friendly interface built with Streamlit.
- Separate Models: Different models for health and car insurance claims.
- Visualization: Visual representation of class distribution in the datasets.
To get started with this project, follow these steps:
-
Clone the repository:
git clone https://github.com/yourusername/InsuranceFraudDetection.git cd InsuranceFraudDetection
-
Create a virtual environment:
bash python -m venv venv source venv/bin/activate # On Windows use
venv\Scripts\activate
-
Install the required packages:
bash pip install -r requirements.txt
-
Download the datasets: Place the insurance_dataset.xlsx and health_insurance_data.csv files in the root directory of the project.
-
Run the Streamlit application:
bash streamlit run streamliteasysurance.py
-
Select the Insurance Type:
-
Use the sidebar to select either "Car Insurance" or "Health Insurance".
-
Enter the Claim Data: Fill in the required fields in the sidebar based on the selected insurance type.
-
Submit the Data: Click the "Submit" button to get the fraud detection result.
Here is a demo result of the fraud detection:
Streamlit
Pandas
NumPy
Matplotlib
Seaborn
scikit-learn