Project Title: Cardiovascular Disease Prediction

Team Members: Fabian Roulin, Léa Goffinet, Samuel Mouny

Project Overview

This project aims to predict the likelihood of cardiovascular disease based on individual health factors. The analysis involves data exploration, feature engineering, and implementing machine learning algorithms for binary classification. This work is part of the EPFL Machine Learning Course, Fall 2024.
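
For readers new to binary classification, here is a minimal sketch of one common approach: logistic regression trained by gradient descent. This README does not say which algorithms the project uses, so the choice of model, the function names, the step size `gamma`, and the toy data below are all illustrative assumptions. The sketch uses only the Python standard library so it runs without installing anything.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def train_logistic_regression(X, y, gamma=0.5, max_iters=2000):
    """Fit weights by batch gradient descent on the mean log-loss.

    X: list of feature rows (include a constant 1.0 for the bias term).
    y: list of 0/1 labels.
    gamma: step size (hypothetical default, not taken from the project).
    """
    n, d = len(X), len(X[0])
    w = [0.0] * d
    for _ in range(max_iters):
        grad = [0.0] * d
        for row, label in zip(X, y):
            # Prediction error for this sample: sigmoid(w . x) - y
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, row))) - label
            for j in range(d):
                grad[j] += err * row[j] / n
        w = [wj - gamma * gj for wj, gj in zip(w, grad)]
    return w


def predict(X, w, threshold=0.5):
    """Return 1 when the predicted probability reaches the threshold, else 0."""
    return [
        int(sigmoid(sum(wj * xj for wj, xj in zip(w, row))) >= threshold)
        for row in X
    ]


# Toy data (made up): a bias column plus one hypothetical risk feature.
X = [[1.0, 0.1], [1.0, 0.2], [1.0, 0.8], [1.0, 0.9]]
y = [0, 0, 1, 1]
w = train_logistic_regression(X, y)
print(predict(X, w))
```

The decision threshold is exposed as a parameter because, on imbalanced health data, a value other than 0.5 is often worth tuning; whether the project does so is not stated here.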

Repository Structure

The repository is organized as follows:

data/: Placeholder for dataset files (data/raw/ for raw data).
doc/: Contains project description and data codebook.
notebooks/: Jupyter notebooks for data exploration, processing, model training, etc.
report/: Final report (LaTeX format) and accompanying figures.
src/: Custom package with modules:
- data_exploration, data_loading, data_processing
- models, train_pipeline, utils
tests/: Test files provided to validate the functions.

Root Files:

helpers.py: Helper functions for submission creation.
implementations.py: Project-required implementations of functions.
requirements.txt: Environment dependencies.
run.py: Script that produces the best predictions. Training the model may take a while.
setup.py: Script to install the src package.

Getting Started

Setup Instructions

Follow these steps to clone the repository, set up the environment, and install the package.

1. Clone the Repository

To clone this repository, use the following command:

git clone <repository_link>
cd <repository_name>

2. Install Dependencies

Set up the environment by installing required packages from requirements.txt:

pip install -r requirements.txt

3. Install the `src` Package

To install the custom src package, run:

pip install .

To install in editable mode, so that changes to the package take effect without reinstalling:

pip install -e .
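
To confirm that the installation worked, a quick check such as the following may help. It is only a sketch: the helper name `is_importable` is hypothetical, and the dotted module paths assume the modules listed under src/ are importable as `src.<module>`, which this README does not state explicitly.

```python
import importlib.util


def is_importable(module_name):
    """Return True if Python can locate `module_name` on the current path."""
    try:
        return importlib.util.find_spec(module_name) is not None
    except ModuleNotFoundError:
        # Raised when a parent package (e.g. `src`) is itself missing.
        return False


if __name__ == "__main__":
    # Assumed import paths; adjust if the package layout differs.
    for name in ["src", "src.models", "src.utils"]:
        status = "ok" if is_importable(name) else "not found"
        print(f"{name}: {status}")
```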

Running the Project

To execute the project pipeline and generate predictions:

python run.py

Usage

Data Preparation: Place the raw data files in data/raw/ before running the pipeline.
Exploration and Modeling: Use the notebooks in notebooks/ for step-by-step analysis and model development.
Final Report: The final analysis is documented in report/.
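
Before running the pipeline, it can be useful to verify that the raw data is in place. The sketch below does this with the standard library. The file names are placeholders, since this README does not list the dataset files, and `missing_raw_files` is a hypothetical helper, not part of the src package.

```python
from pathlib import Path


def missing_raw_files(expected, raw_dir="data/raw"):
    """Return the names in `expected` that are not present in `raw_dir`."""
    raw_path = Path(raw_dir)
    return [name for name in expected if not (raw_path / name).is_file()]


if __name__ == "__main__":
    # Placeholder file names: replace with the actual dataset files.
    expected_files = ["x_train.csv", "y_train.csv", "x_test.csv"]
    missing = missing_raw_files(expected_files)
    if missing:
        print("Missing from data/raw/:", ", ".join(missing))
    else:
        print("All expected raw data files found.")
```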

For further resources, visit:

Fab1205/ML-Project-1-cs433
