First, it is necessary to install Conda in order to execute the code.

- Create a new environment with the required libraries:

```shell
conda env create --file environment.yml
```

- Activate the environment:

```shell
conda activate EQUINOTHERAPY_PILOT_SLEEP
```

- Execute the whole project by running the main file:

```shell
python src/main.py
```
If `environment.yml` does not work, these are the specific library versions used in the project:
- Python: 3.11.5
- Luigi: 3.5.1
- scikit-learn: 1.2.2
- XGBoost: 2.0.3
- CatBoost: 1.2.5
- TensorFlow: 2.16.1
- Keras: 3.4.0
- NumPy: 1.26.4
- pandas: 2.2.2
- tqdm: 4.66.4
- imbalanced-learn: 0.12.3
- seaborn: 0.13.2
Note: some libraries are OS-specific. For instance, on macOS, `tensorflow-macos` and `tensorflow-metal` are required to enable GPU use.
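If the Conda specification cannot be resolved, an alternative is to install the pinned versions directly with pip. This is only a sketch: it assumes all packages are available on PyPI under these names, and (per the note above) macOS users would swap `tensorflow` for `tensorflow-macos` plus `tensorflow-metal`.

```shell
pip install luigi==3.5.1 scikit-learn==1.2.2 xgboost==2.0.3 \
    catboost==1.2.5 tensorflow==2.16.1 keras==3.4.0 \
    numpy==1.26.4 pandas==2.2.2 tqdm==4.66.4 \
    imbalanced-learn==0.12.3 seaborn==0.13.2
```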
Phase | Description | Script |
---|---|---|
Consolidation | The data obtained from the watches is matched with the sleep stages obtained from polysomnography (PSG). | consolidation.py |
Cleaning | Two filters are applied to clean the data. Time filter: only data recorded between 20:00 and 12:00 is kept. HRR filter: the signal is split into windows of `w_size` with `w_overlap`; heart rate recovery is calculated in each window, and any window that exceeds the threshold is excluded from the final file. | cleaning.py |
Scaling | The data is scaled to improve model performance: first, an adjustment of the median of the individual data for each patient is performed; then a global `RobustScaler` is applied to the entire dataset. | scaling.py |
Preprocessing | Feature extraction is performed (see features.py for details). The feature matrix of each patient is then normalized using a `StandardScaler` (Z-score normalization). | preprocessing.py |
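The scaling step can be sketched as follows. This is a minimal illustration on toy data: the column names and the per-patient grouping are assumptions; only the median adjustment and `RobustScaler` come from the description above.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import RobustScaler

# Toy frame: heart-rate samples for two patients (illustrative columns).
df = pd.DataFrame({
    "patient": ["A"] * 4 + ["B"] * 4,
    "hr": [60.0, 62.0, 61.0, 63.0, 80.0, 82.0, 81.0, 83.0],
})

# 1) Per-patient median adjustment: centre each patient's signal on 0,
#    removing individual baseline differences.
df["hr_adj"] = df["hr"] - df.groupby("patient")["hr"].transform("median")

# 2) Global RobustScaler over the entire dataset (median/IQR based,
#    so it is tolerant to outliers).
df["hr_scaled"] = RobustScaler().fit_transform(df[["hr_adj"]]).ravel()
```

After step 1 every patient is centred on zero, so the global scaler in step 2 only has to normalize the spread, not per-patient offsets.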
Phase | Description | Script |
---|---|---|
Partitioning | Two validation methods are used: Leave-One-Participant-Out and Stratified K-Fold. | partitioning.py |
Oversampling | To achieve a balanced dataset, the SMOTE algorithm is applied to the training set of both validation methods. | oversampling.py |
Training | A list of models is trained using the training set of both validation methods. These models are defined in models.py. Parameter optimization is performed for each partition (fold or participant), and only the best model of each partition is saved. | training.py |
Analysis | Various analysis approaches are conducted in this phase. | analysis.py |
Phase | Description | Script |
---|---|---|
Partitioning | Combining the aforementioned validation methods, Leave-One-Participant-Out divides the dataset into training and testing sets; a Stratified K-Fold with n_splits (see luigi.cfg for the value) is then applied to the training set, creating the training and validation folds. | partitioning.py |
Training | Using a DataGenerator, the training and validation sets are fed to the model. The model can be one of three versions (see lstm_creation.py); the choice is specified in luigi.cfg, as is the training mode, i.e. the list of inputs passed to the model (such as 3DACC, HR, and MAGACC). | training.py |
Analysis | Previously trained models are evaluated using the testing set, and confusion matrices are obtained. | analysis.py |
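The nested partitioning described above can be sketched with scikit-learn's splitters. This is a minimal illustration: the array shapes, labels, and the `n_splits` value are placeholders for what luigi.cfg and the real data provide.

```python
import numpy as np
from sklearn.model_selection import LeaveOneGroupOut, StratifiedKFold

# Toy dataset: 30 windows, 3 features, alternating binary sleep label,
# three participants of 10 windows each (all sizes are illustrative).
X = np.random.default_rng(0).normal(size=(30, 3))
y = np.tile([0, 1], 15)
groups = np.repeat([0, 1, 2], 10)

n_splits = 2  # stand-in for the n_splits value read from luigi.cfg
folds = []
for train_idx, test_idx in LeaveOneGroupOut().split(X, y, groups):
    # Outer loop: one participant held out as the testing set.
    inner = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=0)
    for tr, va in inner.split(X[train_idx], y[train_idx]):
        # Inner loop: training/validation folds within the remaining data.
        folds.append((train_idx[tr], train_idx[va], test_idx))

print(len(folds))  # participants x inner folds
```

The outer split keeps every window of a held-out participant out of training, while the inner Stratified K-Fold preserves the class proportions in each training/validation fold.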
Round | Patients used | N Phases | Description |
---|---|---|---|
1st | All Patients | 5 | Window Gaussian normalization, individuals only, MinMaxScaler (0,1) on feature matrix, no oversampling |
2nd | All Patients | 5 | Mean adjustment, global RobustScaler, MinMaxScaler (0,1) on feature matrix, feature selection (ANOVA), SMOTE oversampling |
3rd | All Patients | 5 | Mean adjustment, global RobustScaler, StandardScaler (Gaussian) on feature matrix, feature selection (ANOVA), SMOTE oversampling |
4th | HQ Patients | 5 | Mean adjustment, global RobustScaler, StandardScaler (Gaussian) on feature matrix, feature selection (ANOVA), SMOTE oversampling |
5th | MLQ Patients | 5 | Mean adjustment, global RobustScaler, StandardScaler (Gaussian) on feature matrix, feature selection (ANOVA), SMOTE oversampling |
6th | All Patients | 2 | Mean adjustment, global RobustScaler, StandardScaler (Gaussian) on feature matrix, feature selection (ANOVA), SMOTE oversampling |
Round | Description |
---|---|
First | 3DACC, HR and MAGACC with LSTM_1 |
Second | 3DACC, HR with LSTM_1 |