This is the project directory for Projet group 4 (Francisco Correia, Szymon Pajzert, Bijan Rad): Healthcare ECG & cardiology for the DL course @ école polytechnique. For this project the MIT-BIH Arrhythmia Dataset is used and it is available on Kaggle ECG Heartbeat Categorization Dataset and on Google drive here.
This directory has been adapted from the github project dldiy-gtsrb. Visit the link to see how a directry could be structured for easier testing of new architectures and parameters, tracking of results and improving of the models. View the TODO list for next steps and grading instructions.
Click on the links to interactively visualise the data:
The zipped data can download from here. Note that on running the commander.py
script for the first time, the data is automatically downloaded, unzipped, and saved for future use.
The project is structured as following:
.
├── loaders
| └── dataset selector
| └── ecg_loader.py # loading and pre-processing ecg data
| └── gdd.py # loading and unzipping data from google drive
├── models
| └── architecture selector
| └── lstm.py # baseline lstm model
| └── cnn1d.py # classical 1 dimensional CNN
| └── resnet1d.py # 1 dimensional CNN with residual skip connections
| └── resnet1d_v2.py # deeper version of resnet1d
├── toolbox
| └── optimizer and losses selectors
| └── logger.py # keeping track of most results during training and storage to static .html file
| └── metrics.py # code snippets for computing scores and main values to track
| └── plotter.py # snippets for plotting and saving plots to disk
| └── utils.py # various utility functions
├── notebooks
| └── graphs # plotly HTML graphs
| └── notebook1.ipynb # initial notebook with data visualisations
├── commander.py # main file from the project serving for calling all necessary functions for training and testing
├── args.py # parsing all command line arguments for experiments
├── trainer.py # pipelines for training, validation and testing
Experiments can be launched by calling commander.py
and a set of input arguments to customize the experiments. You can find the list of available arguments in args.py
and some default values. Note that not all parameters are mandatory for launching and most of them will be assigned their default value if the user does not modify them.
Here is a typical launch command and some comments:
python commander.py --dataset ecg --epochs 20 --root-dir ecg_data --model-name resnet1d_10 --batch-size 128 --lr 0.001 --scheduler ReduceLROnPlateau --optimizer adam --lr-decay 0.5 --step 15 --workers 1 --tensorboard --sampler equal --version bump
- this experiment is on the ecg (or ecg_bin) dataset which can be found in
--root-dir/ecg
trained over resnet1d_10. It optimizes with adam with initial learning rate (--lr
) of1e-3
which is decayed by half whenever the--scheduler
ReduceLRonPlateau does not see an improvement in the validation accuracy for more than--step
epochs. Input sequences are of size 187. In addition it saves intermediate results to--tensorboard
. - when debugging you can add the
--short-run
argument which performs training and validation epochs of 10 mini-batches. This allows testing your entire pipeline before launching an experiment - if you want to resume a previously paused experiment you can use the
--resume
flag which can continue the training from best, latest or a specifically designated epoch. - if you want to use your model only for evaluation on the test set, add the
--test
flag. When doing this, ensure that the--resume
flag is also being used (as well as the correct model name) otherwise testing will be done with an untrained model. - for unbalanced data sets the
--sampler
flag can be used to up-sample, alternatively--class-balance
can be used to penalize errors in under-represented classes more. The two flags are mutually exclusive. - to manage different runs of the same model, use
--version latest
to run on the last run or--version bump
to create a new run.
- this experiment is on the ecg (or ecg_bin) dataset which can be found in
Here are some more typical launch commands:
-
python commander.py --model-name cnn1d_3 --tensorboard --version bump
-
python commander.py --model-name resnet1d_10 --tensorboard --version bump
-
python commander.py --model-name resnet1d_v2 --tensorboard --version bump
To compute results on the testing run:
python commander.py --model-name cnn1d_3 --tensorboard --test --resume best --version latest
or in the case of testing older models
python commander.py --model-name cnn1d_3 --tensorboard --test --resume best --version 2
For each experiment a folder with the same name is created in the folder root-dir/ecg/runs
. This folder contains the following items:
.
├── checkpoints (\*.pth.tar) # models and logs are saved every epoch in .tar files. Non-modulo 5 epochs are then deleted.
├── best model (\*.pth.tar) # the currently best model for the experiment is saved separately
├── config.json # experiment hyperparameters
├── logger.json # scores and metrics from all training epochs (loss, learning rate, accuracy,etc.)
├── res # predictions for each sample from the validation set for every epoch
├── pred # predictions for each sample from the test set and other outputs e.g. confusion matrices
├── tensorboard # experiment values saved in tensorboard format
In order the visualize metrics and results in tensorboard you need to launch it separately (from the project directory): tensorboard --logdir ecg_data/ecg/runs
. You can then access tensorboard in our browser at localhost:6006. If you have performed multiple experiments, tensorboard will aggregate them in the same dashboard. To check if any tensorboard runs exist run tensorboard --inspect --logdir ecg_data/ecg/runs