Simple Self-Organizing Maps (SimpSOM) is a lightweight Python 3 library to train SOM. It offers an efficient way of training SOM in Python while keeping its implementation simple and easy to read.
Version 3 is a rewriting focusing on performance.
simpsom
can be downloaded from PyPI with
pip install simpsom
To install the latest (unreleased) version you can download it from this repository by running
git clone https://github.com/fcomitani/simpsom
cd simpsom
python setup.py install
Core dependencies:
- numpy
- scikit-learn
- matplotlib
If available, CuPy
can be used to run simpsom
on the GPU.
CuML
is also optional, but will allow you
to run clustering on the GPU as well.
For a full list see requirements.txt
Running simpsom
is easy. After setting up a network by providing size and tiling style,
train it with the train
method.
import simpsom as sps
net = sps.SOMNet(20, 20, data, topology='hexagonal',
init='PCA', metric='cosine',
neighborhood_fun='gaussian', PBC=True,
random_seed=32, GPU=False, CUML=False,
output_path="./")
net.train(train_algo='batch', start_learning_rate=0.01, epochs=-1,
batch_size=-1)
The trained map can be saved to disk.
net.save_map("./trained_som.npy")
The results can be inspected with a variety of plotting functions.
net.plot_map_by_difference(show=True, print_out=True)
net.plot_projected_points(projected_data, color_val=[n.difference for n in net.nodes_list],
project=False, jitter=False,
show=True, print_out=False)
Detailed documentation, API references and tutorials can be found here.
Here are some of the research works that use SimpSOM:
Postema, J. T. (2019). Explaining system behaviour in radar systems (Master's thesis, University of Twente).
Lorenzi, C., Barriere, S., Villemin, J. P., Dejardin Bretones, L., Mancheron, A., & Ritchie, W. (2020). iMOKA: k-mer based software to analyze large collections of sequencing data. Genome biology, 21(1), 1-19.
Saunders, J. K., McIlvin, M. R., Dupont, C. L., Kaul, D., Moran, D. M., Horner, T., ... & Saito, M. A. (2022). Microbial functional diversity across biogeochemical provinces in the central Pacific Ocean. Proceedings of the National Academy of Sciences, 119(37), e2200014119.
Contributions are always welcome. If you would like to help us improve this library please fork the main branch and make sure pytest pass after your changes.
When using this library for your work, please cite the appropriate version from Zenodo
Federico Comitani. (2022). SimpSOM (v2.0.2). Zenodo. https://zenodo.org/record/7187332