A Survey on Recent Advances in Self-Organizing Maps
Axel Guérin, Pierre Chauvet, Frédéric Saubion
TL;DR
Self-Organising Maps project a $p$-dimensional input space onto a two-dimensional map and organize similar observations into topologically ordered regions. The survey catalogs advances in data management, topology/metrics, learning strategies, visualization, performance, and hyperparameterisation from 2014 onward. Notable contributions include SS-SOM for semi-supervised clustering, FWSOM for feature weighting, DBSOM with Wasserstein distances, non-Euclidean and multilinear distance frameworks, and PSOM/DPSOM for probabilistic clustering and time-series. The review highlights commercial relevance, especially for customer analytics, while noting ongoing challenges in handling diverse data types, missing values, scalability, and automated hyperparameter tuning.
Abstract
Self-organising maps are a powerful tool for cluster analysis in a wide range of data contexts. From the pioneer work of Kohonen, many variants and improvements have been proposed. This review focuses on the last decade, in order to provide an overview of the main evolution of the seminal SOM algorithm as well as of the methodological developments that have been achieved in order to better fit to various application contexts and users' requirements. We also highlight a specific and important application field that is related to commercial use of SOM, which involves specific data management.
