Minimal Learning Machine for Multi-Label Learning
Joonas Hämäläinen, Antoine Hubermont, Amauri Souza, César L. C. Mattos, João P. P. Gomes, Tommi Kärkkäinen
TL;DR
The paper tackles multi-label classification by reframing learning as a distance-regression problem that maps input-space distances to output-space distances. It introduces ML-MLM, which combines the MLM distance regression with inverse distance weighting to produce deterministic, ranking-friendly predictions, with hyper-parameters selected via a closed-form LOOCV-based ranking loss. Empirical results on ten datasets show ML-MLM achieves competitive ranking performance relative to state-of-the-art methods and favorable bipartition metrics, while offering interpretable uncertainty estimates through predicted distances. The approach provides a lightweight, parameter-efficient alternative for small-to-moderate MLC tasks and outlines practical considerations for uncertainty, thresholding, and complexity, along with avenues for extensions such as ensembles and feature selection.
Abstract
Distance-based supervised method, the minimal learning machine, constructs a predictive model from data by learning a mapping between input and output distance matrices. In this paper, we propose new methods and evaluate how their core component, the distance mapping, can be adapted to multi-label learning. The proposed approach is based on combining the distance mapping with an inverse distance weighting. Although the proposal is one of the simplest methods in the multi-label learning literature, it achieves state-of-the-art performance for small to moderate-sized multi-label learning problems. In addition to its simplicity, the proposed method is fully deterministic: Its hyper-parameter can be selected via ranking loss-based statistic which has a closed form, thus avoiding conventional cross-validation-based hyper-parameter tuning. In addition, due to its simple linear distance mapping-based construction, we demonstrate that the proposed method can assess the uncertainty of the predictions for multi-label classification, which is a valuable capability for data-centric machine learning pipelines.
