Minimal Algorithmic Information Loss Methods for Dimension Reduction, Feature Selection and Network Sparsification
Hector Zenil, Narsis A. Kiani, Alyssa Adams, Felipe S. Abrahão, Antonio Rueda-Toicen, Allan A. Zea, Luan Ozelim, Jesper Tegnér
TL;DR
The paper addresses the problem of reducing high-dimensional data and complex networks without discarding essential structural information. It introduces Minimal Information Loss Selection (MILS), an unsupervised method grounded in algorithmic information theory, utilizing the information-difference $I(G,F)=C(G)-C(G\setminus F)$ and block-based complexity via $\text{BDM}(x)=\sum_i \text{CTM}(x_i)+\log n$ together with $K(s) \approx -\log P(s)$ to guide edge deletions. Theoretical results establish bounds on information loss that scale with the number of perturbations and network properties, and MILS achieves polynomial-time performance while approaching exponential-optimal reductions. Empirically, MILS often outperforms traditional statistical dimensionality-reduction methods and state-of-the-art graph sparsification across networks and image data, while enabling lossy compression that preserves algorithmically meaningful structure. These findings suggest MILS as a versatile, principled tool for scalable data reduction with broad applicability in ML pipelines, network analysis, and data compression, with potential for hardware acceleration.
Abstract
We present a novel, domain-agnostic, model-independent, unsupervised, and universally applicable Machine Learning approach for dimensionality reduction based on the principles of algorithmic complexity. Specifically, but without loss of generality, we focus on addressing the challenge of reducing certain dimensionality aspects, such as the number of edges in a network, while retaining essential features of interest. These features include preserving crucial network properties like degree distribution, clustering coefficient, edge betweenness, and degree and eigenvector centralities but can also go beyond edges to nodes and weights for network pruning and trimming. Our approach outperforms classical statistical Machine Learning techniques and state-of-the-art dimensionality reduction algorithms by preserving a greater number of data features that statistical algorithms would miss, particularly nonlinear patterns stemming from deterministic recursive processes that may look statistically random but are not. Moreover, previous approaches heavily rely on a priori feature selection, which requires constant supervision. Our findings demonstrate the effectiveness of the algorithms in overcoming some of these limitations while maintaining a time-efficient computational profile. Our approach not only matches, but also exceeds, the performance of established and state-of-the-art dimensionality reduction algorithms. We extend the applicability of our method to lossy compression tasks involving images and any multi-dimensional data. This highlights the versatility and broad utility of the approach in multiple domains.
