Estimating Global Input Relevance and Enforcing Sparse Representations with a Scalable Spectral Neural Network Approach
Lorenzo Chicchi, Lorenzo Buffoni, Diego Febbe, Lorenzo Giambagli, Raffaele Marino, Duccio Fanelli
TL;DR
The paper addresses the challenge of identifying globally relevant input features for deep neural networks and enabling sparse representations for explainability. It introduces a spectral parametrization in which a transfer operator $A=\Phi\Lambda\Phi^{-1}$ is optimized, and input relevance is encoded in the eigenvalues $\lambda_i$, allowing a byproduct feature ranking during training and regularization-based sparsification. The approach is validated across independent Gaussian, correlated Gaussian, MNIST, and stellar spectra datasets, showing that nonzero eigenvalues correctly flag relevant inputs, align with domain-discriminative metrics, and enable accurate performance with only a small subset of inputs. Compared to post-hoc methods like SHAP, LRP, and gradient-based explanations, the spectral method yields a fast, globally applicable relevance score and directly enforces sparsity during training, signaling a practical path toward explainable and compressible neural models for scientific applications.
Abstract
In machine learning practice it is often useful to identify relevant input features. Isolating key input elements, ranked according their respective degree of relevance, can help to elaborate on the process of decision making. Here, we propose a novel method to estimate the relative importance of the input components for a Deep Neural Network. This is achieved by leveraging on a spectral re-parametrization of the optimization process. Eigenvalues associated to input nodes provide in fact a robust proxy to gauge the relevance of the supplied entry features. Notably, the spectral features ranking is performed automatically, as a byproduct of the network training, with no additional processing to be carried out. Moreover, by leveraging on the regularization of the eigenvalues, it is possible to enforce solutions making use of a minimum subset of the input components, increasing the explainability of the model and providing sparse input representations. The technique is compared to the most common methods in the literature and is successfully challenged against both synthetic and real data.
