Learning Active Subspaces and Discovering Important Features with Gaussian Radial Basis Functions Neural Networks
Danny D'Agostino, Ilija Ilievski, Christine Annette Shoemaker
TL;DR
The paper addresses the interpretability of nonlinear predictive models on tabular data by proposing a Gaussian Radial Basis Function Neural Network (GRBFNN) with a learnable precision matrix $\mathbf{P}$ in the Gaussian kernel. By jointly learning $\mathbf{w}$ and the kernel parameters, and regularizing $\mathbf{P}$ via $\lambda_{\mathbf{u}}$ and $\lambda_{\mathbf{w}}$, the model enables extraction of an active subspace and a feature-importance ranking from the spectrum of $\mathbf{P}$ after training. The authors provide theory and practical methods for obtaining the active subspace directions (eigenvectors of $\mathbf{P}$) and a Jacobian-based feature-importance measure, demonstrate visualization in the active subspace, and show through extensive experiments that GRBFNN achieves competitive predictive performance while offering meaningful interpretability. The work includes two center-selection schemes (unsupervised and supervised), a comprehensive benchmark against standard ML models and deep embedding methods, and an analysis of regularization and dimensionality-reduction behavior, with public PyTorch code available for replication. Overall, the GRBFNN framework advances interpretable nonlinear modeling for tabular data and offers tools for supervised dimensionality reduction and feature selection in real-world applications such as healthcare and engineering optimization.
Abstract
Providing a model that achieves a strong predictive performance and is simultaneously interpretable by humans is one of the most difficult challenges in machine learning research due to the conflicting nature of these two objectives. To address this challenge, we propose a modification of the radial basis function neural network model by equipping its Gaussian kernel with a learnable precision matrix. We show that precious information is contained in the spectrum of the precision matrix that can be extracted once the training of the model is completed. In particular, the eigenvectors explain the directions of maximum sensitivity of the model revealing the active subspace and suggesting potential applications for supervised dimensionality reduction. At the same time, the eigenvectors highlight the relationship in terms of absolute variation between the input and the latent variables, thereby allowing us to extract a ranking of the input variables based on their importance to the prediction task enhancing the model interpretability. We conducted numerical experiments for regression, classification, and feature selection tasks, comparing our model against popular machine learning models, the state-of-the-art deep learning-based embedding feature selection techniques, and a transformer model for tabular data. Our results demonstrate that the proposed model does not only yield an attractive prediction performance compared to the competitors but also provides meaningful and interpretable results that potentially could assist the decision-making process in real-world applications. A PyTorch implementation of the model is available on GitHub at the following link. https://github.com/dannyzx/Gaussian-RBFNN
