ProtoGate: Prototype-based Neural Networks with Global-to-local Feature Selection for Tabular Biomedical Data
Xiangjian Jiang, Andrei Margeloiu, Nikola Simidjievski, Mateja Jamnik
TL;DR
HDLSS tabular biomedical data pose challenges for feature selection and prediction due to high dimensionality and limited samples. ProtoGate tackles this by integrating a global-to-local feature selector with a non-parametric prototype-based predictor, guarded by a disjoint training loss to avoid co-adaptation. The model uses soft global sparsity via $\ ext{\|W^{[1]}\|_1}$ and instance-specific local masks derived from Gaussian-perturbed activations, together with a differentiable $K$-NN over a prototype base and a hybrid NeuralSort-QuickSort scheme for efficient, explainable predictions. Empirical results on seven HDLSS real-world datasets and four non-HDLSS datasets show ProtoGate achieves higher accuracy with fewer selected features, maintains computational efficiency, and provides robust interpretability through prototypical explanations and feature fidelity, with an open-source implementation available.
Abstract
Tabular biomedical data poses challenges in machine learning because it is often high-dimensional and typically low-sample-size (HDLSS). Previous research has attempted to address these challenges via local feature selection, but existing approaches often fail to achieve optimal performance due to their limitation in identifying globally important features and their susceptibility to the co-adaptation problem. In this paper, we propose ProtoGate, a prototype-based neural model for feature selection on HDLSS data. ProtoGate first selects instance-wise features via adaptively balancing global and local feature selection. Furthermore, ProtoGate employs a non-parametric prototype-based prediction mechanism to tackle the co-adaptation problem, ensuring the feature selection results and predictions are consistent with underlying data clusters. We conduct comprehensive experiments to evaluate the performance and interpretability of ProtoGate on synthetic and real-world datasets. The results show that ProtoGate generally outperforms state-of-the-art methods in prediction accuracy by a clear margin while providing high-fidelity feature selection and explainable predictions. Code is available at https://github.com/SilenceX12138/ProtoGate.
