A Neural Network Alternative to Tree-based Models
Salvatore Raieli, Nathalie Jeanray, Stéphane Gerart, Sebastien Vachenc, Abdulrahman Altahhan
TL;DR
This work tackles the challenge of applying neural networks to tabular biological data, where tree-based methods excel in interpretability but neural nets struggle with performance. It introduces sTabNet, a sparse, attention-guided neural network that enforces a priori sparsity via a feature-wise adjacency mask and yields intrinsic feature importance, enabling end-to-end interpretability without post-hoc explanations. Across METABRIC, TCGA-BRCA/LUAD, single-cell, and survival analyses, sTabNet achieves competitive or superior performance to XGBoost while offering transfer-learning capabilities and meaningful latent representations. The approach demonstrates robust in-domain and out-of-domain adaptation and suggests that sparse tabular foundations with attention can be scalable and explainable for biomedical AI applications.
Abstract
Tabular datasets are widely used in scientific disciplines such as biology. While these disciplines have already adopted AI methods to enhance their findings and analysis, they mainly use tree-based methods due to their interpretability. At the same time, artificial neural networks have been shown to offer superior flexibility and depth for rich and complex non-tabular problems, but they are falling behind tree-based models for tabular data in terms of performance and interpretability. Although sparsity has been shown to improve the interpretability and performance of ANN models for complex non-tabular datasets, enforcing sparsity structurally and formatively for tabular data before training the model, remains an open question. To address this question, we establish a method that infuses sparsity in neural networks by utilising attention mechanisms to capture the features' importance in tabular datasets. We show that our models, Sparse TABular NET or sTAB-Net with attention mechanisms, are more effective than tree-based models, reaching the state-of-the-art on biological datasets. They further permit the extraction of insights from these datasets and achieve better performance than post-hoc methods like SHAP.
