TabNet: Attentive Interpretable Tabular Learning
Sercan O. Arik, Tomas Pfister
TL;DR
The paper tackles the challenge of high-performance, interpretable tabular learning by introducing TabNet, a canonical deep architecture that uses sequential attentive feature masks to select salient features at each decision step. It combines sparse, instance-wise feature selection with multi-step reasoning, achieving strong empirical results and enabling both local and global interpretability. A key contribution is the demonstration of self-supervised pre-training for tabular data, which yields substantial gains in downstream supervised tasks and faster convergence. Overall, TabNet provides a competitive, interpretable alternative to tree ensembles and introduces a new direction for unsupervised representation learning in tabular domains.
Abstract
We propose a novel high-performance and interpretable canonical deep tabular data learning architecture, TabNet. TabNet uses sequential attention to choose which features to reason from at each decision step, enabling interpretability and more efficient learning as the learning capacity is used for the most salient features. We demonstrate that TabNet outperforms other neural network and decision tree variants on a wide range of non-performance-saturated tabular datasets and yields interpretable feature attributions plus insights into the global model behavior. Finally, for the first time to our knowledge, we demonstrate self-supervised learning for tabular data, significantly improving performance with unsupervised representation learning when unlabeled data is abundant.
