XTab: Cross-table Pretraining for Tabular Transformers
Bingzhao Zhu, Xingjian Shi, Nick Erickson, Mu Li, George Karypis, Mahsa Shoaran
TL;DR
XTab enables cross-table pretraining for tabular transformers by separating data-specific featurizers and projection heads from a shared backbone and training across many tables with federated learning. It validates across 84 AMLB tasks, showing that pretrained backbones consistently outperform randomly initialized baselines and that reconstruction objectives often yield the best downstream results. The framework supports multiple backbones and objectives, significantly boosting generalization and learning efficiency for downstream tabular tasks. This approach offers a scalable path toward cross-domain tabular pretraining and provides a foundation for further integration with tree-based methods and multimodal learning.
Abstract
The success of self-supervised learning in computer vision and natural language processing has motivated pretraining methods on tabular data. However, most existing tabular self-supervised learning models fail to leverage information across multiple data tables and cannot generalize to new tables. In this work, we introduce XTab, a framework for cross-table pretraining of tabular transformers on datasets from various domains. We address the challenge of inconsistent column types and quantities among tables by utilizing independent featurizers and using federated learning to pretrain the shared component. Tested on 84 tabular prediction tasks from the OpenML-AutoML Benchmark (AMLB), we show that (1) XTab consistently boosts the generalizability, learning speed, and performance of multiple tabular transformers, (2) by pretraining FT-Transformer via XTab, we achieve superior performance than other state-of-the-art tabular deep learning models on various tasks such as regression, binary, and multiclass classification.
