AdapTable: Test-Time Adaptation for Tabular Data via Shift-Aware Uncertainty Calibrator and Label Distribution Handler
Changhun Kim, Taewon Kim, Seungyeon Woo, June Yong Yang, Eunho Yang
TL;DR
The paper tackles distribution shifts in tabular data, which undermine reliability in real-world deployments. It proposes AdapTable, a two-stage test-time adaptation framework leveraging a shift-aware uncertainty calibrator (via a graph neural network over table columns) and a label distribution handler to align predictions with the target label distribution without updating model parameters. Theoretical analysis bounds the adaptation error and highlights the importance of accurately estimating the target distribution and controlling representation shift. Empirically, AdapTable achieves state-of-the-art performance across six datasets and six corruptions, with substantial gains on HELOC and strong cross-architecture robustness, making it a practical, privacy-friendly TTA solution for tabular domains.
Abstract
In real-world scenarios, tabular data often suffer from distribution shifts that threaten the performance of machine learning models. Despite its prevalence and importance, handling distribution shifts in the tabular domain remains underexplored due to the inherent challenges within the tabular data itself. In this sense, test-time adaptation (TTA) offers a promising solution by adapting models to target data without accessing source data, crucial for privacy-sensitive tabular domains. However, existing TTA methods either 1) overlook the nature of tabular distribution shifts, often involving label distribution shifts, or 2) impose architectural constraints on the model, leading to a lack of applicability. To this end, we propose AdapTable, a novel TTA framework for tabular data. AdapTable operates in two stages: 1) calibrating model predictions using a shift-aware uncertainty calibrator, and 2) adjusting these predictions to match the target label distribution with a label distribution handler. We validate the effectiveness of AdapTable through theoretical analysis and extensive experiments on various distribution shift scenarios. Our results demonstrate AdapTable's ability to handle various real-world distribution shifts, achieving up to a 16% improvement on the HELOC dataset.
