A principled framework for uncertainty decomposition in TabPFN
Sandra Fortini, Kenyon Ng, Sonia Petrone, Judith Rousseau, Susan Wei
TL;DR
This work develops a principled uncertainty decomposition framework for TabPFN by casting uncertainty quantification as Bayesian Predictive Inference (BPI) in a supervised, in-context setting. It derives a predictive CLT under quasi-martingale conditions, enabling fast, black-box estimates of epistemic and aleatoric uncertainty from the history of TabPFN's predictive updates and enabling asymptotically valid credible bands. The framework extends to entropy-based decomposition for classification via Beta/Dirichlet approximations and demonstrates near-nominal frequentist coverage on synthetic benchmarks and real data such as PSID labor-force participation. Taken together, the approach provides scalable, principled uncertainty quantification for foundation transformer models operating in tabular domains. The results highlight practical uncertainty diagnostics and open avenues for broader supervised-BPI extensions beyond TabPFN.
Abstract
TabPFN is a transformer that achieves state-of-the-art performance on supervised tabular tasks by amortizing Bayesian prediction into a single forward pass. However, there is currently no method for uncertainty decomposition in TabPFN. Because it behaves, in an idealised limit, as a Bayesian in-context learner, we cast the decomposition challenge as a Bayesian predictive inference (BPI) problem. The main computational tool in BPI, predictive Monte Carlo, is challenging to apply here as it requires simulating unmodeled covariates. We therefore pursue the asymptotic alternative, filling a gap in the theory for supervised settings by proving a predictive CLT under quasi-martingale conditions. We derive variance estimators determined by the volatility of predictive updates along the context. The resulting credible bands are fast to compute, target epistemic uncertainty, and achieve near-nominal frequentist coverage. For classification, we further obtain an entropy-based uncertainty decomposition.
