TabMGP: Martingale posterior with TabPFN
Kenyon Ng, Edwin Fong, David T. Frazier, Jeremias Knoblauch, Susan Wei
TL;DR
TabMGP introduces a novel martingale-posterior framework powered by TabPFN to obtain uncertainty about functionals $ heta(F_ inf)$ for tabular data without specifying a prior or likelihood. By forward sampling with TabPFN (and Bayesian bootstrap for covariates), it generates empirical samples $ heta(F_N)$ that approximate the posterior of $ heta(F_ inf)$ given data, yielding near-nominal coverage and expressive posteriors. Despite lacking strict martingale or a.c.i.d. guarantees for the predictive rule, TabMGP demonstrates robust uncertainty quantification across synthetic and real datasets, often outperforming handcrafted MGPs and standard Bayesian baselines. The work also highlights gaps in current theory, advocating empirical diagnostics and weaker conditions to justify validity in the era of foundation-model predictors.
Abstract
Bayesian inference provides principled uncertainty quantification but is often limited by challenges of prior and likelihood elicitation. The martingale posterior (MGP) (Fong et al., 2023) offers an alternative by replacing these requirements with a predictive rule. Additionally MGP focuses inference on parameters defined through a loss function. This framework is especially resonant in the era of foundation transformers; practitioners increasingly leverage models like TabPFN for their state-of-the-art capabilities, yet often require epistemic uncertainty for a scientific estimand $θ$ that need not parameterise the model's implicit latent model. The MGP provides the mechanism to recover these posterior distributions. We introduce TabMGP, an MGP built on TabPFN for tabular data. TabMGP produces credible sets with near-nominal coverage and often outperforms both handcrafted MGP constructions and standard Bayesian baselines.
