AI-Powered Bayesian Inference
Sean O'Hagan, Veronika Ročková
TL;DR
This paper proposes AI-powered Bayesian inference by treating generative AI outputs as priors on the data-generating distribution via a Dirichlet process, enabling coherent uncertainty quantification that leverages AI predictions without fully trusting them for decision making. It develops likelihood-driven AI priors (Power, Expected-Posterior, Catalytic) and a loss-based non-parametric framework (Bayes without likelihood) that places priors on the data-generating distribution $F_0$ and uses posterior bootstrap for computation. Theoretical results center on the concentration parameter $\alpha$, including coverage-based and asymptotic calibration to align AI-prior influence with frequentist properties. Empirical illustrations in dermatology and astronomy demonstrate improved predictive calibration and narrower uncertainty regions when AI priors are appropriately tuned, while highlighting the danger of over-reliance on AI priors as $\alpha$ grows large. Overall, the work offers a flexible, scalable, and principled route to integrate AI-driven predictions into Bayesian inference with quantified uncertainty across diverse domains.
Abstract
The advent of Generative Artificial Intelligence (GAI) has heralded an inflection point that changed how society thinks about knowledge acquisition. While GAI cannot be fully trusted for decision-making, it may still provide valuable information that can be integrated into a decision pipeline. Rather than seeing the lack of certitude and inherent randomness of GAI as a problem, we view it as an opportunity. Indeed, variable answers to given prompts can be leveraged to construct a prior distribution which reflects assuredness of AI predictions. This prior distribution may be combined with tailored datasets for a fully Bayesian analysis with an AI-driven prior. In this paper, we explore such a possibility within a non-parametric Bayesian framework. The basic idea consists of assigning a Dirichlet process prior distribution on the data-generating distribution with AI generative model as its baseline. Hyper-parameters of the prior can be tuned out-of-sample to assess the informativeness of the AI prior. Posterior simulation is achieved by computing a suitably randomized functional on an augmented data that consists of observed (labeled) data as well as fake data whose labels have been imputed using AI. This strategy can be parallelized and rapidly produces iid samples from the posterior by optimization as opposed to sampling from conditionals. Our method enables (predictive) inference and uncertainty quantification leveraging AI predictions in a coherent probabilistic manner.
