Large Language Bayes

Justin Domke

Large Language Bayes

Justin Domke

TL;DR

This work introduces a novel framework called LLB that fuses large-language-model generated probabilistic models with probabilistic programming to form a joint distribution over models, data, and latent variables. By marginalizing over the space of generated models and combining per-model posteriors via data-driven weights derived from approximate marginal likelihoods, it achieves an interpretable Bayesian model averaging mechanism driven by user text and data. The authors provide a practical inference recipe leveraging self-normalized importance sampling and variationalBounds, and validate it across Rain, Coin, Polling, City Temperature, and Gold problems, showing that the approach often improves over naive flat ensembles and captures user intent. Theoretical analysis connects SNIS weights, ELBO bounds, and joint divergences, and the paper discusses limitations and future directions such as better model priors and scalable inference. Overall, the work demonstrates a principled path to turning informal problem descriptions into calibrated probabilistic predictions without committing to a single formal model.

Abstract

Many domain experts do not have the time or expertise to write formal Bayesian models. This paper takes an informal problem description as input, and combines a large language model and a probabilistic programming language to define a joint distribution over formal models, latent variables, and data. A posterior over latent variables follows by conditioning on observed data and integrating over formal models. This presents a challenging inference problem. We suggest an inference recipe that amounts to generating many formal models from the large language model, performing approximate inference on each, and then doing a weighted average. This is justified and analyzed as a combination of self-normalized importance sampling, MCMC, and importance-weighted variational inference. Experimentally, this produces sensible predictions from only data and an informal problem description, without the need to specify a formal model.

Large Language Bayes

TL;DR

Abstract

Large Language Bayes

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (47)

Theorems & Definitions (10)