BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

Deepro Choudhury; Sinead Williamson; Adam Goliński; Ning Miao; Freddie Bickford Smith; Michael Kirchhof; Yizhe Zhang; Tom Rainforth

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

Deepro Choudhury, Sinead Williamson, Adam Goliński, Ning Miao, Freddie Bickford Smith, Michael Kirchhof, Yizhe Zhang, Tom Rainforth

TL;DR

BED-LLM reframes interactive information gathering with LLMs as a sequential Bayesian experimental design problem, deriving a joint model from the LLM and using information gain to select queries. It advocates a prior–likelihood pairing with belief filtering ($p_f(\theta; h_{t-1})$) over a purely in-context update, and employs a Rao–Blackwellized estimator to compute EIG for candidate questions. Across 20 Questions and preference elicitation tasks, BED-LLM demonstrates substantial gains over naive prompting and simpler baselines, and shows robustness to questioner–answerer model mismatch. The work provides a principled, scalable blueprint for turning LLMs into adaptive information-gathering agents with practical impact in tasks like preference elicitation and interactive surveys.

Abstract

We propose a general-purpose approach for improving the ability of Large Language Models (LLMs) to intelligently and adaptively gather information from a user or other external source using the framework of sequential Bayesian experimental design (BED). This enables LLMs to act as effective multi-turn conversational agents and interactively interface with external environments. Our approach, which we call BED-LLM (Bayesian Experimental Design with Large Language Models), is based on iteratively choosing questions or queries that maximize the expected information gain (EIG) about the task of interest given the responses gathered previously. We show how this EIG can be formulated (and then estimated) in a principled way using a probabilistic model derived from the LLM's predictive distributions and provide detailed insights into key decisions in its construction and updating procedure. We find that BED-LLM achieves substantial gains in performance across a wide range of tests based on the 20 questions game and using the LLM to actively infer user preferences, compared to direct prompting of the LLM and other adaptive design strategies.

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

TL;DR

Abstract

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)