Bayesian Teaching Enables Probabilistic Reasoning in Large Language Models
Linlu Qiu, Fei Sha, Kelsey Allen, Yoon Kim, Tal Linzen, Sjoerd van Steenkiste
TL;DR
<3-5 sentence high-level summary> The paper investigates whether large language models (LLMs) form and update probabilistic beliefs in line with Bayesian inference. It introduces a flight-recommendation task and a normative Bayesian Assistant as a gold-standard updater, revealing that off-the-shelf LLMs struggle to update beliefs over rounds. It then shows that supervised fine-tuning via Bayesian teaching—training LLMs to mimic the Bayesian Assistant—substantially improves belief updating and allows generalization to new tasks and domains (hotel recommendations and web shopping), including interactions with real humans. The findings demonstrate that LLMs can acquire transferable probabilistic reasoning skills from demonstrations, enabling robust decision-making under uncertainty across diverse applications and settings.
Abstract
Artificial intelligence systems based on large language models (LLMs) are increasingly used as agents that interact with users and with the world. To do so successfully, LLMs need to construct internal representations of the world and form probabilistic beliefs about those representations. To provide a user with personalized recommendations, for example, the LLM needs to gradually infer the user's preferences, over the course of multiple interactions. To evaluate whether contemporary LLMs are able to do so, we use the Bayesian inference framework from probability theory, which lays out the optimal way to update an agent's beliefs as it receives new information. We first show that LLMs do not update their beliefs as expected from the Bayesian framework, and that consequently their predictions do not improve as expected as more information becomes available. To address this issue, we teach the LLMs to reason in a Bayesian manner by training them to mimic the predictions of the normative Bayesian model. We find that this approach not only significantly improves the LLM's performance on the particular recommendation task it is trained on, but also enables generalization to other tasks. This suggests that this method teaches the LLM to better approximate Bayesian reasoning. More generally, our results indicate that LLMs can effectively learn reasoning skills from examples and generalize those skills to new domains.
