More Than Irrational: Modeling Belief-Biased Agents
Yifan Zhu, Sammie Katt, Samuel Kaski
TL;DR
This work introduces a computational-rationality framework for modeling belief-biased agents whose sub-optimal actions arise from memory-bound cognitive biases. It defines an explicit memory process $f_{\theta}$ that corrupts histories and yields biased beliefs $\tilde{b}_t$, enabling the CR agent to act optimally under $\tilde{b}_t$ with policy $\pi_*(\cdot|\tilde{b};\theta)$. To infer latent memory bounds online, the authors develop Nested Particle Filtering (NPF) to jointly track the internal memory state $\tilde{h}_{t-1}$ and the bound $\theta$ from passively observed actions, demonstrating accurate recovery with limited data. They validate the model on a T-maze task and show how the inferred bounds can power adaptive assistive-POMDP behavior, enabling AI assistants to tailor support to memory limitations. This provides a principled path toward modeling and assisting human collaborators whose decisions reflect bounded memory and biased beliefs.
Abstract
Despite the explosive growth of AI and the technologies built upon it, predicting and inferring the sub-optimal behavior of users or human collaborators remains a critical challenge. In many cases, such behaviors are not a result of irrationality, but rather a rational decision made given inherent cognitive bounds and biased beliefs about the world. In this paper, we formally introduce a class of computational-rational (CR) user models for cognitively-bounded agents acting optimally under biased beliefs. The key novelty lies in explicitly modeling how a bounded memory process leads to a dynamically inconsistent and biased belief state and, consequently, sub-optimal sequential decision-making. We address the challenge of identifying the latent user-specific bound and inferring biased belief states from passive observations on the fly. We argue that for our formalized CR model family with an explicit and parameterized cognitive process, this challenge is tractable. To support our claim, we propose an efficient online inference method based on nested particle filtering that simultaneously tracks the user's latent belief state and estimates the unknown cognitive bound from a stream of observed actions. We validate our approach in a representative navigation task using memory decay as an example of a cognitive bound. With simulations, we show that (1) our CR model generates intuitively plausible behaviors corresponding to different levels of memory capacity, and (2) our inference method accurately and efficiently recovers the ground-truth cognitive bounds from limited observations ($\le 100$ steps). We further demonstrate how this approach provides a principled foundation for developing adaptive AI assistants, enabling adaptive assistance that accounts for the user's memory limitations.
