Estimating cognitive biases with attention-aware inverse planning
Sounak Banerjee, Daphne Cornelisse, Deepak Gopinath, Emily Sumner, Jonathan DeCastro, Guy Rosman, Eugene Vinitsky, Mark K. Ho
TL;DR
This work addresses how attentional biases shape goal-directed behavior and proposes attention-aware inverse planning to infer these biases from observed actions. Building on value-guided construal, it defines an attention-limited decision process where construals of the environment are chosen via a bias-enhanced softmax over representations, with $VOR(s,C)=V(s,\pi_C)+\text{Cost}(C)$ and $R_C$, $T_C$ derived from the construed state. It introduces a bias function $H_{\lambda}$ to capture heuristics, and formalizes maximum-likelihood inference of $\lambda$ from trajectories, using exact dynamic programming in simple domains and a pre-trained policy to scale to driving tasks. The approach is validated in a tabular DrivingWorld and implemented in GPUDrive with Waymo Open Motion data, showing that certain biases are recoverable and that this method can outperform standard IRL in capturing attention-limited behavior. Overall, the paper demonstrates a scalable, interpretable framework that integrates cognitive modeling with deep RL to model and infer human-like attentional biases in complex, real-world scenarios.
Abstract
People's goal-directed behaviors are influenced by their cognitive biases, and autonomous systems that interact with people should be aware of this. For example, people's attention to objects in their environment will be biased in a way that systematically affects how they perform everyday tasks such as driving to work. Here, building on recent work in computational cognitive science, we formally articulate the attention-aware inverse planning problem, in which the goal is to estimate a person's attentional biases from their actions. We demonstrate how attention-aware inverse planning systematically differs from standard inverse reinforcement learning and how cognitive biases can be inferred from behavior. Finally, we present an approach to attention-aware inverse planning that combines deep reinforcement learning with computational cognitive modeling. We use this approach to infer the attentional strategies of RL agents in real-life driving scenarios selected from the Waymo Open Dataset, demonstrating the scalability of estimating cognitive biases with attention-aware inverse planning.
