Information Seeking for Robust Decision Making under Partial Observability

Djengo Cyun-Jyun Fang; Tsung-Wei Ke

Information Seeking for Robust Decision Making under Partial Observability

Djengo Cyun-Jyun Fang, Tsung-Wei Ke

TL;DR

The paper tackles robust decision making under partial observability and noisy dynamics by linking internal model alignment to explicit information seeking. It introduces InfoSeeker, an LLM-based agent that interleaves targeted information-seeking actions with task planning in a closed loop, aiming to realign its internal dynamics with the environment. A novel text-based benchmark evaluates planning under both observation and dynamics uncertainty, and results show a substantial 74% absolute improvement over prior methods with strong generalization across LLMs and tasks, while preserving sample efficiency. The work also formalizes connections between LLM planning and POMDPs and highlights the practical impact of explicit information seeking for real-world robustness in uncertain environments.

Abstract

Explicit information seeking is essential to human problem-solving in practical environments characterized by incomplete information and noisy dynamics. When the true environmental state is not directly observable, humans seek information to update their internal dynamics and inform future decision-making. Although existing Large Language Model (LLM) planning agents have addressed observational uncertainty, they often overlook discrepancies between their internal dynamics and the actual environment. We introduce Information Seeking Decision Planner (InfoSeeker), an LLM decision-making framework that integrates task-oriented planning with information seeking to align internal dynamics and make optimal decisions under uncertainty in both agent observations and environmental dynamics. InfoSeeker prompts an LLM to actively gather information by planning actions to validate its understanding, detect environmental changes, or test hypotheses before generating or revising task-oriented plans. To evaluate InfoSeeker, we introduce a novel benchmark suite featuring partially observable environments with incomplete observations and uncertain dynamics. Experiments demonstrate that InfoSeeker achieves a 74% absolute performance gain over prior methods without sacrificing sample efficiency. Moreover, InfoSeeker generalizes across LLMs and outperforms baselines on established benchmarks such as robotic manipulation and web navigation. These findings underscore the importance of tightly integrating planning and information seeking for robust behavior in partially observable environments. The project page is available at https://infoseekerllm.github.io

Information Seeking for Robust Decision Making under Partial Observability

TL;DR

Abstract

Information Seeking for Robust Decision Making under Partial Observability

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)