Table of Contents
Fetching ...

Active Epistemic Control for Query-Efficient Verified Planning

Shuhui Qu

TL;DR

Active Epistemic Control (AEC) introduces a grounded-verification layer for planning under partial observability by maintaining a clear separation between externally grounded facts $w$ and model-based beliefs $\hat{w}$. The framework uses a verifier dependent only on grounded evidence to certify feasibility, while epistemic actions (querying vs. simulating) prune candidate plans without letting predictions become truth. By discretizing uncertain predictions with an ambiguity margin and leveraging a pullback-based categorical consistency check (SQ-BCP), AEC achieves efficient planning with fewer replanning rounds on ALFWorld and robustness to distribution shifts on ScienceWorld. The approach provides a principled, model-agnostic safety gate atop LLM planners and world models, reducing the risk of silent infeasibility while preserving efficiency gains from prediction-based pruning. Overall, AEC demonstrates strong performance and generality across embodied planning benchmarks, offering a practical path to reliable planning under uncertainty.

Abstract

Planning in interactive environments is challenging under partial observability: task-critical preconditions (e.g., object locations or container states) may be unknown at decision time, yet grounding them through interaction is costly. Learned world models can cheaply predict missing facts, but prediction errors can silently induce infeasible commitments. We present \textbf{Active Epistemic Control (AEC)}, an epistemic-categorical planning layer that integrates model-based belief management with categorical feasibility checks. AEC maintains a strict separation between a \emph{grounded fact store} used for commitment and a \emph{belief store} used only for pruning candidate plans. At each step, it either queries the environment to ground an unresolved predicate when uncertainty is high or predictions are ambiguous, or simulates the predicate to filter hypotheses when confidence is sufficient. Final commitment is gated by grounded precondition coverage and an SQ-BCP pullback-style compatibility check, so simulated beliefs affect efficiency but cannot directly certify feasibility. Experiments on ALFWorld and ScienceWorld show that AEC achieves competitive success with fewer replanning rounds than strong LLM-agent baselines.

Active Epistemic Control for Query-Efficient Verified Planning

TL;DR

Active Epistemic Control (AEC) introduces a grounded-verification layer for planning under partial observability by maintaining a clear separation between externally grounded facts and model-based beliefs . The framework uses a verifier dependent only on grounded evidence to certify feasibility, while epistemic actions (querying vs. simulating) prune candidate plans without letting predictions become truth. By discretizing uncertain predictions with an ambiguity margin and leveraging a pullback-based categorical consistency check (SQ-BCP), AEC achieves efficient planning with fewer replanning rounds on ALFWorld and robustness to distribution shifts on ScienceWorld. The approach provides a principled, model-agnostic safety gate atop LLM planners and world models, reducing the risk of silent infeasibility while preserving efficiency gains from prediction-based pruning. Overall, AEC demonstrates strong performance and generality across embodied planning benchmarks, offering a practical path to reliable planning under uncertainty.

Abstract

Planning in interactive environments is challenging under partial observability: task-critical preconditions (e.g., object locations or container states) may be unknown at decision time, yet grounding them through interaction is costly. Learned world models can cheaply predict missing facts, but prediction errors can silently induce infeasible commitments. We present \textbf{Active Epistemic Control (AEC)}, an epistemic-categorical planning layer that integrates model-based belief management with categorical feasibility checks. AEC maintains a strict separation between a \emph{grounded fact store} used for commitment and a \emph{belief store} used only for pruning candidate plans. At each step, it either queries the environment to ground an unresolved predicate when uncertainty is high or predictions are ambiguous, or simulates the predicate to filter hypotheses when confidence is sufficient. Final commitment is gated by grounded precondition coverage and an SQ-BCP pullback-style compatibility check, so simulated beliefs affect efficiency but cannot directly certify feasibility. Experiments on ALFWorld and ScienceWorld show that AEC achieves competitive success with fewer replanning rounds than strong LLM-agent baselines.
Paper Structure (58 sections, 1 theorem, 12 equations, 3 figures, 2 tables, 1 algorithm)

This paper contains 58 sections, 1 theorem, 12 equations, 3 figures, 2 tables, 1 algorithm.

Key Result

Theorem 3.4

Consider one run of Algorithm alg:aec. Let $w_0$ be the initial grounded facts and let $w_{\mathrm{final}}$ be the grounded facts at the moment a plan $\pi$ is committed. Define i.e., the set of predicates whose truth values were grounded after initialization. Assume: Then the committed plan $\pi$ satisfies In particular, errors made by simulation-only predicates (those appearing only in $\hat{

Figures (3)

  • Figure 1: Ablation on ALFWorld. Blue: failure rate. Orange: replanning rounds.
  • Figure 2: ALFWorld success rate of AEC under different LLM backbones.
  • Figure 3: ALFWorld success rate of AEC across iterations.

Theorems & Definitions (4)

  • Definition 3.1: Epistemic State
  • Definition 3.2: Sound Verifier
  • Definition 3.3: Plan Application
  • Theorem 3.4: Verified Epistemic Control