Active Perception using Neural Radiance Fields
Siming He, Christopher D. Hsu, Dexter Ong, Yifei Simon Shao, Pratik Chaudhari
TL;DR
The paper formulates active perception as maximizing the predictive information $I(y_{ ext{future}}, y_{ ext{past}})$ to guide sensing and motion in indoor environments. It proposes a semantic NeRF representation that can synthesize multi-modal observations and enables a sampling-based planner to evaluate information gain along dynamically feasible quadrotor trajectories. Through Habitat-based simulations, the approach demonstrates improved object localization and scene reconstruction quality, driven by an integrated loop of perception, generative modeling, and planning. The work highlights the practical potential of information-theoretic objectives combined with rich scene representations for autonomous exploration tasks.
Abstract
We study active perception from first principles to argue that an autonomous agent performing active perception should maximize the mutual information that past observations posses about future ones. Doing so requires (a) a representation of the scene that summarizes past observations and the ability to update this representation to incorporate new observations (state estimation and mapping), (b) the ability to synthesize new observations of the scene (a generative model), and (c) the ability to select control trajectories that maximize predictive information (planning). This motivates a neural radiance field (NeRF)-like representation which captures photometric, geometric and semantic properties of the scene grounded. This representation is well-suited to synthesizing new observations from different viewpoints. And thereby, a sampling-based planner can be used to calculate the predictive information from synthetic observations along dynamically-feasible trajectories. We use active perception for exploring cluttered indoor environments and employ a notion of semantic uncertainty to check for the successful completion of an exploration task. We demonstrate these ideas via simulation in realistic 3D indoor environments.
