Approximate Sequential Optimization for Informative Path Planning
Joshua Ott, Mykel J. Kochenderfer, Stephen Boyd
TL;DR
The paper tackles IPP on graphs under a budget by formulating a mixed-integer convex program and introducing a convex relaxation that provides a lower bound. It then develops Approximate Sequential Path Optimization (ASPO), a dynamic-programming–based method that builds paths segment-by-segment by solving orienteering subproblems with information-guided rewards, achieving scalable performance and a bounded optimality gap. The authors extend the framework to adaptive objectives, multimodal sensing, and multi-agent IPP, and demonstrate competitive or superior results compared with exact MICP, MCTS, and neural-network–based approaches on large graphs, with an open-source implementation. These contributions enable robust, scalable planning for informative sensing in dynamic and heterogeneous environments while supporting extensions to complex sensing modalities and multiple agents. The work has practical impact for real-world exploration, environmental monitoring, and search missions where information gain must be maximized within resource constraints.
Abstract
We consider the problem of finding an informative path through a graph, given initial and terminal nodes and a given maximum path length. We assume that a linear noise corrupted measurement is taken at each node of an underlying unknown vector that we wish to estimate. The informativeness is measured by the reduction in uncertainty in our estimate, evaluated using several metrics. We present a convex relaxation for this informative path planning problem, which we can readily solve to obtain a bound on the possible performance. We develop an approximate sequential method where the path is constructed segment by segment through dynamic programming. This involves solving an orienteering problem, with the node reward acting as a surrogate for informativeness, taking the first step, and then repeating the process. The method scales to very large problem instances and achieves performance not too far from the bound produced by the convex relaxation. We also demonstrate our method's ability to handle adaptive objectives, multimodal sensing, and multi-agent variations of the informative path planning problem.
