Greedy Heuristics for Sampling-Based Motion Planning in High-Dimensional State Spaces
Phone Thiha Kyaw, Anh Vu Le, Rajesh Elara Mohan, Jonathan Kelly
TL;DR
This work introduces the greedy informed set, defined by $\mathcal{X}_{greedy}=\{\mathbf{x}\in\mathcal{X}_{free}\mid \hat{f}(\mathbf{x})\le \hat{f}(\mathbf{x}_{\max})\}$ where $\hat{f}$ is the $L^2$ heuristic and $\mathbf{x}_{\max}$ is the maximum-cost state along the current solution. Building on this, Greedy RRT* (G-RRT*) is proposed as a bidirectional, anytime planner that alternates sampling from the greedy informed set and the full informed set, controlled by a bias parameter $\epsilon$, to rapidly find initial solutions and converge to optimal paths. The authors prove probabilistic completeness and characterize asymptotic optimality under mixed sampling, deriving worst-case sample-cost impacts and recall/volume bounds. Empirical results on abstract benchmarks and robotic manipulation tasks show that G-RRT* achieves faster initial solutions and competitive final path costs compared to state-of-the-art planners, particularly in high-dimensional or tightly constrained settings. Overall, the greedy informed sampling approach substantially accelerates convergence in informed planning while preserving global optimality under appropriate exploration/exploitation balance.
Abstract
Informed sampling techniques accelerate the convergence of sampling-based motion planners by biasing sampling toward regions of the state space that are most likely to yield better solutions. However, when the current solution path contains redundant or tortuous segments, the resulting informed subset may remain unnecessarily large, slowing convergence. Our prior work addressed this issue by introducing the greedy informed set, which reduces the sampling region based on the maximum heuristic cost along the current solution path. In this article, we formally characterize the behavior of the greedy informed set within Rapidly-exploring Random Tree (RRT*)-like planners and analyze how greedy sampling affects exploration and asymptotic optimality. We then present Greedy RRT* (G-RRT*), a bi-directional anytime variant of RRT* that leverages the greedy informed set to focus sampling in the most promising regions of the search space. Experiments on abstract planning benchmarks, manipulation tasks from the MotionBenchMaker dataset, and a dual-arm Barrett WAM problem demonstrate that G-RRT* rapidly finds initial solutions and converges asymptotically to optimal paths, outperforming state-of-the-art sampling-based planners.
