Breadth-First Search vs. Restarting Random Walks for Escaping Uninformed Heuristic Regions
Daniel Platnick, Dawson Tomasz, Eamon Earl, Sourena Khanzadeh, Richard Valenzano
TL;DR
This work analyzes how to escape Uninformed Heuristic Regions (UHRs) in state-space search by comparing BrFS and Restarting Random Walks (RRWs). It derives exact and bound-based expressions for the expected runtimes of BrFS and RRW^C_ℓ, establishing a crossover condition based on the exit probability $p_g$ and the structure of the search space, including goal depth and distribution. It then extends Enforced Hill-Climbing (EHC) with RRW-based escapes (EHC-RRW^C_ℓ and EHC-RRW^L), provides theoretical guarantees for EHC-complete domains, and validates the theory with experiments on PDDL planning benchmarks. The results show that RRWs can offer runtime and memory benefits in certain topologies (notably unbounded-UHRs) and that the choice between BrFS and RRWs depends on domain structure and goal-density, informing algorithm portfolios for planning under uncertainty.
Abstract
Greedy search methods like Greedy Best-First Search (GBFS) and Enforced Hill-Climbing (EHC) often struggle when faced with Uninformed Heuristic Regions (UHRs) like heuristic local minima or plateaus. In this work, we theoretically and empirically compare two popular methods for escaping UHRs in breadth-first search (BrFS) and restarting random walks (RRWs). We first derive the expected runtime of escaping a UHR using BrFS and RRWs, based on properties of the UHR and the random walk procedure, and then use these results to identify when RRWs will be faster in expectation than BrFS. We then evaluate these methods for escaping UHRs by comparing standard EHC, which uses BrFS to escape UHRs, to variants of EHC called EHC-RRW, which use RRWs for that purpose. EHC-RRW is shown to have strong expected runtime guarantees in cases where EHC has previously been shown to be effective. We also run experiments with these approaches on PDDL planning benchmarks to better understand their relative effectiveness for escaping UHRs.
