Count-based Novelty Exploration in Classical Planning
Giacomo Rosa, Nir Lipovetzky
TL;DR
Facing exploration bottlenecks in Classical Planning due to exponential growth of tuples, the paper introduces classical count-based novelty, which leverages the frequency of $k$-tuples in search histories to guide exploration. It provides theoretical links between counts, Hamming distance, and the expected number of novel tuples, and proposes a memory-efficient Trimmed Open List along with the BFNoS frontend for dual-configuration planning. Experimental results on IPC benchmarks show that count-based novelty, especially when combined with partitioning and trimmed lists, yields competitive coverage and complements existing novelty heuristics; memory-threshold hybrid configurations substantially boost instance coverage. The work bridges classical planning with count-based exploration concepts and offers practical, scalable tools for memory-aware search, with potential cross-domain relevance to reinforcement learning.
Abstract
Count-based exploration methods are widely employed to improve the exploratory behavior of learning agents over sequential decision problems. Meanwhile, Novelty search has achieved success in Classical Planning through recording of the first, but not successive, occurrences of tuples. In order to structure the exploration, however, the number of tuples considered needs to grow exponentially as the search progresses. We propose a new novelty technique, classical count-based novelty, which aims to explore the state space with a constant number of tuples, by leveraging the frequency of each tuple's appearance in a search tree. We then justify the mechanisms through which lower tuple counts lead the search towards novel tuples. We also introduce algorithmic contributions in the form of a trimmed open list that maintains a constant size by pruning nodes with bad novelty values. These techniques are shown to complement existing novelty heuristics when integrated in a classical solver, achieving competitive results in challenging benchmarks from recent International Planning Competitions. Moreover, adapting our solver as the frontend planner in dual configurations that utilize both memory and time thresholds demonstrates a significant increase in instance coverage, surpassing current state-of-the-art solvers.
