Complex LLM Planning via Automated Heuristics Discovery

Hongyi Ling; Shubham Parashar; Sambhav Khurana; Blake Olson; Anwesha Basu; Gaurangi Sinha; Zhengzhong Tu; James Caverlee; Shuiwang Ji

Complex LLM Planning via Automated Heuristics Discovery

Hongyi Ling, Shubham Parashar, Sambhav Khurana, Blake Olson, Anwesha Basu, Gaurangi Sinha, Zhengzhong Tu, James Caverlee, Shuiwang Ji

TL;DR

The paper addresses the challenge of enabling LLMs to perform complex planning without additional training. It introduces Automated Heuristics Discovery (AutoHD), where LLMs generate explicit Python heuristic functions that guide inference-time search, with an evolutionary loop to refine them. By integrating these heuristics into search algorithms like Greedy BFS and A*, AutoHD achieves significant accuracy gains on Blocksworld, the Game of 24, and Rubik's Cube across multiple LLMs, and provides interpretable insights into the reasoning process. The approach reduces reliance on self-verification or external verifiers and demonstrates strong generalization and robustness across tasks, establishing AutoHD as a practical, interpretable framework for complex planning.

Abstract

We consider enhancing large language models (LLMs) for complex planning tasks. While existing methods allow LLMs to explore intermediate steps to make plans, they either depend on unreliable self-verification or external verifiers to evaluate these steps, which demand significant data and computations. Here, we propose automated heuristics discovery (AutoHD), a novel approach that enables LLMs to explicitly generate heuristic functions to guide inference-time search, allowing accurate evaluation of intermediate states. These heuristic functions are further refined through a heuristic evolution process, improving their robustness and effectiveness. Our proposed method requires no additional model training or fine-tuning, and the explicit definition of heuristic functions generated by the LLMs provides interpretability and insights into the reasoning process. Extensive experiments across diverse benchmarks demonstrate significant gains over multiple baselines, including nearly twice the accuracy on some datasets, establishing our approach as a reliable and interpretable solution for complex planning tasks.

Complex LLM Planning via Automated Heuristics Discovery

TL;DR

Abstract

Complex LLM Planning via Automated Heuristics Discovery

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)