MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use

Zaid Khan; Ali Farhadi; Ranjay Krishna; Luca Weihs; Mohit Bansal; Tanmay Gupta

TL;DR

MutaGReP tackles the problem of providing a code repository as context for LLM-based code generation without overwhelming the model’s context window. It formulates repo-grounded planning as an execution-free neural tree search over plans, where each step ties a natural language intent to a small set of code symbols grounded in the target repository. The approach uses two successor-function variants (monotonic and unconstrained), a grounding mechanism that maps intents to top-k symbols via synthetic intents, and scoring and traversal strategies (notably best-first search with Likert-based evaluation) to explore the plan space efficiently within a budget. Experiments on LongCodeArena show that tree-searched plans enable strong code-use performance with only a fraction of the repository context, can lift weaker models to match stronger ones, and substantially outperform full-repo-context baselines on hard tasks. This work suggests that structured, repo-grounded planning can provide robust, scalable context for library-style code-use without executing code, with practical implications for improving model performance and interpretability.
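To make the search loop described above concrete, here is a minimal Python sketch of best-first search over plans under an expansion budget. It is an illustration under stated assumptions, not the authors' implementation: the word-overlap retriever stands in for the embedding-based symbol retriever, the coverage-counting scorer stands in for LLM Likert scoring, and the fixed list of candidate intents stands in for LLM-proposed plan mutations. All names (`Step`, `ground`, `best_first_search`, the toy symbol index) are hypothetical.

```python
import heapq
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, Tuple


@dataclass(frozen=True)
class Step:
    intent: str                 # natural language intent for this step
    symbols: Tuple[str, ...]    # repository symbols the intent is grounded to


Plan = Tuple[Step, ...]


def ground(intent: str, index: Dict[str, str], k: int = 1) -> Tuple[str, ...]:
    """Toy grounding (assumption): rank symbols by word overlap between the
    step intent and each symbol's synthetic intent, then keep the top k."""
    words = set(intent.lower().split())
    ranked = sorted(index, key=lambda sym: -len(words & set(index[sym].lower().split())))
    return tuple(ranked[:k])


def best_first_search(
    root: Plan,
    successors: Callable[[Plan], Iterable[Plan]],
    score: Callable[[Plan], float],
    budget: int,
) -> Plan:
    """Expand the highest-scoring frontier plan until the budget is spent."""
    counter = 0  # tie-breaker so the heap never has to compare plans
    frontier = [(-score(root), counter, root)]
    best_score, best_plan = score(root), root
    expanded = 0
    while frontier and expanded < budget:
        _, _, plan = heapq.heappop(frontier)
        expanded += 1
        for child in successors(plan):
            child_score = score(child)
            counter += 1
            heapq.heappush(frontier, (-child_score, counter, child))
            if child_score > best_score:
                best_score, best_plan = child_score, child
    return best_plan


# Hypothetical symbol index: symbol name -> synthetic intent describing it.
INDEX = {
    "load_mesh": "read a mesh file from disk",
    "simplify_mesh": "reduce the number of faces in a mesh",
    "save_mesh": "write a mesh file to disk",
}
# Stand-in for LLM-proposed intents (assumption).
CANDIDATE_INTENTS = [
    "read the mesh file",
    "reduce faces of the mesh",
    "write the mesh to disk",
]
REQUIRED = {"load_mesh", "simplify_mesh", "save_mesh"}


def monotonic_successors(plan: Plan) -> Iterable[Plan]:
    """Monotonic variant: a child only appends one new grounded step."""
    used = {step.intent for step in plan}
    for intent in CANDIDATE_INTENTS:
        if intent not in used:
            yield plan + (Step(intent, ground(intent, INDEX, k=1)),)


def coverage_score(plan: Plan) -> float:
    """Stand-in for Likert-style scoring: count required symbols covered."""
    covered = {sym for step in plan for sym in step.symbols}
    return float(len(covered & REQUIRED))


best = best_first_search((), monotonic_successors, coverage_score, budget=5)
for step in best:
    print(step.intent, "->", step.symbols)
```

No repository code is executed at any point: the search only proposes steps, grounds them against the symbol index, and scores the resulting plans, which is the sense in which the approach is execution-free.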

Abstract

When a human requests an LLM to complete a coding task using functionality from a large code repository, how do we provide context from the repo to the LLM? One approach is to add the entire repo to the LLM's context window. However, most tasks involve only a fraction of the symbols in a repo, longer contexts are detrimental to the LLM's reasoning abilities, and context windows are not unlimited. Alternatively, we could emulate the human ability to navigate a large repo, pick out the right functionality, and form a plan to solve the task. We propose MutaGReP (Mutation-guided Grounded Repository Plan Search), an approach to search for plans that decompose a user request into natural language steps grounded in the codebase. MutaGReP performs neural tree search in plan space, exploring by mutating plans and using a symbol retriever for grounding. On the challenging LongCodeArena benchmark, our plans use less than 5% of the 128K context window for GPT-4o but rival the coding performance of GPT-4o with a context window filled with the repo. Plans produced by MutaGReP allow Qwen 2.5 Coder 32B and 72B to match the performance of GPT-4o with full repo context and enable progress on the hardest LongCodeArena tasks. Project page: zaidkhan.me/MutaGReP
