Automated Code Editing with Search-Generate-Modify
Changshu Liu, Pelin Cetin, Yogesh Patodia, Saikat Chakraborty, Yangruibo Ding, Baishakhi Ray
TL;DR
SarGaM tackles automated code editing by mimicking a developer’s workflow: search for related patches, generate a candidate edit, and refine it with a granular modification step. The method combines three components—search augmentation, generation with off-the-shelf code models, and a Levenshtein Transformer-based edit model—to produce high-quality patches for code edits and bug fixes. Across code-editing benchmarks and automated program repair datasets, SarGaM consistently outperforms generation-only and edit-only baselines, demonstrating the value of integrating retrieval and fine-grained edits. The work provides strong evidence that retrieval-informed generation plus precise edit operations can advance practical software maintenance tools, and it releases a prototype implementation for further exploration.
Abstract
Code editing is essential in evolving software development. Many automated code editing tools have been proposed that leverage both Information Retrieval-based techniques and Machine Learning-based code generation and code editing models. Each technique comes with its own promises and perils, and they are often used together to complement their strengths and compensate for their weaknesses. This paper proposes a hybrid approach to better synthesize code edits by leveraging the power of code search, generation, and modification. Our key observation is that a patch obtained by search and retrieval, even if imperfect, can provide helpful guidance to a code generation model. However, a retrieval-guided patch produced by a code generation model can still be a few tokens off from the intended patch. Such generated patches can be slightly modified to create the intended patches. SARGAM is a novel tool designed to mimic a real developer's code editing behavior. Given an original code version, the developer may search for related patches, generate or write the code, and then modify the generated code to adapt it to the right context. Our evaluation of SARGAM on edit generation shows superior performance with respect to current state-of-the-art techniques. SARGAM also shows great effectiveness on automated program repair tasks.
