Iterative Experience Refinement of Software-Developing Agents
Chen Qian, Jiahao Li, Yufan Dang, Wei Liu, YiFei Wang, Zihao Xie, Weize Chen, Cheng Yang, Yingli Zhang, Zhiyuan Liu, Maosong Sun
TL;DR
This work tackles the rigidity of static, heuristic experiences in LLM-based software-developing agents by introducing Iterative Experience Refinement (IER). IER enables continual acquisition, utilization, and elimination of experiences across task batches, supported by two propagation patterns: successive (latest-predecessor) and cumulative (all-history). A heuristic elimination mechanism curbs unbounded growth of the experience space, preserving high-quality, frequently-used experiences. Empirical results on the SRDD dataset show that while the successive pattern can yield higher peaks in performance, the cumulative pattern offers more stable improvements, and elimination enables strong performance with only a small, high-quality subset of experiences. These findings underscore the value of dynamic experience refinement for scalable, autonomous software development.
Abstract
Autonomous agents powered by large language models (LLMs) show significant potential for achieving high autonomy in various scenarios such as software development. Recent research has shown that LLM agents can leverage past experiences to reduce errors and enhance efficiency. However, the static experience paradigm, reliant on a fixed collection of past experiences acquired heuristically, lacks iterative refinement and thus hampers agents' adaptability. In this paper, we introduce the Iterative Experience Refinement framework, enabling LLM agents to refine experiences iteratively during task execution. We propose two fundamental patterns: the successive pattern, refining based on nearest experiences within a task batch, and the cumulative pattern, acquiring experiences across all previous task batches. Augmented with our heuristic experience elimination, the method prioritizes high-quality and frequently-used experiences, effectively managing the experience space and enhancing efficiency. Extensive experiments show that while the successive pattern may yield superior results, the cumulative pattern provides more stable performance. Moreover, experience elimination facilitates achieving better performance using just 11.54% of a high-quality subset.
