Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement
Xunjian Yin, Xinyi Wang, Liangming Pan, Li Lin, Xiaojun Wan, William Yang Wang
TL;DR
The paper tackles rigidity in hand-designed and fixed meta-learning agents by introducing Gödel Agent, a self-referential framework that uses LLM-driven self-modification to iteratively rewrite its own code. It formalizes a recursive update system for both policy and learning algorithm, and implements runtime memory modification with prompting to enable continuous self-improvement. Across multiple reasoning and calculation benchmarks, Gödel Agent achieves competitive or superior performance with faster convergence and demonstrated robustness, particularly in open-ended scenarios. The work lays groundwork for autonomous, self-improving AI agents while discussing limitations, safety, and directions for future development.
Abstract
The rapid advancement of large language models (LLMs) has significantly enhanced the capabilities of AI-driven agents across various tasks. However, existing agentic systems, whether based on fixed pipeline algorithms or pre-defined meta-learning frameworks, cannot search the whole agent design space due to the restriction of human-designed components, and thus might miss the globally optimal agent design. In this paper, we introduce Gödel Agent, a self-evolving framework inspired by the Gödel machine, enabling agents to recursively improve themselves without relying on predefined routines or fixed optimization algorithms. Gödel Agent leverages LLMs to dynamically modify its own logic and behavior, guided solely by high-level objectives through prompting. Experimental results on mathematical reasoning and complex agent tasks demonstrate that implementation of Gödel Agent can achieve continuous self-improvement, surpassing manually crafted agents in performance, efficiency, and generalizability.
