AgentCourt: Simulating Court with Adversarial Evolvable Lawyer Agents
Guhong Chen, Liyang Fan, Zihan Gong, Nan Xie, Zixuan Li, Ziqiang Liu, Chengming Li, Qiang Qu, Hamid Alinejad-Rokny, Shiwen Ni, Min Yang
TL;DR
AgentCourt introduces AdvEvol, an adversarial evolutionary framework that enables LLM-based lawyer agents to autonomously acquire legal knowledge across three specialized bases—regulations, experience, and cases—within a simulated court. By iterating through pre-trial data collection, dynamic courtroom debates, and post-trial reflection, the framework achieves strong performance on dynamic courtroom tasks, approaching GPT-4o-mini levels and outperforming specialized legal models. The CourtBench benchmark is proposed to systematically evaluate interactive legal reasoning beyond static knowledge retrieval. The work highlights the crucial role of adversarial learning in legal AI and suggests scalable directions for broader judicial and regulatory applications, with code and data being open-sourced.
Abstract
Current research in LLM-based simulation systems lacks comprehensive solutions for modeling real-world court proceedings, while existing legal language models struggle with dynamic courtroom interactions. We present AgentCourt, a comprehensive legal simulation framework that addresses these challenges through adversarial evolution of LLM-based agents. Our AgentCourt introduces a new adversarial evolutionary approach for agents called AdvEvol, which performs dynamic knowledge learning and evolution through structured adversarial interactions in a simulated courtroom program, breaking the limitations of the traditional reliance on static knowledge bases or manual annotations. By simulating 1,000 civil cases, we construct an evolving knowledge base that enhances the agents' legal reasoning abilities. The evolved lawyer agents demonstrated outstanding performance on our newly introduced CourtBench benchmark, achieving a 12.1% improvement in performance compared to the original lawyer agents. Evaluations by professional lawyers confirm the effectiveness of our approach across three critical dimensions: cognitive agility, professional knowledge, and logical rigor. Beyond outperforming specialized legal models in interactive reasoning tasks, our findings emphasize the importance of adversarial learning in legal AI and suggest promising directions for extending simulation-based legal reasoning to broader judicial and regulatory contexts. The project's code is available at: https://github.com/relic-yuexi/AgentCourt
