PoCo: Agentic Proof-of-Concept Exploit Generation for Smart Contracts

Vivi Andersson; Sofia Bobadilla; Harald Hobbelhagen; Martin Monperrus

TL;DR

PoCo is an agentic framework that automatically converts auditor-written vulnerability descriptions into executable PoC exploits for smart contracts, compiling and testing them within Foundry. Using a ReAct-style loop with tool access, PoCo outperforms single-pass prompting and workflow baselines across 23 real-world vulnerabilities, producing well-formed and logically correct PoCs. The evaluation uses a novel Proof-of-Patch dataset that links vulnerability findings to their patches, so that PoCs can be validated against ground-truth mitigations. The results indicate that the approach can substantially reduce the effort audits require, and the dataset is open and reproducible for the smart contract security community. The level of detail in a vulnerability annotation is shown to influence PoC quality, which informs best practices for vulnerability reporting.

Abstract

Smart contracts operate in a highly adversarial environment, where vulnerabilities can lead to substantial financial losses. Thus, smart contracts are subject to security audits. In auditing, proof-of-concept (PoC) exploits play a critical role by demonstrating to stakeholders that the reported vulnerabilities are genuine, reproducible, and actionable. However, manually creating PoCs is time-consuming, error-prone, and often constrained by tight audit schedules. We introduce PoCo, an agentic framework that automatically generates executable PoC exploits from natural-language vulnerability descriptions written by auditors. PoCo works autonomously, interacting with a set of code-execution tools in a Reason-Act-Observe loop. It produces fully executable exploits compatible with the Foundry testing framework, ready for integration into audit reports and other security tools. We evaluate PoCo on a dataset of 23 real-world vulnerability reports. PoCo consistently outperforms the prompting and workflow baselines, generating well-formed and logically correct PoCs. Our results demonstrate that agentic frameworks can significantly reduce the effort required for high-quality PoCs in smart contract audits. Our contribution provides readily actionable knowledge for the smart contract security community.
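
To make the Reason-Act-Observe loop concrete, the following is a minimal Python sketch of such a loop. It is not PoCo's implementation: the `run_agent` function, the `Step` record, the tool names `write_file` and `run_tests`, and the scripted policy are all hypothetical stand-ins. In a real setting the policy would be an LLM call and `run_tests` would invoke Foundry (for example via `forge test`); here both are stubbed so the example runs on its own.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical representation: an action is (tool_name, argument).
# The special tool name "finish" ends the loop and returns its argument.
Action = Tuple[str, str]


@dataclass
class Step:
    thought: str       # the agent's reasoning for this step
    action: Action     # the tool call it chose
    observation: str   # what the tool returned


def run_agent(
    policy: Callable[[str, List[Step]], Tuple[str, Action]],
    tools: Dict[str, Callable[[str], str]],
    task: str,
    max_steps: int = 10,
) -> Tuple[Optional[str], List[Step]]:
    """Run a Reason-Act-Observe loop until the policy finishes or steps run out."""
    history: List[Step] = []
    for _ in range(max_steps):
        thought, (name, arg) = policy(task, history)          # Reason
        if name == "finish":
            return arg, history
        tool = tools.get(name)
        observation = tool(arg) if tool else f"unknown tool: {name}"  # Act
        history.append(Step(thought, (name, arg), observation))      # Observe
    return None, history  # step budget exhausted without a final answer


def demo() -> Tuple[Optional[str], List[Step]]:
    files: Dict[str, str] = {}

    def write_file(content: str) -> str:
        files["PoC.t.sol"] = content
        return "written"

    def run_tests(_: str) -> str:
        # Stub for a test runner: "passes" only if the PoC calls attack().
        ok = "attack()" in files.get("PoC.t.sol", "")
        return "PASS" if ok else "FAIL: exploit did not trigger"

    def scripted_policy(task: str, history: List[Step]) -> Tuple[str, Action]:
        # Stand-in for an LLM: reacts to the most recent observation.
        if not history:
            return "Draft a first PoC.", ("write_file", "// draft without exploit call")
        last = history[-1]
        if last.action[0] == "write_file":
            return "Run the test suite.", ("run_tests", "")
        if last.observation.startswith("PASS"):
            return "Test passes; done.", ("finish", files["PoC.t.sol"])
        return "Test failed; revise the PoC.", ("write_file", "// revised: attack()")

    tools = {"write_file": write_file, "run_tests": run_tests}
    return run_agent(scripted_policy, tools, "vulnerability description goes here")


if __name__ == "__main__":
    result, history = demo()
    for step in history:
        print(step.action[0], "->", step.observation)
    print("final PoC:", result)
```

The point of the sketch is the feedback path: the failing test output from one iteration becomes input to the next reasoning step, which is what separates an agentic loop from single-pass prompting. The `max_steps` bound is an assumption added here to guarantee termination.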
