Infrastructure for AI Agents
Alan Chan, Kevin Wei, Sihao Huang, Nitarshan Rajkumar, Elija Perrier, Seth Lazar, Gillian K. Hadfield, Markus Anderljung
TL;DR
The paper defines agent infrastructure as external technical systems and shared protocols designed to mediate AI-agent interactions with environments and institutions, arguing this infrastructure is essential for safe and scalable ecosystems. It proposes three core functions—attribution, shaping interactions, and response/remedy—to guide a catalog of infrastructure directions. It details concrete components (Identity binding, Certification, Agent IDs) and interaction mechanisms (Agent channels, Oversight layers, Inter-agent communication, Commitment devices, Rollbacks, Incident reporting) while discussing adoption, interoperability, and safety considerations. Overall, the work offers a strategic blueprint for building interoperable, updatable agent infrastructure and highlights governance, liability, and policy implications necessary as AI agents become more capable and widespread.
Abstract
AI agents plan and execute interactions in open-ended environments. For example, OpenAI's Operator can use a web browser to do product comparisons and buy online goods. Much research on making agents useful and safe focuses on directly modifying their behaviour, such as by training them to follow user instructions. Direct behavioural modifications are useful, but do not fully address how heterogeneous agents will interact with each other and other actors. Rather, we will need external protocols and systems to shape such interactions. For instance, agents will need more efficient protocols to communicate with each other and form agreements. Attributing an agent's actions to a particular human or other legal entity can help to establish trust, and also disincentivize misuse. Given this motivation, we propose the concept of \textbf{agent infrastructure}: technical systems and shared protocols external to agents that are designed to mediate and influence their interactions with and impacts on their environments. Just as the Internet relies on protocols like HTTPS, our work argues that agent infrastructure will be similarly indispensable to ecosystems of agents. We identify three functions for agent infrastructure: 1) attributing actions, properties, and other information to specific agents, their users, or other actors; 2) shaping agents' interactions; and 3) detecting and remedying harmful actions from agents. We provide an incomplete catalog of research directions for such functions. For each direction, we include analysis of use cases, infrastructure adoption, relationships to existing (internet) infrastructure, limitations, and open questions. Making progress on agent infrastructure can prepare society for the adoption of more advanced agents.
