Table of Contents
Fetching ...

Fully Autonomous AI Agents Should Not be Developed

Margaret Mitchell, Avijit Ghosh, Alexandra Sasha Luccioni, Giada Pistilli

TL;DR

The paper argues that fully autonomous AI agents should not be developed due to safety, privacy, and trust risks that escalate with autonomy. It formalizes a definition and a leveled framework for AI agents, then analyzes how increasing autonomy interacts with a comprehensive value taxonomy (accuracy, assistiveness, efficiency, equity, flexibility, humanlikeness, privacy, safety, security, trust). Through methodology grounded in applied ethics and a reliance on base models like LLMs, the authors show that while some benefits arise, the potential for cascading harms and loss of human oversight grows with each higher level of autonomy. The work advocates adopting a spectrum of agent autonomy, implementing robust human-override and safety mechanisms, and prioritizing semi-autonomous systems to harness benefits while mitigating catastrophic risks. It emphasizes governance, oversight, and safety guarantees as essential to responsible AI development going forward.

Abstract

This paper argues that fully autonomous AI agents should not be developed. In support of this position, we build from prior scientific literature and current product marketing to delineate different AI agent levels and detail the ethical values at play in each, documenting trade-offs in potential benefits and risks. Our analysis reveals that risks to people increase with the autonomy of a system: The more control a user cedes to an AI agent, the more risks to people arise. Particularly concerning are safety risks, which affect human life and impact further values.

Fully Autonomous AI Agents Should Not be Developed

TL;DR

The paper argues that fully autonomous AI agents should not be developed due to safety, privacy, and trust risks that escalate with autonomy. It formalizes a definition and a leveled framework for AI agents, then analyzes how increasing autonomy interacts with a comprehensive value taxonomy (accuracy, assistiveness, efficiency, equity, flexibility, humanlikeness, privacy, safety, security, trust). Through methodology grounded in applied ethics and a reliance on base models like LLMs, the authors show that while some benefits arise, the potential for cascading harms and loss of human oversight grows with each higher level of autonomy. The work advocates adopting a spectrum of agent autonomy, implementing robust human-override and safety mechanisms, and prioritizing semi-autonomous systems to harness benefits while mitigating catastrophic risks. It emphasizes governance, oversight, and safety guarantees as essential to responsible AI development going forward.

Abstract

This paper argues that fully autonomous AI agents should not be developed. In support of this position, we build from prior scientific literature and current product marketing to delineate different AI agent levels and detail the ethical values at play in each, documenting trade-offs in potential benefits and risks. Our analysis reveals that risks to people increase with the autonomy of a system: The more control a user cedes to an AI agent, the more risks to people arise. Particularly concerning are safety risks, which affect human life and impact further values.

Paper Structure

This paper contains 38 sections, 1 figure, 6 tables.

Figures (1)

  • Figure 1: Selection of potential increased benefits and harms with respect to ethical values as AI agent autonomy increases.