Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce
Yijia Shao, Humishka Zope, Yucheng Jiang, Jiaxin Pei, David Nguyen, Erik Brynjolfsson, Diyi Yang
TL;DR
This paper tackles how AI agents will reshape work by introducing a worker-centered auditing framework that captures which tasks workers want automated or augmented and how those desires align with current capabilities. It builds WORKBank, a large-scale dataset combining 1,500 domain workers across 104 occupations with 52 AI experts evaluating 844 tasks, using an audio-enhanced survey and a new Human Agency Scale ($H1$–$H5$) to quantify human involvement. The study reveals a four-zone desire-capability landscape, highlights mismatches between worker desires and investment/tech focus, and shows that AI-agent integration may shift core human skills from information processing toward interpersonal and organizational competencies. These insights offer a concrete reference for prioritizing AI-agent R&D, guiding responsible deployment, and informing workforce development as workplace dynamics evolve. The framework advances beyond automate-or-not dichotomies by embracing augmentation and collaboration, with implications for policy, education, and industry practice.
Abstract
The rapid rise of compound AI systems (a.k.a., AI agents) is reshaping the labor market, raising concerns about job displacement, diminished human agency, and overreliance on automation. Yet, we lack a systematic understanding of the evolving landscape. In this paper, we address this gap by introducing a novel auditing framework to assess which occupational tasks workers want AI agents to automate or augment, and how those desires align with the current technological capabilities. Our framework features an audio-enhanced mini-interview to capture nuanced worker desires and introduces the Human Agency Scale (HAS) as a shared language to quantify the preferred level of human involvement. Using this framework, we construct the WORKBank database, building on the U.S. Department of Labor's O*NET database, to capture preferences from 1,500 domain workers and capability assessments from AI experts across over 844 tasks spanning 104 occupations. Jointly considering the desire and technological capability divides tasks in WORKBank into four zones: Automation "Green Light" Zone, Automation "Red Light" Zone, R&D Opportunity Zone, Low Priority Zone. This highlights critical mismatches and opportunities for AI agent development. Moving beyond a simple automate-or-not dichotomy, our results reveal diverse HAS profiles across occupations, reflecting heterogeneous expectations for human involvement. Moreover, our study offers early signals of how AI agent integration may reshape the core human competencies, shifting from information-focused skills to interpersonal ones. These findings underscore the importance of aligning AI agent development with human desires and preparing workers for evolving workplace dynamics.
