The Path To Autonomous Cyber Defense
Sean Oesch, Phillipe Austria, Amul Chaulagain, Brian Weber, Cory Watson, Matthew Dixson, Amir Sadovnik
TL;DR
Defenders are overwhelmed by attack volume, and AI-enabled attackers threaten to outpace humans. The paper outlines a path to autonomous cyber defense using multi-agent reinforcement learning, where specialized agents automate stages of the cyber defense life cycle. It discusses critical design choices, including playing the right game (observation, rewards, actions), enabling adaptability to changing networks and adversaries, and building high-fidelity, reusable training environments that combine simulation and emulation. The findings argue that modular, detector-based observation signals and dynamic reward shaping can improve transferability to real networks, and that standardized training platforms are essential for progress.
Abstract
Defenders are overwhelmed by the number and scale of attacks against their networks.This problem will only be exacerbated as attackers leverage artificial intelligence to automate their workflows. We propose a path to autonomous cyber agents able to augment defenders by automating critical steps in the cyber defense life cycle.
