Learning Near-Optimal Intrusion Responses Against Dynamic Attackers
Kim Hammar, Rolf Stadler
TL;DR
This work treats intrusion response as an optimal stopping game between a defender and a dynamic attacker, revealing threshold structures in optimal strategies. It introduces Threshold Fictitious Self-Play (T-FP), a gradient-based, threshold-parametrized learning procedure that efficiently converges to near-Nash defender policies against adaptive adversaries. The authors validate the framework with both a simulation system and a hardware-like emulation platform, showing that learned strategies achieve near-equilibrium performance and robustness against a changing attacker. The combination of a formal stopping-game model, a scalable learning algorithm, and emulation-informed evaluation provides a practical pathway to deploy near-optimal intrusion responses in real IT infrastructures.
Abstract
We study automated intrusion response and formulate the interaction between an attacker and a defender as an optimal stopping game where attack and defense strategies evolve through reinforcement learning and self-play. The game-theoretic modeling enables us to find defender strategies that are effective against a dynamic attacker, i.e. an attacker that adapts its strategy in response to the defender strategy. Further, the optimal stopping formulation allows us to prove that optimal strategies have threshold properties. To obtain near-optimal defender strategies, we develop Threshold Fictitious Self-Play (T-FP), a fictitious self-play algorithm that learns Nash equilibria through stochastic approximation. We show that T-FP outperforms a state-of-the-art algorithm for our use case. The experimental part of this investigation includes two systems: a simulation system where defender strategies are incrementally learned and an emulation system where statistics are collected that drive simulation runs and where learned strategies are evaluated. We argue that this approach can produce effective defender strategies for a practical IT infrastructure.
