Path Integral Methods for Synthesizing and Preventing Stealthy Attacks in Nonlinear Cyber-Physical Systems
Apurva Patil, Kyle Morgenstein, Luis Sentis, Takashi Tanaka
TL;DR
The paper addresses stealthy attacks in nonlinear cyber-physical systems by quantifying stealth via the KL divergence $D(P\|Q)$ and formulating two problems: (i) synthesis of worst-case stealthy attacks with a fixed controller and (ii) a minimax KL control framework to counteract such attacks. It develops a path-integral control framework enabling simulator-based, real-time computation of attack policies and saddle-point defenses without explicit system models, leveraging Monte Carlo sampling and the Cole–Hopf transformation to linearize value functions. The work reveals equivalences between the minimax KL control problem, risk-sensitive control, and two-player zero-sum stochastic differential games under structural assumptions, and provides algorithms for both attack synthesis and mitigation. Numerical experiments on unicycle navigation and cruise control demonstrate that an attacker can covertly degrade performance while remaining undetected, while a risk-aware controller can adapt policies to prevent crashes; the approach is scalable to nonlinear CPS and compatible with digital-twin paradigms. These results offer a practical, simulator-driven framework for designing robust defenses against stealthy CPS attacks with potential impact on security-critical applications.
Abstract
This paper studies the synthesis and mitigation of stealthy attacks in nonlinear cyber-physical systems (CPS). To quantify stealthiness, we employ the Kullback-Leibler (KL) divergence, a measure rooted in hypothesis testing and detection theory, which captures the trade-off between an attacker's desire to remain stealthy and her goal of degrading system performance. First, we synthesize the worst-case stealthy attack in nonlinear CPS using the path integral approach. Second, we consider how a controller can mitigate the impact of such stealthy attacks by formulating a minimax KL control problem, yielding a zero-sum game between the attacker and the controller. Again, we leverage a path integral-based solution that computes saddle-point policies for both players through Monte Carlo simulations. We validate our approach using unicycle navigation and cruise control problems, demonstrating how an attacker can covertly drive the system into unsafe regions, and how the controller can adapt her policy to combat the worst-case attacks.
