Adversarial Perturbations of Physical Signals
Robert L. Bassett, Austin Van Dellen, Anthony P. Austin
TL;DR
The paper addresses the robustness of spectrogram-based classifiers to physically realizable adversarial perturbations by formulating the attack as a PDE-constrained optimization over a perturbation $f(t)$ that interacts with a source signal through the wave equation $\frac{\partial^{2}u}{\partial t^{2}} = c^{2}\nabla^{2}u + q(x,t)$ and a detector's spectrogram $\hat{s} = 10\log_{10}|\mathcal{F}s|^{2}$. Its main contribution is an efficient computational framework that precomputes an operator $\mathbf{Y} = \mathbf{A}^{-T}\mathbf{D}$ to avoid repeated PDE solves, enabling large-scale perturbation searches under a frequency constraint $\mathcal{P}\mathcal{F}f = 0$ and realistic noise. Experiments across Inception V3, GoogLeNet, and VGG-19 show that small-amplitude perturbations can induce misclassification on validation spectrograms, with substantial speedups over naive adjoint-based methods. The work highlights robustness risks for neural networks in security-sensitive sensing and points to extensions to other physics (e.g., Maxwell equations) and to universal perturbations as promising future directions.
Abstract
We investigate the vulnerability of computer-vision-based signal classifiers to adversarial perturbations of their inputs, where the signals and perturbations are subject to physical constraints. We consider a scenario in which a source and interferer emit signals that propagate as waves to a detector, which attempts to classify the source by analyzing the spectrogram of the signal it receives using a pre-trained neural network. By solving PDE-constrained optimization problems, we construct interfering signals that cause the detector to misclassify the source even though the perturbations to the spectrogram of the received signal are nearly imperceptible. Though such problems can have millions of decision variables, we introduce methods to solve them efficiently. Our experiments demonstrate that one can compute effective and physically realizable adversarial perturbations for a variety of machine learning models under various physical conditions.
