Robustifying ML-powered Network Classifiers with PANTS
Minhao Jin, Maria Apostolaki
TL;DR
ML-powered network classifiers are vulnerable to adversarial inputs, and existing AML methods struggle to ensure realizability and semantic preservation in network settings. PANTS integrates gradient-based AML (e.g., PGD, ZOO) with an SMT solver to generate adversarial, realizable, and semantics-preserving packet sequences and embeds them into an iterative adversarial training loop. It achieves higher adversarial-sample discovery and robustness gains than state-of-the-art baselines (e.g., median ASR $=35.31\%$, up to $52.72\%$ robustness improvement) while maintaining accuracy, and it remains effective against stronger or different attackers; it is open-sourced for practical deployment. The approach is practical across pipelines with non-differentiable feature engineering and non-end-to-end differentiable components, offering a concrete path for operators to assess and harden MNCs in real networks.
Abstract
Multiple network management tasks, from resource allocation to intrusion detection, rely on some form of ML-based network traffic classification (MNC). Despite their potential, MNCs are vulnerable to adversarial inputs, which can lead to outages, poor decision-making, and security violations, among other issues. The goal of this paper is to help network operators assess and enhance the robustness of their MNC against adversarial inputs. The most critical step for this is generating inputs that can fool the MNC while being realizable under various threat models. Compared to other ML models, finding adversarial inputs against MNCs is more challenging due to the existence of non-differentiable components e.g., traffic engineering and the need to constrain inputs to preserve semantics and ensure reliability. These factors prevent the direct use of well-established gradient-based methods developed in adversarial ML (AML). To address these challenges, we introduce PANTS, a practical white-box framework that uniquely integrates AML techniques with Satisfiability Modulo Theories (SMT) solvers to generate adversarial inputs for MNCs. We also embed PANTS into an iterative adversarial training process that enhances the robustness of MNCs against adversarial inputs. PANTS is 70% and 2x more likely in median to find adversarial inputs against target MNCs compared to state-of-the-art baselines, namely Amoeba and BAP. PANTS improves the robustness of the target MNCs by 52.7% (even against attackers outside of what is considered during robustification) without sacrificing their accuracy.
