A Cost-Aware Approach to Adversarial Robustness in Neural Networks

Charles Meyers; Mohammad Reza Saleh Sedghpour; Tommy Löfstedt; Erik Elmroth

TL;DR

This work addresses the challenge of evaluating adversarial robustness for neural networks in production by introducing a cloud-native, cost-aware framework based on survival analysis. It uses accelerated failure time (AFT) models to predict time-to-failure under adversarial perturbations while jointly optimizing benign accuracy, adversarial accuracy, and training-time cost with a Tree-structured Parzen Estimator (TPE). The methodology enables comparisons across hardware (e.g., P100, V100, L4) and operational settings by tying performance to measurable times (training, inference, attack generation) and costs, encapsulated in metrics such as the TRASH score. Empirical results show that newer hardware reduces training time but with diminishing accuracy gains, and that 8-bit inference-focused hardware (L4) can offer favorable cost-robustness trade-offs, supporting the practicality of the proposed approach for risk-aware deployment and rapid iteration under safety constraints.
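
To make the AFT idea concrete, the following is a minimal sketch of a Weibull AFT model in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the Weibull family, the covariate names (`log_batch_size`, `epochs`), and every numeric value are hypothetical choices, not fitted results from the paper. In an AFT model each covariate rescales the time axis, so a coefficient translates directly into a multiplicative change in expected time-to-failure.

```python
import math


def _log_scale(covariates, coefficients, intercept):
    # Linear predictor: intercept + sum_j beta_j * x_j (hypothetical covariates).
    return intercept + sum(coefficients[name] * value
                           for name, value in covariates.items())


def weibull_aft_survival(t, covariates, coefficients, intercept, shape):
    """S(t | x) = exp(-(t / scale)^shape), where scale = exp(intercept + beta . x)."""
    scale = math.exp(_log_scale(covariates, coefficients, intercept))
    return math.exp(-((t / scale) ** shape))


def median_time_to_failure(covariates, coefficients, intercept, shape):
    """Time t at which S(t | x) = 0.5, i.e. scale * (ln 2)^(1 / shape)."""
    scale = math.exp(_log_scale(covariates, coefficients, intercept))
    return scale * math.log(2.0) ** (1.0 / shape)


def acceleration_factor(coefficients, name, delta=1.0):
    """Multiplicative change in time-to-failure when `name` increases by `delta`."""
    return math.exp(coefficients[name] * delta)


# Hypothetical values chosen only for illustration.
coefficients = {"log_batch_size": -0.10, "epochs": 0.05}
model = {"log_batch_size": 5.0, "epochs": 10.0}

print(median_time_to_failure(model, coefficients, intercept=1.0, shape=1.5))
print(acceleration_factor(coefficients, "epochs"))
```

Under these assumptions, a positive coefficient (here `epochs`) stretches the time-to-failure, while a negative one (here `log_batch_size`) shortens it. The acceleration factor is the ratio of median failure times before and after the covariate changes.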

Abstract

Considering the growing prominence of production-level AI and the threat of adversarial attacks that can evade a model at run-time, evaluating the robustness of models to these evasion attacks is of critical importance. Additionally, testing model changes likely means deploying the models to, e.g., a car, a medical imaging device, or a drone to see how they affect performance, making untested changes a public problem that reduces development speed, increases the cost of development, and makes it difficult (if not impossible) to parse cause from effect. In this work, we use survival analysis as a cloud-native, time-efficient, and precise method for predicting model performance in the presence of adversarial noise. For neural networks in particular, the relationships between the learning rate, batch size, training time, convergence time, and deployment cost are highly complex, so researchers generally rely on benchmark datasets to assess the ability of a model to generalize beyond the training data. To address this, we propose using accelerated failure time models to measure the effect of hardware choice, batch size, number of epochs, and test-set accuracy by using adversarial attacks to induce failures on a reference model architecture before deploying the model to the real world. We evaluate several GPU types and use the Tree-structured Parzen Estimator to maximize model robustness and minimize model run-time simultaneously. This provides a way to evaluate and optimise the model in a single step, while also allowing us to model the effect of model parameters on training time, prediction time, and accuracy. Using this technique, we demonstrate that newer, more powerful hardware does decrease the training time, but at a monetary and power cost that far outpaces the marginal gains in accuracy.
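
The simultaneous objectives described above (maximize benign and adversarial accuracy, minimize training time) can be illustrated with a small Pareto-front filter. This sketch is not the Tree-structured Parzen Estimator itself, which proposes new configurations from past trials. It only shows the multi-objective comparison that decides which completed trials are worth keeping. The `Trial` type and all trial values are hypothetical.

```python
from typing import List, NamedTuple


class Trial(NamedTuple):
    name: str
    benign_accuracy: float       # higher is better
    adversarial_accuracy: float  # higher is better
    training_time_s: float       # lower is better


def dominates(a: Trial, b: Trial) -> bool:
    """True if `a` is at least as good as `b` on every objective and better on one."""
    no_worse = (a.benign_accuracy >= b.benign_accuracy
                and a.adversarial_accuracy >= b.adversarial_accuracy
                and a.training_time_s <= b.training_time_s)
    strictly_better = (a.benign_accuracy > b.benign_accuracy
                       or a.adversarial_accuracy > b.adversarial_accuracy
                       or a.training_time_s < b.training_time_s)
    return no_worse and strictly_better


def pareto_front(trials: List[Trial]) -> List[Trial]:
    """Keep every trial that no other trial dominates."""
    return [t for t in trials
            if not any(dominates(other, t) for other in trials)]


# Hypothetical trial results, for illustration only.
trials = [
    Trial("A", 0.90, 0.40, 120.0),
    Trial("B", 0.92, 0.35, 200.0),
    Trial("C", 0.88, 0.38, 150.0),  # worse than A on all three objectives
    Trial("D", 0.80, 0.20, 30.0),
]

print([t.name for t in pareto_front(trials)])
```

With these made-up numbers, trial C is discarded because A is more accurate on both measures and trains faster. A, B, and D each remain because each is best on at least one objective.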

Paper Structure (42 sections, 20 equations, 9 figures, 2 tables)

This paper contains 42 sections, 20 equations, 9 figures, 2 tables.

Introduction
Motivation
Contributions
Background
Cloud Architectures
ML Pipelines
Classifiers
Learning Rate Selection
Adversarial Attacks
Adversarial Analysis
Survival Analysis for Robustness Verification during Training
Accuracy
Failure Rate
Optimisation
AFT Models
...and 27 more sections

Figures (9)

Figure 1: For each dataset and hardware combination, a random seed is drawn to determine the train and test split. Next, model parameters and attack parameters are chosen at random. After 128 random trials, the TPE algorithm attempts to maximize benign and adversarial accuracy while minimizing training time by tuning the model parameters. The random seed for the data split and the attack parameters are sampled independently of this optimization, which is why they are colored differently. The model tuning (blue box) is discussed in Section \ref{optimisation}. After the trials are completed, several AFT models are fit (see Section \ref{survival_time}) and compared (see Section \ref{best-fit}) using the process depicted in the purple box. Finally, we conduct the cost analysis outlined in Section \ref{cost} (green box).
Figure 2: For our experiments we used four node pools from Google Cloud Platform, each with a particular responsibility. The first node pool includes 3 nodes responsible for hosting monitoring services such as Prometheus and Grafana. The other node pools each had one node with a specific GPU. The KEPLER exporter is deployed on each node as a DaemonSet to monitor resource usage. All storage needed during experimentation, including experiment and monitoring data, was provided through a persistent volume claim (PVC). A PVC is a user's request for storage in Kubernetes, which is then connected to the object storage. The blue experiment, blue-green object storage, green analysis component, and purple AFT component correspond to the same colors in Figure \ref{fig:experiments}.
Figure 3: Benign and adversarial accuracy across all hardware and datasets for all 1000 trials, shown as plots whose width depicts the distribution of the y-axis values. Each color is a different device and the datasets are displayed along the x-axis. Outliers are denoted with a white dot.
Figure 4: Training, inference, and attack times for all hardware and datasets for all 1000 trials, shown as plots whose width depicts the distribution of the second-axis values. The time per sample was assumed to be uniform across the batch of samples for each training, inference, or attack measurement. Each color is a different device and the datasets are displayed along the first axis. Outliers are denoted with a white dot. For these plots, the second axes have been scaled by the number of samples for the sake of comparison.
Figure 5: Training, inference, and attack power consumption for all hardware and datasets for all 1000 trials, shown as plots whose width depicts the distribution of the second-axis values. The power per sample was assumed to be uniform across the batch of samples for each training, inference, or attack measurement. Each color is a different device and the datasets are displayed along the first axis. Outliers are denoted with a white dot. For these plots, the second axes have been scaled by the number of samples for the sake of comparison.
...and 4 more figures
