Evaluating Model Robustness Using Adaptive Sparse L0 Regularization

Weiyou Liu; Zhenyang Li; Weitong Chen

TL;DR

The paper tackles the vulnerability of deep neural networks to adversarial inputs, focusing on sparse perturbations under the $L_0$ norm to reveal weaknesses not captured by conventional norms. It introduces Adaptive Sparse and Lightweight Optimization (ASLO), a differentiable $L_0$-approximation method with real-time adaptivity that guides perturbations to be small yet effective. The authors demonstrate ASLO’s ability to balance attack efficacy with sparsity, validate it on time-series datasets and multiple model architectures, and extend its use to common gradient-based attacks like GM_PGD and CW, resulting in reduced perturbation distances without sacrificing attack success. These findings advance robustness evaluation and have practical implications for designing defenses in time-series and other domains against highly adaptive adversaries.

Abstract

Deep Neural Networks (DNNs) have demonstrated remarkable success in various domains but remain susceptible to adversarial examples, which are slightly altered inputs designed to induce misclassification. While adversarial attacks typically optimize under $L_p$ norm constraints, attacks based on the $L_0$ norm, which prioritise input sparsity, are less studied due to their complex and non-convex nature. These sparse adversarial examples challenge existing defenses by altering a minimal subset of features, potentially uncovering more subtle DNN weaknesses. However, current $L_0$ norm attack methodologies face a trade-off between accuracy and efficiency: they are either precise but computationally intensive, or expedient but imprecise. This paper proposes a novel, scalable, and effective approach to generating adversarial examples based on the $L_0$ norm, aimed at refining the robustness evaluation of DNNs against such perturbations.
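To make the idea of a differentiable $L_0$ approximation concrete, the sketch below compares the exact count of non-zero entries with one common smooth surrogate, $\sum_i \delta_i^2 / (\delta_i^2 + \sigma)$. This is an illustrative assumption, not the ASLO formulation, which this summary does not specify; the function names, the surrogate, and the value of `sigma` are all hypothetical.

```python
def l0_norm(delta, tol=1e-12):
    """Exact L0 'norm': the number of non-zero entries (not differentiable)."""
    return sum(1 for d in delta if abs(d) > tol)


def smooth_l0(delta, sigma=1e-3):
    """Assumed smooth surrogate: sum of d^2 / (d^2 + sigma).

    Each term is 0 when d == 0 and approaches 1 as |d| grows, so the sum
    approximates the count of non-zero entries while staying differentiable.
    """
    return sum(d * d / (d * d + sigma) for d in delta)


def smooth_l0_grad(delta, sigma=1e-3):
    """Analytic gradient of smooth_l0 with respect to each entry of delta."""
    return [2.0 * d * sigma / (d * d + sigma) ** 2 for d in delta]


# A perturbation over a short time series that touches 2 of 6 time steps.
delta = [0.0, 0.5, 0.0, 0.0, -0.8, 0.0]
exact = l0_norm(delta)          # counts the two perturbed time steps
approx = smooth_l0(delta)       # close to the exact count, but smooth
grad = smooth_l0_grad(delta)    # usable as a sparsity penalty gradient
```

Because the surrogate has a well-defined gradient everywhere, it could in principle be added as a penalty term to a gradient-based attack objective, which is the general role the summary describes for a differentiable $L_0$ approximation.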

Paper Structure (33 sections, 6 equations, 1 figure, 1 algorithm)

Introduction
Related Work
Adversarial Attacks on Time Series Classification
Sparse Adversarial Perturbations
Challenges in $L_0$ Norm Optimization
Existing Approaches and Limitations
Methodology
Overview of ASLO Strategy
Basic Principles of ASLO
Mechanism of ASLO
Algorithmic Implementation
Example and Detailed Explanation
Experiment
Dataset
Part 1: Evaluation of the Adaptive Sparse Regularization Method
...and 18 more sections

Figures (1)

Figure 1: ASR and Success Distance comparison between GM_PGD, GM_PGD_L2, and AS_GM_PGD methods across different models.
