On the Interaction of Compressibility and Adversarial Robustness

Melih Barsbey; Antônio H. Ribeiro; Umut Şimşekli; Tolga Birdal

On the Interaction of Compressibility and Adversarial Robustness

Melih Barsbey, Antônio H. Ribeiro, Umut Şimşekli, Tolga Birdal

TL;DR

This work analyzes how structured compressibility—specifically neuron-level sparsity and spectral compression—affects adversarial robustness in neural networks. By linking compressibility to layer operator norms and Lipschitz constants, the authors derive a bound that reveals a small set of adversarial directions that become highly sensitive under compression. Empirically, they demonstrate robustness gaps across diverse architectures and tasks, showing persistence under adversarial training and transfer, and the emergence of universal adversarial perturbations. They also propose practical mitigation strategies based on per-layer compression targets and spread control, emphasizing the trade-off between efficiency and security in modern models.

Abstract

Modern neural networks are expected to simultaneously satisfy a host of desirable properties: accurate fitting to training data, generalization to unseen inputs, parameter and computational efficiency, and robustness to adversarial perturbations. While compressibility and robustness have each been studied extensively, a unified understanding of their interaction still remains elusive. In this work, we develop a principled framework to analyze how different forms of compressibility - such as neuron-level sparsity and spectral compressibility - affect adversarial robustness. We show that these forms of compression can induce a small number of highly sensitive directions in the representation space, which adversaries can exploit to construct effective perturbations. Our analysis yields a simple yet instructive robustness bound, revealing how neuron and spectral compressibility impact $L_\infty$ and $L_2$ robustness via their effects on the learned representations. Crucially, the vulnerabilities we identify arise irrespective of how compression is achieved - whether via regularization, architectural bias, or implicit learning dynamics. Through empirical evaluations across synthetic and realistic tasks, we confirm our theoretical predictions, and further demonstrate that these vulnerabilities persist under adversarial training and transfer learning, and contribute to the emergence of universal adversarial perturbations. Our findings show a fundamental tension between structured compressibility and robustness, and suggest new pathways for designing models that are both efficient and secure.

On the Interaction of Compressibility and Adversarial Robustness

TL;DR

Abstract

On the Interaction of Compressibility and Adversarial Robustness

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (16)

Theorems & Definitions (24)