Adversarial generalization of unfolding (model-based) networks
Vicky Kouni
TL;DR
This work tackles adversarial generalization of model-based unfolding networks for compressed sensing by coupling a Lipschitz analysis of FGSM-attacked final decoders with adversarial Rademacher complexity (ARC) to derive generalization bounds that depend on overparameterization $N$, depth $L$, and attack level $\\varepsilon$. Using ADMM-DAD with an overcomplete sparsifier $W$, the authors prove Lipschitz continuity of the perturbed decoder in $W$ and obtain a tight ARC-based bound that scales with $\\sqrt{NL\\log(\\varepsilon)}/\\sqrt{s}$ up to constants, plus a term for failure probability. The theory is corroborated by experiments on CIFAR-10 and SVHN, showing that increasing the overcomplete representation improves robustness while the empirical adversarial generalization error tracks the predicted scaling. The results provide practical design guidance for robust, interpretable unfolding networks in safety-critical inverse problems such as CS-MRI, linking architectural overparameterization to adversarial resilience.
Abstract
Unfolding networks are interpretable networks emerging from iterative algorithms, incorporate prior knowledge of data structure, and are designed to solve inverse problems like compressed sensing, which deals with recovering data from noisy, missing observations. Compressed sensing finds applications in critical domains, from medical imaging to cryptography, where adversarial robustness is crucial to prevent catastrophic failures. However, a solid theoretical understanding of the performance of unfolding networks in the presence of adversarial attacks is still in its infancy. In this paper, we study the adversarial generalization of unfolding networks when perturbed with $l_2$-norm constrained attacks, generated by the fast gradient sign method. Particularly, we choose a family of state-of-the-art overaparameterized unfolding networks and deploy a new framework to estimate their adversarial Rademacher complexity. Given this estimate, we provide adversarial generalization error bounds for the networks under study, which are tight with respect to the attack level. To our knowledge, this is the first theoretical analysis on the adversarial generalization of unfolding networks. We further present a series of experiments on real-world data, with results corroborating our derived theory, consistently for all data. Finally, we observe that the family's overparameterization can be exploited to promote adversarial robustness, shedding light on how to efficiently robustify neural networks.
