Pixel-Based Similarities as an Alternative to Neural Data for Improving Convolutional Neural Network Adversarial Robustness
Elie Attias, Cengiz Pehlevan, Dina Obeid
TL;DR
The paper tackles adversarial robustness in CNNs by revisiting a brain-inspired regularizer and proposing a practical, neural-data-free variant based on pixel similarities. It preserves the core regularization objective and demonstrates robustness gains against several black-box attacks and common corruptions, including grayscale and color CIFAR datasets, with low computational overhead. The key insight is that pixel-based similarities can substitute neural target structures, yielding performance comparable to neural-data regularization while avoiding data collection burdens. While not beating state-of-the-art specialized defenses, the work highlights that brain-inspired principles can be exploited in simple, scalable ways, potentially enabling future hybrids that bring model robustness closer to human-level performance without complex pipelines.
Abstract
Convolutional Neural Networks (CNNs) excel in many visual tasks but remain susceptible to adversarial attacks-imperceptible perturbations that degrade performance. Prior research reveals that brain-inspired regularizers, derived from neural recordings, can bolster CNN robustness; however, reliance on specialized data limits practical adoption. We revisit a regularizer proposed by Li et al. (2019) that aligns CNN representations with neural representational similarity structures and introduce a data-driven variant. Instead of a neural recording-based similarity, our method computes a pixel-based similarity directly from images. This substitution retains the original biologically motivated loss formulation, preserving its robustness benefits while removing the need for neural measurements or task-specific augmentations. Notably, this data-driven variant provides the same robustness improvements observed with neural data. Our approach is lightweight and integrates easily into standard pipelines. Although we do not surpass cutting-edge specialized defenses, we show that neural representational insights can be leveraged without direct recordings. This underscores the promise of robust yet simple methods rooted in brain-inspired principles, even without specialized data, and raises the possibility that further integrating these insights could push performance closer to human levels without resorting to complex, specialized pipelines.
