Towards Robust Neural Networks via Orthogonal Diversity
Kun Fang, Qinghua Tao, Yingwen Wu, Tao Li, Jia Cai, Feipeng Cai, Xiaolin Huang, Jie Yang
TL;DR
The paper tackles adversarial robustness by moving beyond data augmentation to a model-centric strategy called DIO (Diversity via Orthogonality). It inserts $L$ mutually orthogonal heads after a shared backbone and enforces diversity with an orthogonality loss $\mathcal{L}_o$ and a margin-based distance loss $\mathcal{L}_d$, optimizing the joint objective $\mathcal{L}_c + \alpha\mathcal{L}_o + \beta\mathcal{L}_d$. Empirically, DIO improves robustness against both white-box and black-box attacks across CIFAR10/100 and TinyImageNet, and its effectiveness is amplified when combined with data-augmentation defenses like AT, TRADES, GAIRAT, LBGAT, AWP, or DDPM-generated data. Ablation studies confirm the complementary roles of $\mathcal{L}_o$ and $\mathcal{L}_d$. The work highlights the value of model-centered diversification for robustness, while noting costs and integration considerations for broader applicability.
Abstract
Deep Neural Networks (DNNs) are vulnerable to invisible perturbations on the images generated by adversarial attacks, which raises researches on the adversarial robustness of DNNs. A series of methods represented by the adversarial training and its variants have proven as one of the most effective techniques in enhancing the DNN robustness. Generally, adversarial training focuses on enriching the training data by involving perturbed data. Such data augmentation effect of the involved perturbed data in adversarial training does not contribute to the robustness of DNN itself and usually suffers from clean accuracy drop. Towards the robustness of DNN itself, we in this paper propose a novel defense that aims at augmenting the model in order to learn features that are adaptive to diverse inputs, including adversarial examples. More specifically, to augment the model, multiple paths are embedded into the network, and an orthogonality constraint is imposed on these paths to guarantee the diversity among them. A margin-maximization loss is then designed to further boost such DIversity via Orthogonality (DIO). In this way, the proposed DIO augments the model and enhances the robustness of DNN itself as the learned features can be corrected by these mutually-orthogonal paths. Extensive empirical results on various data sets, structures and attacks verify the stronger adversarial robustness of the proposed DIO utilizing model augmentation. Besides, DIO can also be flexibly combined with different data augmentation techniques (e.g., TRADES and DDPM), further promoting robustness gains.
