NODE-AdvGAN: Improving the transferability and perceptual similarity of adversarial examples by dynamic-system-driven adversarial generative model
Xinheng Xie, Yue Wu, Cuiyu He
TL;DR
The paper tackles the challenge of creating adversarial examples that are both highly transferable and perceptually similar to the original images. It introduces NODE-AdvGAN, which models adversarial perturbation generation as a continuous-time dynamical process using a Neural Ordinary Differential Equation based generator, and NODE-AdvGAN-T, a training strategy that dynamically tunes the perturbation budget to boost transferability while keeping evaluation budgets fixed. Empirical results on FMNIST and CIFAR-10 demonstrate that NODE-AdvGAN and especially NODE-AdvGAN-T achieve higher attack success rates and preserve image quality relative to gradient-based methods and the original AdvGAN, in both white-box and black-box transfer settings. The work suggests that combining continuous-time dynamics with targeted noise-tuning can significantly improve adversarial robustness research, with potential applications in defense training, model evaluation, and further integration with diffusion-inspired generators.
Abstract
Understanding adversarial examples is crucial for improving model robustness, as they introduce imperceptible perturbations to deceive models. Effective adversarial examples, therefore, offer the potential to train more robust models by eliminating model singularities. We propose NODE-AdvGAN, a novel approach that treats adversarial generation as a continuous process and employs a Neural Ordinary Differential Equation (NODE) to simulate generator dynamics. By mimicking the iterative nature of traditional gradient-based methods, NODE-AdvGAN generates smoother and more precise perturbations that preserve high perceptual similarity when added to benign images. We also propose a new training strategy, NODE-AdvGAN-T, which enhances transferability in black-box attacks by tuning the noise parameters during training. Experiments demonstrate that NODE-AdvGAN and NODE-AdvGAN-T generate more effective adversarial examples that achieve higher attack success rates while preserving better perceptual quality than baseline models.
