Purify++: Improving Diffusion-Purification with Advanced Diffusion Models and Control of Randomness
Boya Zhang, Weijian Luo, Zhihua Zhang
TL;DR
Purify++ addresses adversarial robustness by enhancing diffusion purification through three targeted improvements: a stronger diffusion model (EDM/VE-based forward diffusion), efficient yet effective purification simulation (favoring stable, low-NFE Heun-style updates), and an optimal randomness control that mixes ODE and Langevin dynamics. It demonstrates that these components yield state-of-the-art robustness on CIFAR-10 and MNIST across black-box, gray-box, and adaptive attacks, while maintaining solid standard accuracy. The work provides both theoretical and empirical insights into diffusion-purification design, including how forward diffusion choice and stochasticity influence purification effectiveness. Practically, Purify++ offers a more reliable, compute-budget-friendly purification paradigm that can be deployed without retraining classifiers.
Abstract
Adversarial attacks can mislead neural network classifiers. The defense against adversarial attacks is important for AI safety. Adversarial purification is a family of approaches that defend adversarial attacks with suitable pre-processing. Diffusion models have been shown to be effective for adversarial purification. Despite their success, many aspects of diffusion purification still remain unexplored. In this paper, we investigate and improve upon three limiting designs of diffusion purification: the use of an improved diffusion model, advanced numerical simulation techniques, and optimal control of randomness. Based on our findings, we propose Purify++, a new diffusion purification algorithm that is now the state-of-the-art purification method against several adversarial attacks. Our work presents a systematic exploration of the limits of diffusion purification methods.
