PureEBM: Universal Poison Purification via Mid-Run Dynamics of Energy-Based Models
Omead Pooladzandi, Jeffrey Jiang, Sunay Bhat, Gregory Pottie
TL;DR
Data poisoning threatens model reliability through imperceptible backdoors or triggerless perturbations. PureEBM introduces a universal purification preprocessor that uses mid-run Langevin dynamics of a convergent Energy-Based Model to move poisoned inputs toward the natural-data energy basin via the stochastic transform $Psi_T(x)$. The method achieves state-of-the-art defense across multiple attack types (BP, GM, NS, Narcissus) and remains effective even when the EBM is trained on poisoned or POOD data, with minimal impact on natural accuracy. Its preprocessor design, model- and dataset-agnostic applicability, and favorable compute-accuracy trade-offs enable practical deployment across diverse architectures and settings.
Abstract
Data poisoning attacks pose a significant threat to the integrity of machine learning models by leading to misclassification of target distribution data by injecting adversarial examples during training. Existing state-of-the-art (SoTA) defense methods suffer from limitations, such as significantly reduced generalization performance and significant overhead during training, making them impractical or limited for real-world applications. In response to this challenge, we introduce a universal data purification method that defends naturally trained classifiers from malicious white-, gray-, and black-box image poisons by applying a universal stochastic preprocessing step $Ψ_{T}(x)$, realized by iterative Langevin sampling of a convergent Energy Based Model (EBM) initialized with an image $x.$ Mid-run dynamics of $Ψ_{T}(x)$ purify poison information with minimal impact on features important to the generalization of a classifier network. We show that EBMs remain universal purifiers, even in the presence of poisoned EBM training data, and achieve SoTA defense on leading triggered and triggerless poisons. This work is a subset of a larger framework introduced in \pgen with a more detailed focus on EBM purification and poison defense.
