Defending Against Repetitive Backdoor Attacks on Semi-supervised Learning through Lens of Rate-Distortion-Perception Trade-off
Cheng-Yi Lee, Ching-Chia Kao, Cheng-Han Yeh, Chun-Shien Lu, Chia-Mu Yu, Chu-Song Chen
TL;DR
This paper addresses backdoor attacks in semi-supervised learning by defending unlabeled data without requiring clean labeled data. The core idea, UPure, purifies unlabeled data in the frequency domain by perturbing high-frequency DCT components within a region sized by $\tau\times\tau$, guided by Rate-Distortion-Perception (RDP) trade-offs. The authors provide a theoretical justification for the perturbation region and demonstrate that UPure markedly reduces attack success rates to near zero across multiple SSL algorithms and datasets, while preserving benign accuracy. Empirically, UPure outperforms several state-of-the-art defenses and remains effective against repetitive and visible/invisible triggers, highlighting its practical impact for secure SSL in real-world data pipelines.
Abstract
Semi-supervised learning (SSL) has achieved remarkable performance with a small fraction of labeled data by leveraging vast amounts of unlabeled data from the Internet. However, this large pool of untrusted data is extremely vulnerable to data poisoning, leading to potential backdoor attacks. Current backdoor defenses are not yet effective against such a vulnerability in SSL. In this study, we propose a novel method, Unlabeled Data Purification (UPure), to disrupt the association between trigger patterns and target classes by introducing perturbations in the frequency domain. By leveraging the Rate-Distortion-Perception (RDP) trade-off, we further identify the frequency band, where the perturbations are added, and justify this selection. Notably, UPure purifies poisoned unlabeled data without the need of extra clean labeled data. Extensive experiments on four benchmark datasets and five SSL algorithms demonstrate that UPure effectively reduces the attack success rate from 99.78% to 0% while maintaining model accuracy. Code is available here: \url{https://github.com/chengyi-chris/UPure}.
