MedBN: Robust Test-Time Adaptation against Malicious Test Samples
Hyejin Park, Jeongyeon Hwang, Sunung Mun, Sangdon Park, Jungseul Ok
TL;DR
Test-time adaptation methods are vulnerable to data-poisoning attacks that manipulate batch statistics. The authors propose Median Batch Normalization (MedBN), which uses the median to estimate BN statistics during test-time, replacing the conventional mean-based approach and enabling plug-in compatibility with existing TTA frameworks. They provide theoretical justification that mean-based statistics are easily corrupted by a single malicious sample, while the median remains robust unless malicious samples constitute a majority; empirical results on CIFAR10-C, CIFAR100-C, and ImageNet-C (and semantic segmentation tasks) show MedBN substantially improves resilience to both targeted and indiscriminate attacks with minimal loss on benign performance. Overall, MedBN offers a practical, architecture-agnostic defense that strengthens TTA against data-poisoning while preserving performance in benign conditions, making it suitable for real-world deployment.
Abstract
Test-time adaptation (TTA) has emerged as a promising solution to address performance decay due to unforeseen distribution shifts between training and test data. While recent TTA methods excel in adapting to test data variations, such adaptability exposes a model to vulnerability against malicious examples, an aspect that has received limited attention. Previous studies have uncovered security vulnerabilities within TTA even when a small proportion of the test batch is maliciously manipulated. In response to the emerging threat, we propose median batch normalization (MedBN), leveraging the robustness of the median for statistics estimation within the batch normalization layer during test-time inference. Our method is algorithm-agnostic, thus allowing seamless integration with existing TTA frameworks. Our experimental results on benchmark datasets, including CIFAR10-C, CIFAR100-C and ImageNet-C, consistently demonstrate that MedBN outperforms existing approaches in maintaining robust performance across different attack scenarios, encompassing both instant and cumulative attacks. Through extensive experiments, we show that our approach sustains the performance even in the absence of attacks, achieving a practical balance between robustness and performance.
