SecureGaze: Defending Gaze Estimation Against Backdoor Attacks
Lingyu Du, Yupei Liu, Jinyuan Jia, Guohao Lan
TL;DR
Gaze estimation systems are vulnerable to backdoor attacks, with continuous output spaces and globally activated backdoors differentiating them from classification tasks. SecureGaze introduces a trigger-reversal framework that leverages feature-space and output-space observations to identify backdoors without enumerating infinite gaze outputs, using a generative trigger model and a sensitivity-aware search. The approach demonstrates strong backdoor identification (over 92-94% accuracy) and effective mitigation across diverse digital and physical attacks, outperforming adapted classification defenses. The work highlights the need for domain-specific defenses in regression-based vision tasks and provides a practical offline pipeline for secure deployment.
Abstract
Gaze estimation models are widely used in applications such as driver attention monitoring and human-computer interaction. While many methods for gaze estimation exist, they rely heavily on data-hungry deep learning to achieve high performance. This reliance often forces practitioners to harvest training data from unverified public datasets, outsource model training, or rely on pre-trained models. However, such practices expose gaze estimation models to backdoor attacks. In such attacks, adversaries inject backdoor triggers by poisoning the training data, creating a backdoor vulnerability: the model performs normally with benign inputs, but produces manipulated gaze directions when a specific trigger is present. This compromises the security of many gaze-based applications, such as causing the model to fail in tracking the driver's attention. To date, there is no defense that addresses backdoor attacks on gaze estimation models. In response, we introduce SecureGaze, the first solution designed to protect gaze estimation models from such attacks. Unlike classification models, defending gaze estimation poses unique challenges due to its continuous output space and globally activated backdoor behavior. By identifying distinctive characteristics of backdoored gaze estimation models, we develop a novel and effective approach to reverse-engineer the trigger function for reliable backdoor detection. Extensive evaluations in both digital and physical worlds demonstrate that SecureGaze effectively counters a range of backdoor attacks and outperforms seven state-of-the-art defenses adapted from classification models.
