Facial Expression Recognition with Controlled Privacy Preservation and Feature Compensation
Feng Xu, David Ahmedt-Aristizabal, Lars Petersson, Dadong Wang, Xun Li
TL;DR
This paper tackles privacy concerns in video-based facial expression recognition (FER) by introducing a dual-frequency privacy-preservation framework that uses a wavelet transform to separately remove identity features from high- and low-frequency components. It decouples privacy and utility tasks with a privacy-enhancement controller per frequency and a feature compensator that enriches FER-relevant information, plus a privacy leakage validator to quantify residual identity information. On CREMA-D, the approach achieves a FER accuracy of $78.84\%$ with a privacy leakage ratio of $2.01\%$, outperforming several baselines including Gaussian blur, optical flow, and image swapping, while maintaining robust FER performance. The work provides a practical mechanism for quantifying privacy leakage in closed-set FER and demonstrates a scalable pathway toward secure, video-based FER applications in privacy-sensitive settings.
Abstract
Facial expression recognition (FER) systems raise significant privacy concerns due to the potential exposure of sensitive identity information. This paper presents a study on removing identity information while preserving FER capabilities. Drawing on the observation that low-frequency components predominantly contain identity information and high-frequency components capture expression, we propose a novel two-stream framework that applies privacy enhancement to each component separately. We introduce a controlled privacy enhancement mechanism to optimize performance and a feature compensator to enhance task-relevant features without compromising privacy. Furthermore, we propose a novel privacy-utility trade-off, providing a quantifiable measure of privacy preservation efficacy in closed-set FER tasks. Extensive experiments on the benchmark CREMA-D dataset demonstrate that our framework achieves 78.84% recognition accuracy with a privacy (facial identity) leakage ratio of only 2.01%, highlighting its potential for secure and reliable video-based FER applications.
