Deep Tiny Network for Recognition-Oriented Face Image Quality Assessment
Baoyun Peng, Min Liu, Zhaoning Zhang, Kai Xu, Dongsheng Li
TL;DR
The paper addresses recognition instability from low-quality face images in video sequences by introducing a recognition-oriented, non-reference FIQA method. It defines a quality metric via quality(x) = cosine(f_x, u_y) that directly links image quality to FR performance, and develops tinyFQnet, an ultra-efficient 21.8k-parameter network trained to predict this quality score. The approach enables automatic generation of large-scale quality labels and includes a data-balancing strategy to improve training effectiveness. Extensive experiments on IJB-B, IJB-C, and YTF show that tinyFQnet outperforms perceptual FIQA baselines and several learning-based methods while requiring far less computation, making it well-suited as a plug-in for resource-constrained FR systems. Limitations include reliance on cosine-based similarity and the need to validate generalizability across more diverse datasets and metrics, suggesting avenues for future work.
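As a minimal sketch of how quality labels of the form quality(x) = cosine(f_x, u_y) might be generated automatically, the snippet below scores each embedding against a per-identity reference vector. It assumes that u_y is the normalized mean of the L2-normalized embeddings of identity y; the summary above does not spell out this definition. The function names `quality_labels` and `_normalize` are hypothetical, and the embeddings f_x are taken as given, standing in for the output of whatever FR model is used.

```python
import math
from collections import defaultdict


def _normalize(v):
    """Return v scaled to unit L2 norm (unchanged if it is the zero vector)."""
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v] if norm > 0 else list(v)


def quality_labels(embeddings, identities):
    """Score each embedding f_x by its cosine similarity to u_y.

    Assumption: u_y is the normalized mean of the unit-norm embeddings
    that share identity y. This is an illustrative choice, not
    necessarily the paper's exact definition.
    """
    unit = [_normalize(e) for e in embeddings]

    # Group sample indices by identity label.
    by_id = defaultdict(list)
    for idx, y in enumerate(identities):
        by_id[y].append(idx)

    scores = [0.0] * len(unit)
    for idxs in by_id.values():
        dim = len(unit[idxs[0]])
        mean = [sum(unit[i][d] for i in idxs) / len(idxs) for d in range(dim)]
        center = _normalize(mean)
        for i in idxs:
            # Both vectors are unit norm, so the dot product is the cosine.
            scores[i] = sum(a * b for a, b in zip(unit[i], center))
    return scores


if __name__ == "__main__":
    # Toy 2-D embeddings for one identity; the third sample is an outlier.
    emb = [[1.0, 0.0], [2.0, 0.0], [0.0, 3.0]]
    print(quality_labels(emb, ["a", "a", "a"]))
```

Under this assumption, an image whose embedding lies far from its identity's center receives a low score, which is the sense in which the label ties quality to recognition behavior. A small network could then be trained to regress these scores directly from the image.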
Abstract
Face recognition has made significant progress in recent years due to deep convolutional neural networks (CNNs). In many face recognition (FR) scenarios, face images are acquired from a sequence with huge intra-variations. These intra-variations, which are mainly caused by low-quality face images, lead to unstable recognition performance. Previous works have focused on ad-hoc methods for selecting frames from a video, or on face image quality assessment (FIQA) methods that consider only a single distortion or a combination of several distortions. In this work, we present an efficient non-reference image quality assessment for FR that directly links image quality assessment (IQA) and FR. More specifically, we propose a new measurement to evaluate image quality without any reference. Based on the proposed quality measurement, we propose a deep Tiny Face Quality network (tinyFQnet) to learn a quality prediction function from data. We evaluate the proposed method with several powerful FR models on two classical video-based (or template-based) benchmarks: IJB-B and YTF. Extensive experiments show that, although tinyFQnet is much smaller than the others, the proposed method outperforms state-of-the-art quality assessment methods in terms of both effectiveness and efficiency.
