Feature Transfer Learning for Deep Face Recognition with Under-Represented Data
Xi Yin, Xiang Yu, Kihyuk Sohn, Xiaoming Liu, Manmohan Chandraker
TL;DR
Face recognition suffers when many subjects are under-represented (UR) with few samples. We propose Feature Transfer Learning (FTL), which transfers intra-class variance from regular classes to UR classes under a Gaussian variance prior across subjects, implemented via a center-based transfer module and aided by an alternating two-stage training regime and a metric $L_2$ regularization. The approach yields strong gains on LFW, IJB-A, and MS-Celeb-1M, supports one-shot/low-shot scenarios, and enables smooth feature interpolation that disentangles identity from non-identity variations. Together, these contributions demonstrate effective use of UR data to reduce classifier bias and improve discrimination, with potential applicability to other fine-grained recognition tasks.
Abstract
Despite the large volume of face recognition datasets, there is a significant portion of subjects, of which the samples are insufficient and thus under-represented. Ignoring such significant portion results in insufficient training data. Training with under-represented data leads to biased classifiers in conventionally-trained deep networks. In this paper, we propose a center-based feature transfer framework to augment the feature space of under-represented subjects from the regular subjects that have sufficiently diverse samples. A Gaussian prior of the variance is assumed across all subjects and the variance from regular ones are transferred to the under-represented ones. This encourages the under-represented distribution to be closer to the regular distribution. Further, an alternating training regimen is proposed to simultaneously achieve less biased classifiers and a more discriminative feature representation. We conduct ablative study to mimic the under-represented datasets by varying the portion of under-represented classes on the MS-Celeb-1M dataset. Advantageous results on LFW, IJB-A and MS-Celeb-1M demonstrate the effectiveness of our feature transfer and training strategy, compared to both general baselines and state-of-the-art methods. Moreover, our feature transfer successfully presents smooth visual interpolation, which conducts disentanglement to preserve identity of a class while augmenting its feature space with non-identity variations such as pose and lighting.
