Deep Learning Approach for Ear Recognition and Longitudinal Evaluation in Children
Afzal Hossain, Tipu Sultan, Stephanie Schuckers
TL;DR
The paper addresses the challenge of ear-based biometrics in children, where rapid ear development impairs longitudinal identification. It introduces a deep learning pipeline that uses Mask R-CNN for ear segmentation and a VGG16-MobileNet ensemble for feature extraction, evaluated on a newly collected longitudinal dataset of children aged 4–14 over 2.5 years, plus an adult IITD baseline. Key findings show strong within-session performance (TAR > 90%, FAR ≈ 2%) but substantial drop in cross-session accuracy over time (55–76% TAR across 30 months), with especially low performance for sub-8-year-olds due to rapid growth. The study highlights the need for adaptive, time-aware recognition strategies and potentially multi-modal approaches to robustly identify children across developmental stages.
Abstract
Ear recognition as a biometric modality is becoming increasingly popular, with promising broader application areas. While current applications involve adults, one of the challenges in ear recognition for children is the rapid structural changes in the ear as they age. This work introduces a foundational longitudinal dataset collected from children aged 4 to 14 years over a 2.5-year period and evaluates ear recognition performance in this demographic. We present a deep learning based approach for ear recognition, using an ensemble of VGG16 and MobileNet, focusing on both adult and child datasets, with an emphasis on longitudinal evaluation for children.
