EdgeEar: Efficient and Accurate Ear Recognition for Edge Devices
Camile Lendering, Bernardo Perrone Ribeiro, Žiga Emeršič, Peter Peer
TL;DR
EdgeEar tackles the need for accurate ear recognition on resource-limited edge devices by presenting a lightweight hybrid CNN–Transformer model that incorporates selective low-rank linear layers (LoRaLin) within SDTA modules. With fewer than $2\ \mathrm{M}$ parameters, EdgeEar achieves a competitive $EER=0.143$, $AUC=0.904$, and $R1=0.929$ on the UERC2023 benchmark, while reducing computational cost dramatically. The method introduces a targeted replacement strategy for the QKV projections and demonstrates through ablations that CE with label smoothing best preserves performance. This work shows that compact, edge-optimized ear biometrics are feasible and highlights the importance of larger, diverse datasets to reduce demographic bias and advance real-world deployment.
Abstract
Ear recognition is a contactless and unobtrusive biometric technique with applications across various domains. However, deploying high-performing ear recognition models on resource-constrained devices is challenging, limiting their applicability and widespread adoption. This paper introduces EdgeEar, a lightweight model based on a proposed hybrid CNN-transformer architecture to solve this problem. By incorporating low-rank approximations into specific linear layers, EdgeEar reduces its parameter count by a factor of 50 compared to the current state-of-the-art, bringing it below two million while maintaining competitive accuracy. Evaluation on the Unconstrained Ear Recognition Challenge (UERC2023) benchmark shows that EdgeEar achieves the lowest EER while significantly reducing computational costs. These findings demonstrate the feasibility of efficient and accurate ear recognition, which we believe will contribute to the wider adoption of ear biometrics.
