EM-DARTS: Hierarchical Differentiable Architecture Search for Eye Movement Recognition
Huafeng Qin, Hongyu Zhu, Xin Jin, Xin Yu, Mounim A. El-Yacoubi, Shuqiang Yang
TL;DR
EM-DARTS presents a hierarchical differentiable NAS framework for eye movement recognition, addressing limitations of conventional DARTS by enabling independent local-cell architectures and alternated global optimization to align search and evaluation. A transfer entropy–based pruning step reduces redundancy, yielding compact networks with strong verification performance. Across GazeBase, JuDo1000, and EMglasses, EM-DARTS achieves state-of-the-art EERs and robust recognition under occlusion, while maintaining competitive search efficiency. The approach carries practical implications for mobile and wearable biometric systems by delivering high accuracy with reduced architectural redundancy and computation.
Abstract
Eye movement biometrics has received increasing attention thanks to its highly secure identification. Although deep learning (DL) models have shown success in eye movement recognition, their architectures largely rely on human prior knowledge. Differentiable Neural Architecture Search (DARTS) automates the manual process of architecture design with high search efficiency. However, DARTS typically stacks multiple cells to form a convolutional network, which limits the diversity of architecture. Furthermore, DARTS generally searches for architectures using shallower networks than those used in the evaluation, creating a significant disparity in architecture depth between the search and evaluation phases. To address this issue, we propose EM-DARTS, a hierarchical differentiable architecture search algorithm to automatically design the DL architecture for eye movement recognition. First, we define a supernet and propose a global and local alternate Neural Architecture Search method to search the optimal architecture alternately with a differentiable neural architecture search. The local search strategy aims to find an optimal architecture for different cells while the global search strategy is responsible for optimizing the architecture of the target network. To minimize redundancy, transfer entropy is proposed to compute the information amount of each layer, thereby further simplifying the network search process. Experimental results on three public datasets demonstrate that the proposed EM-DARTS is capable of producing an optimal architecture that leads to state-of-the-art recognition performance, {Specifically, the recognition models developed using EM-DARTS achieved the lowest EERs of 0.0453 on the GazeBase dataset, 0.0377 on the JuDo1000 dataset, and 0.1385 on the EMglasses dataset.
