High-frequency near-eye ground truth for event-based eye tracking
Andrea Simpsi, Andrea Aspesi, Simone Mentasti, Luca Merigo, Tommaso Ongarello, Matteo Matteucci
TL;DR
The paper addresses the lack of high-frequency, eye-level ground truth for event-based eye tracking by introducing a semi-automatic annotation pipeline that converts asynchronous event streams into 200 Hz RGB frames, detects eye movements, and estimates pupil centers using template matching and RANSAC, followed by human refinement. Applied to the Angelopoulos dataset, it provides 200 Hz pupil-center annotations along with blink and saccade labels, significantly enriching ground truth for training and evaluation. The approach reduces manual effort and enhances the reliability of pupil tracking in event-based systems, enabling more capable, low-power eye-tracking in smart eyewear. Overall, the work strengthens datasets in the emerging field of event-based eye tracking and supports faster development of real-time algorithms for near-eye devices.
Abstract
Event-based eye tracking is a promising solution for efficient and low-power eye tracking in smart eyewear technologies. However, the novelty of event-based sensors has resulted in a limited number of available datasets, particularly those with eye-level annotations, crucial for algorithm validation and deep-learning training. This paper addresses this gap by presenting an improved version of a popular event-based eye-tracking dataset. We introduce a semi-automatic annotation pipeline specifically designed for event-based data annotation. Additionally, we provide the scientific community with the computed annotations for pupil detection at 200Hz.
