Egocentric Gaze Estimation via Neck-Mounted Camera
Haoyu Huang, Yoichi Sato
TL;DR
This work introduces neck-mounted view gaze estimation as a new egocentric gaze task, addressing the lack of neck-mounted data by collecting the first ~4 hours of synchronized head-mounted gaze and neck-mounted video from 8 participants. It benchmarked a transformer-based gaze model (GLC) on this domain and proposed two domain-specific augmentations: an auxiliary in-view classification and a multi-view co-learning scheme with latent feature alignment conditioned on the relative camera rotation. The auxiliary in-view classifier yielded modest gains over direct fine-tuning, while multi-view co-learning did not improve performance. The study provides a dataset, analysis of neck-mounted gaze characteristics (notably higher out-of-view rates and different center bias), and directions for future work, including scaling data and developing neck-tailored architectures with broader device support.
Abstract
This paper introduces neck-mounted view gaze estimation, a new task that estimates user gaze from the neck-mounted camera perspective. Prior work on egocentric gaze estimation, which predicts device wearer's gaze location within the camera's field of view, mainly focuses on head-mounted cameras while alternative viewpoints remain underexplored. To bridge this gap, we collect the first dataset for this task, consisting of approximately 4 hours of video collected from 8 participants during everyday activities. We evaluate a transformer-based gaze estimation model, GLC, on the new dataset and propose two extensions: an auxiliary gaze out-of-bound classification task and a multi-view co-learning approach that jointly trains head-view and neck-view models using a geometry-aware auxiliary loss. Experimental results show that incorporating gaze out-of-bound classification improves performance over standard fine-tuning, while the co-learning approach does not yield gains. We further analyze these results and discuss implications for neck-mounted gaze estimation.
