Confidence-aware 3D Gaze Estimation and Evaluation Metric
Qiaojie Zheng, Xiaoli Zhang
TL;DR
This work tackles unreliable and overconfident appearance-based 3D gaze estimation by introducing a confidence-aware network that outputs both gaze angles and their uncertainties. It employs a heteroskedastic loss and an end-to-end architecture built on Resnet18 per eye, with the overall uncertainty taken as the maximum of the two per-angle uncertainties. To evaluate uncertainty effectiveness, the authors propose a causal correlation metric that links eye feature degradation (via controllable corruptions) to inferred uncertainty, showing superior discrimination relative to traditional angular-error correlations. Experimental results on MPII-Gaze and RTGene demonstrate strong per-corruption uncertainty signaling (e.g., correlation $C\approx0.95$) and robust cross-dataset behavior, suggesting practical benefits for safer HMI deployment where unreliable gaze estimates can be flagged or discarded before action. The proposed framework advances both gaze estimation and uncertainty evaluation, enabling more trustworthy, real-time gaze-enabled interactions in variable visual conditions.
Abstract
Deep learning appearance-based 3D gaze estimation is gaining popularity due to its minimal hardware requirements and being free of constraint. Unreliable and overconfident inferences, however, still limit the adoption of this gaze estimation method. To address the unreliable and overconfident issues, we introduce a confidence-aware model that predicts uncertainties together with gaze angle estimations. We also introduce a novel effectiveness evaluation method based on the causality between eye feature degradation and the rise in inference uncertainty to assess the uncertainty estimation. Our confidence-aware model demonstrates reliable uncertainty estimations while providing angular estimation accuracies on par with the state-of-the-art. Compared with the existing statistical uncertainty-angular-error evaluation metric, the proposed effectiveness evaluation approach can more effectively judge inferred uncertainties' performance at each prediction.
