How Deep Is Your Gaze? Leveraging Distance in Image-Based Gaze Analysis
Maurice Koch, Nelusa Pathmanathan, Daniel Weiskopf, Kuno Kurzhals
TL;DR
This work tackles the issue of depth-induced variability in image-based gaze analysis by introducing depth-adaptive thumbnails that scale according to eye-to-object distance, preserving a consistent visual focus. The authors implement two patch strategies (classic fixed-angle and depth-adaptive constant-length) and evaluate them in AR using scanpath similarity (Smith-Waterman with 512-D ResNet features) and visualization methods (Gaze Stripes and image-based projections). Results show depth-adaptive thumbnails improve analysis quality, especially for mid and large patches, and enhance visualization coherence, demonstrated on a benchmark AR dataset of real and virtual objects across multiple distances. While promising, the study notes limitations in depth estimation robustness and feature representation, and calls for broader evaluation with diverse hardware and live-depth sensing scenarios.
Abstract
Image thumbnails are a valuable data source for fixation filtering, scanpath classification, and visualization of eye tracking data. They are typically extracted with a constant size, approximating the foveated area. As a consequence, the focused area of interest in the scene becomes less prominent in the thumbnail with increasing distance, affecting image-based analysis techniques. In this work, we propose depth-adaptive thumbnails, a method for varying image size according to the eye-to-object distance. Adjusting the visual angle relative to the distance leads to a zoom effect on the focused area. We evaluate our approach on recordings in augmented reality, investigating the similarity of thumbnails and scanpaths. Our quantitative findings suggest that considering the eye-to-object distance improves the quality of data analysis and visualization. We demonstrate the utility of depth-adaptive thumbnails for applications in scanpath comparison and visualization.
