NMF-Based Analysis of Mobile Eye-Tracking Data
Daniel Klötzl, Tim Krake, Frank Heyen, Michael Becher, Maurice Koch, Daniel Weiskopf, Kuno Kurzhals
TL;DR
The paper addresses unsupervised identification of Areas of Interest (AOIs) in mobile eye-tracking data by applying nonnegative matrix factorization (NMF) to gaze-informed video frames. It introduces a preprocessing pipeline that crops image patches around gaze points, reduces temporal redundancy via fixations, and vectorizes frames into a matrix $X \in \mathbb{R}^{(3\cdot\tilde{n}_1\cdot\tilde{n}_2) \times (\tilde{m}_1+\dots+\tilde{m}_r)}$; NMF then computes $X = W H$ with $W \ge 0$, $H \ge 0$, yielding $k$ spatial components $w_j$ and temporal components $h_j$. Visualization ranks components by impact using $\|h_j\|_p$ and presents spatial images with temporal indicator plots for exploratory assessment. The method is demonstrated on nine mobile eye-tracking recordings of an art-gallery scene, extended to 27 recordings, showing successful identification of multiple AOIs and revealing scanpath patterns, while noting scalability constraints and potential future interactive enhancements.
Abstract
The depiction of scanpaths from mobile eye-tracking recordings by thumbnails from the stimulus allows the application of visual computing to detect areas of interest in an unsupervised way. We suggest using nonnegative matrix factorization (NMF) to identify such areas in stimuli. For a user-defined integer k, NMF produces an explainable decomposition into k components, each consisting of a spatial representation associated with a temporal indicator. In the context of multiple eye-tracking recordings, this leads to k spatial representations, where the temporal indicator highlights the appearance within recordings. The choice of k provides an opportunity to control the refinement of the decomposition, i.e., the number of areas to detect. We combine our NMF-based approach with visualization techniques to enable an exploratory analysis of multiple recordings. Finally, we demonstrate the usefulness of our approach with mobile eye-tracking data of an art gallery.
