Which Experimental Design is Better Suited for VQA Tasks? Eye Tracking Study on Cognitive Load, Performance, and Gaze Allocations
Sita A. Vriend, Sandeep Vidyapu, Amer Rama, Kun-Ting Chen, Daniel Weiskopf
TL;DR
This study addresses how the order of image stimuli and questions, together with question modality, shape cognitive load, accuracy, and gaze in visual question answering (VQA). It compares five designs using eye-tracking and subjective/cobjective measures (NASA-TLX, accuracy, HAAR, fixation duration) to identify designs that minimize extraneous cognitive burden. Key findings show that the IQ design is most taxing and least accurate, while designs like QI, IQI, and QIQ offer better performance with varied gaze patterns; the auditory AIA design may hinder comprehension. The results provide practical guidance for designing robust visualization experiments and gaze-based studies that rely on VQA tasks.
Abstract
We conducted an eye-tracking user study with 13 participants to investigate the influence of stimulus-question ordering and question modality on participants using visual question-answering (VQA) tasks. We examined cognitive load, task performance, and gaze allocations across five distinct experimental designs, aiming to identify setups that minimize the cognitive burden on participants. The collected performance and gaze data were analyzed using quantitative and qualitative methods. Our results indicate a significant impact of stimulus-question ordering on cognitive load and task performance, as well as a noteworthy effect of question modality on task performance. These findings offer insights for the experimental design of controlled user studies in visualization research.
