Eyes on Many: Evaluating Gaze, Hand, and Voice for Multi-Object Selection in Extended Reality
Mohammad Raihanul Bashar, Aunnoy K Mutasim, Ken Pfeuffer, Anil Ufuk Batmaz
TL;DR
Eyes on Many investigates controller-free multi-object selection in XR by evaluating four mode-switching techniques and three subselection methods across moderate-to-large target sets. The study uses a within-subject design in a VR grid to measure speed, accuracy, and workload, and finds a strong advantage for persistent-mode toggles, particularly DoublePinch, combined with Gaze+Pinch subselection. SemiPinch underperforms due to instability and fatigue, while DoublePinch with Gaze+Pinch delivers the best balance of speed, accuracy, and efficiency. Voice reduces physical effort, but repeating commands for every subselection can be tedious. These results yield practical design guidelines for scalable, headset-native multi-selection in XR, emphasizing mode stability, efficient gaze confirmation, and careful use of hands-free input for high-level commands.
Abstract
Interacting with multiple objects simultaneously can make users faster. Before such interaction, users must select the objects, i.e., perform multi-object selection, which takes two steps: (1) toggling multi-selection mode (mode-switching) and then (2) selecting all the intended objects (subselection). In extended reality (XR), each step can be performed with the eyes, hands, or voice. To examine how design choices affect user performance, we evaluated four mode-switching techniques (SemiPinch, FullPinch, DoublePinch, and Voice) and three subselection techniques (Gaze+Dwell, Gaze+Pinch, and Gaze+Voice) in a user study. Results revealed that DoublePinch paired with Gaze+Pinch yielded the highest overall performance, whereas SemiPinch yielded the lowest. Although Voice-based mode-switching showed benefits, Gaze+Voice subselection was less favored because the required repetitive vocal commands were perceived as tedious. Overall, these findings provide empirical insights and inform design recommendations for multi-selection techniques in XR.
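The two-step structure described above (mode-switching, then subselection) can be illustrated with a minimal sketch. This is not the study's implementation: the class `MultiSelector`, its methods, and the target names are hypothetical, and the behavior outside multi-selection mode (a confirmation replaces the current selection) is an assumption made only for illustration. The sketch models a persistent-mode toggle, as a DoublePinch or a voice command might trigger, followed by per-target confirmations, as a pinch, dwell timeout, or voice command might trigger while the user looks at a target.

```python
from dataclasses import dataclass, field


@dataclass
class MultiSelector:
    """Hypothetical two-step selector: mode-switching, then subselection."""

    multi_mode: bool = False
    selected: set = field(default_factory=set)

    def toggle_mode(self) -> None:
        # Step 1 (mode-switching): flip a persistent multi-selection mode.
        # In an XR system this might be fired by a DoublePinch or a voice command.
        self.multi_mode = not self.multi_mode

    def confirm(self, gazed_target: str) -> None:
        # Step 2 (subselection): confirm the target currently under gaze.
        # Assumption: in multi-selection mode the target is added to the set;
        # otherwise it replaces the current selection.
        if self.multi_mode:
            self.selected.add(gazed_target)
        else:
            self.selected = {gazed_target}


selector = MultiSelector()
selector.toggle_mode()                      # enter multi-selection mode
for target in ["cube_1", "cube_4", "cube_7"]:
    selector.confirm(target)                # one confirmation per gazed target
selector.toggle_mode()                      # leave multi-selection mode
print(sorted(selector.selected))
```

Because the mode persists between confirmations, the user pays the mode-switching cost once per selection session rather than once per target, which is the property the persistent-mode toggles in the study share.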
