Table of Contents
Fetching ...

FocusFlow: 3D Gaze-Depth Interaction in Virtual Reality Leveraging Active Visual Depth Manipulation

Chenyang Zhang, Tiansu Chen, Eric Shaffer, Elahe Soltanaghai

TL;DR

This paper tackles enabling hands-free gaze-based interaction in VR by exploiting visual depth as an input dimension, addressing depth-estimation noise and the Midas Touch problem. It introduces FocusFlow, a system with a layer-based UI and a Virtual Window that uses adaptive visual cues and two learning strategies to train users in depth control. Through a pilot study and a 24-participant user study, the authors show that adaptive cues improve depth perception, reduce false triggers, and achieve activation times around 1.3 seconds, illustrating practical viability. Limitations include small sample size and fatigue concerns, with future work pointing toward multi-modal integration and continuous depth inputs to broaden applicability.

Abstract

Gaze interaction presents a promising avenue in Virtual Reality (VR) due to its intuitive and efficient user experience. Yet, the depth control inherent in our visual system remains underutilized in current methods. In this study, we introduce FocusFlow, a hands-free interaction method that capitalizes on human visual depth perception within the 3D scenes of Virtual Reality. We first develop a binocular visual depth detection algorithm to understand eye input characteristics. We then propose a layer-based user interface and introduce the concept of 'Virtual Window' that offers an intuitive and robust gaze-depth VR interaction, despite the constraints of visual depth accuracy and precision spatially at further distances. Finally, to help novice users actively manipulate their visual depth, we propose two learning strategies that use different visual cues to help users master visual depth control. Our user studies on 24 participants demonstrate the usability of our proposed virtual window concept as a gaze-depth interaction method. In addition, our findings reveal that the user experience can be enhanced through an effective learning process with adaptive visual cues, helping users to develop muscle memory for this brand-new input mechanism. We conclude the paper by discussing strategies to optimize learning and potential research topics of gaze-depth interaction.

FocusFlow: 3D Gaze-Depth Interaction in Virtual Reality Leveraging Active Visual Depth Manipulation

TL;DR

This paper tackles enabling hands-free gaze-based interaction in VR by exploiting visual depth as an input dimension, addressing depth-estimation noise and the Midas Touch problem. It introduces FocusFlow, a system with a layer-based UI and a Virtual Window that uses adaptive visual cues and two learning strategies to train users in depth control. Through a pilot study and a 24-participant user study, the authors show that adaptive cues improve depth perception, reduce false triggers, and achieve activation times around 1.3 seconds, illustrating practical viability. Limitations include small sample size and fatigue concerns, with future work pointing toward multi-modal integration and continuous depth inputs to broaden applicability.

Abstract

Gaze interaction presents a promising avenue in Virtual Reality (VR) due to its intuitive and efficient user experience. Yet, the depth control inherent in our visual system remains underutilized in current methods. In this study, we introduce FocusFlow, a hands-free interaction method that capitalizes on human visual depth perception within the 3D scenes of Virtual Reality. We first develop a binocular visual depth detection algorithm to understand eye input characteristics. We then propose a layer-based user interface and introduce the concept of 'Virtual Window' that offers an intuitive and robust gaze-depth VR interaction, despite the constraints of visual depth accuracy and precision spatially at further distances. Finally, to help novice users actively manipulate their visual depth, we propose two learning strategies that use different visual cues to help users master visual depth control. Our user studies on 24 participants demonstrate the usability of our proposed virtual window concept as a gaze-depth interaction method. In addition, our findings reveal that the user experience can be enhanced through an effective learning process with adaptive visual cues, helping users to develop muscle memory for this brand-new input mechanism. We conclude the paper by discussing strategies to optimize learning and potential research topics of gaze-depth interaction.
Paper Structure (39 sections, 21 figures)

This paper contains 39 sections, 21 figures.

Figures (21)

  • Figure 1: Depth calculation and sensitivity analysis.
  • Figure 2: Experiment settings. The participants are required to look at three static objects located at 0.5m, 1.0m, and 2.0m away from themselves. Then they are asked to follow a moving target moving back and forth between 0.5m and 2.5m away with their gaze.
  • Figure 3: Visual depth detection results. (a) per-participant histogram of estimated depths for three static targets at different distances; (b) snapshot of estimated depth fluctuations for the moving target with and without de-noising.
  • Figure 4: Layer-based UI. The layers are arranged at different depths along the z-direction. The user can match the corresponding layers by changing the visual depth. The matched layers will be activated.
  • Figure 5: Activation logic. (a) Pointing: When the user is looking at the target object in VR environment, the gaze point falls on the object and the visual depth exceeds the activation zone. (b) Depth Shift: When the user's visual depth falls in the activation zone, the Virtual Window will be activated. (c) The gaze depth change in multiple activations.
  • ...and 16 more figures