Table of Contents
Fetching ...

Beyond Ray-Casting: Evaluating Controller, Free-Hand, and Virtual-Touch Modalities for Immersive Text Entry

Md. Tanvir Hossain, Mohd Ruhul Ameen, Akif Islam, Md. Omar Faruqe, Mahboob Qaosar, A. F. M. Mahbubur Rahman, Sanjoy Kumar Chakravarty, M. Khademul Islam Molla

Abstract

Efficient text entry remains a primary bottleneck preventing Virtual Reality (VR) from evolving into a viable productivity platform. To address this, we conducted an empirical comparison of six physical input systems across three interaction styles Controller Driven, Free Hand, and Virtual Touch evaluating both discrete tap typing and continuous gesture typing (swiping), alongside a speech to text (Voice) condition as a non physical reference modality. Results from 21 participants show that the Controller Driven Tap Gesture Combo (CD TGC) delivers the best productivity performance, achieving speeds 2.25 times higher than the slowest system and 30% faster than the current industry standard, while reducing error rates by up to 68%. A clear trade off emerged between performance and perceived usability: although controller based gesture input led on speed and accuracy, participants rated Virtual Touch Tap Typing highest in subjective experience, scoring 80% higher on the System Usability Scale (SUS) than the lowest rated alternative. We further observe that Free Hand interaction remains limited by tracking stability and physical fatigue, whereas Voice input introduces practical constraints related to privacy, editing control, and immersive engagement. Together, these findings characterize the tension between throughput and natural interaction in immersive text entry and provide data driven guidance for future VR interface design.

Beyond Ray-Casting: Evaluating Controller, Free-Hand, and Virtual-Touch Modalities for Immersive Text Entry

Abstract

Efficient text entry remains a primary bottleneck preventing Virtual Reality (VR) from evolving into a viable productivity platform. To address this, we conducted an empirical comparison of six physical input systems across three interaction styles Controller Driven, Free Hand, and Virtual Touch evaluating both discrete tap typing and continuous gesture typing (swiping), alongside a speech to text (Voice) condition as a non physical reference modality. Results from 21 participants show that the Controller Driven Tap Gesture Combo (CD TGC) delivers the best productivity performance, achieving speeds 2.25 times higher than the slowest system and 30% faster than the current industry standard, while reducing error rates by up to 68%. A clear trade off emerged between performance and perceived usability: although controller based gesture input led on speed and accuracy, participants rated Virtual Touch Tap Typing highest in subjective experience, scoring 80% higher on the System Usability Scale (SUS) than the lowest rated alternative. We further observe that Free Hand interaction remains limited by tracking stability and physical fatigue, whereas Voice input introduces practical constraints related to privacy, editing control, and immersive engagement. Together, these findings characterize the tension between throughput and natural interaction in immersive text entry and provide data driven guidance for future VR interface design.
Paper Structure (29 sections, 7 figures)

This paper contains 29 sections, 7 figures.

Figures (7)

  • Figure 1: Visual comparison of the six physical interaction systems. The top row displays the discrete Tap-Typing (TT) modes, while the bottom row displays the continuous Tap-Gesture Combo (TGC) modes where users swipe through letters.
  • Figure 2: The custom Data Collection Application UI running on the desktop, streaming to the VR headset. It handles phrase presentation, WPM calculation, and questionnaire switching.
  • Figure 3: In-Game GEQ results. VT-TT and CD-TGC score highest in Competence and Flow, while Free-Hand systems suffer from high Tension and Negative Affect.
  • Figure 4: Median Words Per Minute (WPM) vs. Median Total Error Rate (TER) across all physical systems. CD-TGC demonstrates the optimal balance of high speed (17.09 WPM) and low error rate (5.80%).
  • Figure 5: User preferences for Physical Systems. Virtual-Touch Tap-Typing (VT-TT) was the clear favorite among participants.
  • ...and 2 more figures