A Room to Roam: Reset Prediction Based on Physical Object Placement for Redirected Walking
Sulim Chun, Ho Jung Lee, In-Kwon Lee
TL;DR
This work tackles predicting and reducing resets in Redirected Walking by leveraging a Vision Transformer to estimate resets from top-down layouts of physical spaces, trained on simulation data. It introduces an interactive Unity-based interface that provides real-time feedback as users rearrange furniture, enabling rapid interior planning to minimize resets. Study 1 demonstrates that object placement significantly affects resets, while Study 2 achieves RMSE $= 23.88$, MAE $= 15.36$, and $R^2=0.91$, validating the predictive capability and its practical utility. Overall, the approach offers a fast alternative to time-consuming simulations or multiple user studies, facilitating more immersive RDW experiences within real-world spatial constraints.
Abstract
In Redirected Walking (RDW), resets are an overt method that explicitly interrupts users, and they should be avoided to provide a quality user experience. The number of resets depends on the configuration of the physical environment; thus, inappropriate object placement can lead to frequent resets, causing motion sickness and degrading presence. However, estimating the number of resets based on the physical layout is challenging. It is difficult to measure reset frequency with real users repeatedly testing different layouts, and virtual simulations offer limited real-time verification. As a result, while rearranging objects can reduce resets, users have not been able to fully take advantage of this opportunity, highlighting the need for rapid assessment of object placement. To address this, in Study 1, we collected simulation data and analyzed the average number of resets for various object placements. In study 2, we developed a system that allows users to evaluate reset frequency using a real-time placement interface powered by the first learning-based reset prediction model. Our model predicts resets from top-down views of the physical space, leveraging a Vision Transformer architecture. The model achieved a root mean square error (RMSE) of $23.88$. We visualized the model's attention scores using heatmaps to analyze the regions of focus during prediction. Through the interface, users can reorganize furniture while instantly observing the change in the predicted number of resets, thus improving their interior for a better RDW experience with fewer resets.
