VRCopilot: Authoring 3D Layouts with Generative AI Models in VR
Lei Zhang, Jin Pan, Jacob Gettig, Steve Oney, Anhong Guo
TL;DR
VRCopilot tackles immersive authoring in VR by integrating pre-trained generative models to co-create indoor layouts. It introduces multimodal specification and intermediate wireframe representations to improve controllability and support rapid prototyping. Two user studies show that scaffolded wireframes enhance user agency, while manual multimodal creation offers the highest sense of creativity; automatic creation yields lower creativity and agency. The findings highlight design opportunities for transparent AI assistance, multiple alternatives, and intermediate representations to balance creativity and control in VR-based design tasks.
Abstract
Immersive authoring provides an intuitive medium for users to create 3D scenes via direct manipulation in Virtual Reality (VR). Recent advances in generative AI have enabled the automatic creation of realistic 3D layouts. However, it is unclear how capabilities of generative AI can be used in immersive authoring to support fluid interactions, user agency, and creativity. We introduce VRCopilot, a mixed-initiative system that integrates pre-trained generative AI models into immersive authoring to facilitate human-AI co-creation in VR. VRCopilot presents multimodal interactions to support rapid prototyping and iterations with AI, and intermediate representations such as wireframes to augment user controllability over the created content. Through a series of user studies, we evaluated the potential and challenges in manual, scaffolded, and automatic creation in immersive authoring. We found that scaffolded creation using wireframes enhanced the user agency compared to automatic creation. We also found that manual creation via multimodal specification offers the highest sense of creativity and agency.
