CARING-AI: Towards Authoring Context-aware Augmented Reality INstruction through Generative Artificial Intelligence
Jingyu Shi, Rahul Jain, Seungguen Chi, Hyungjun Doh, Hyunggun Chi, Alexander J. Quinn, Karthik Ramani
TL;DR
CARING-AI introduces a context-aware AR authoring system that leverages Generative AI to create humanoid avatar instructions grounded in real environments. By combining a two-dimensional design space (context: spatial/temporal; content: local/global) with a three-stage workflow—textual instruction refinement, environment grounding, and motion generation via diffusion models—the system delivers scalable, contextually blended AR guidance. Evaluation across a quantitative baseline and two user studies demonstrates improved temporal continuity, spatial accuracy, and usability compared with a PbD approach, while also highlighting limitations in hand-object interactions and generalizability. The work advances practical AR instruction authoring by enabling remote and ad hoc content creation with reduced hardware demands and demonstrates a path toward broader AI-generated modalities in AR guidance.
Abstract
Context-aware AR instruction enables adaptive and in-situ learning experiences. However, hardware limitations and expertise requirements constrain the creation of such instructions. With recent developments in Generative Artificial Intelligence (Gen-AI), current research tries to tackle these constraints by deploying AI-generated content (AIGC) in AR applications. However, our preliminary study with six AR practitioners revealed that the current AIGC lacks contextual information to adapt to varying application scenarios and is therefore limited in authoring. To utilize the strong generative power of GenAI to ease the authoring of AR instruction while capturing the context, we developed CARING-AI, an AR system to author context-aware humanoid-avatar-based instructions with GenAI. By navigating in the environment, users naturally provide contextual information to generate humanoid-avatar animation as AR instructions that blend in the context spatially and temporally. We showcased three application scenarios of CARING-AI: Asynchronous Instructions, Remote Instructions, and Ad Hoc Instructions based on a design space of AIGC in AR Instructions. With two user studies (N=12), we assessed the system usability of CARING-AI and demonstrated the easiness and effectiveness of authoring with Gen-AI.
