ClassComet: Exploring and Designing AI-generated Danmaku in Educational Videos to Enhance Online Learning
Zipeng Ji, Pengcheng An, Jian Zhao
TL;DR
This work tackles the challenge of scarce and variable-quality danmaku in educational videos by leveraging large multimodal models (LMMs) to automatically generate high-quality, classroom-relevant danmaku. It identifies desirable content- and emotion-related danmaku through a formative study and delivers ClassComet, a platform that uses clip- and text-level video understanding, virtual personas, and a structured prompt template to produce diverse danmaku. In a controlled study, AI-generated danmaku—especially when combining content- and emotion-related types—improved engagement and learning outcomes, with quality metrics approaching those of human-created danmaku on several dimensions. The results demonstrate the feasibility and value of AI-generated danmaku for scalable, consistent, and effective educational video experiences, while outlining limitations and future avenues such as real-time generation, personalized personas, and longitudinal effects.
Abstract
Danmaku, users' live comments synchronized with, and overlaying on videos, has recently shown potential in promoting online video-based learning. However, user-generated danmaku can be scarce-especially in newer or less viewed videos and its quality is unpredictable, limiting its educational impact. This paper explores how large multimodal models (LMM) can be leveraged to automatically generate effective, high-quality danmaku. We first conducted a formative study to identify the desirable characteristics of content- and emotion-related danmaku in educational videos. Based on the obtained insights, we developed ClassComet, an educational video platform with novel LMM-driven techniques for generating relevant types of danmaku to enhance video-based learning. Through user studies, we examined the quality of generated danmaku and their influence on learning experiences. The results indicate that our generated danmaku is comparable to human-created ones, and videos with both content- and emotion-related danmaku showed significant improvement in viewers' engagement and learning outcome.
