RetAssist: Facilitating Vocabulary Learners with Generative Images in Story Retelling Practices
Qiaoyi Chen, Siyu Liu, Kaihui Huang, Xingbo Wang, Xiaojuan Ma, Junkai Zhu, Zhenhui Peng
TL;DR
RetAssist addresses vocabulary learning for ESL learners by pairing story based input with generated images to reduce cognitive load and improve recall of target word usage. The authors build a sentence-level image generation workflow using Stable-Diffusion-v1-5, CLIP similarity, and cartoon style transfer guided by CTML and BDCT principles. They validate the approach with a within-subjects study (N=24) comparing RetAssist to a baseline, finding gains in fluency and positive user perceptions, and derive five design principles to guide future systems. The work demonstrates the feasibility and educational value of integrating generative AIs into vocabulary practice and outlines broader implications for AI assisted education.
Abstract
Reading and repeatedly retelling a short story is a common and effective approach to learning the meanings and usages of target words. However, learners often struggle with comprehending, recalling, and retelling the story contexts of these target words. Inspired by the Cognitive Theory of Multimedia Learning, we propose a computational workflow to generate relevant images paired with stories. Based on the workflow, we work with learners and teachers to iteratively design an interactive vocabulary learning system named RetAssist. It can generate sentence-level images of a story to facilitate the understanding and recall of the target words in the story retelling practices. Our within-subjects study (N=24) shows that compared to a baseline system without generative images, RetAssist significantly improves learners' fluency in expressing with target words. Participants also feel that RetAssist eases their learning workload and is more useful. We discuss insights into leveraging text-to-image generative models to support learning tasks.
