StoryDiffusion: How to Support UX Storyboarding With Generative-AI
Zhaohui Liang, Xiaoyu Zhang, Kevin Ma, Zhao Liu, Xipei Ren, Kosa Goucher-Lambert, Can Liu
TL;DR
StoryDiffusion tackles the problem of enabling end-to-end UX storyboard creation by integrating GPT-4-based narrative processing with Stable Diffusion image generation in a single workflow. It introduces a three-step prompting pipeline that segments narratives into frames and generates consistent, editable prompts to produce coherent storyboard frames, with a flexible interface for narrative and style adjustments. A user study with 12 UX design students reveals two creative strategies (user-directed and AI-directed) and shows improved ideation and visualization, along with requirements for narrative clarity, visual continuity, and human-AI communication. The work contributes a practical, full-stack prompting solution and provides design implications for AI-assisted storytelling tools, highlighting the trade-offs between precision and exploratory creativity and outlining directions for future enhancements in accuracy, continuity, and editing capabilities.
Abstract
Storyboarding is an established method for designing user experiences. Generative AI can support this process by helping designers quickly create visual narratives. However, existing tools only focus on accurate text-to-image generation. Currently, it is not clear how to effectively support the entire creative process of storyboarding and how to develop AI-powered tools to support designers' individual workflows. In this work, we iteratively developed and implemented StoryDiffusion, a system that integrates text-to-text and text-to-image models, to support the generation of narratives and images in a single pipeline. With a user study, we observed 12 UX designers using the system for both concept ideation and illustration tasks. Our findings identified AI-directed vs. user-directed creative strategies in both tasks and revealed the importance of supporting the interchange between narrative iteration and image generation. We also found effects of the design tasks on their strategies and preferences, providing insights for future development.
