ToMigo: Interpretable Design Concept Graphs for Aligning Generative AI with Creative Intent
Lena Hegemann, Xinyi Wen, Michael A. Hedderich, Tarmo Nurmi, Hariharan Subramonyam
TL;DR
ToMigo addresses the misalignment between user intent and generative AI outputs by introducing interpretable design concept graphs that encode both explicit and inferred design intent. It combines multimodal input analysis (reference images and briefs) with a four-role graph schema and reasoning-enabled edges to maintain coherence and support iterative refinement. The work introduces three interactions—theory-of-mind widgets, clarifying questions, and graph-guided design generation—that surface uncertainties, enable direct editing of AI reasoning, and realign outputs with updated concepts. Two user studies demonstrate high alignment between user intentions and the graph representations, and show that ToMigo enhances grounding and design exploration, offering practical benefits for novice designers and collaborative design workflows.
Abstract
Generative AI often produces results misaligned with user intentions, for example, resolving ambiguous prompts in unexpected ways. Despite existing approaches to clarify intent, a major challenge remains: understanding and influencing AI's interpretation of user intent through simple, direct inputs requiring no expertise or rigid procedures. We present ToMigo, representing intent as design concept graphs: nodes represent choices of purpose, content, or style, while edges link them with interpretable explanations. Applied to graphic design, ToMigo infers intent from reference images and text. We derived a schema of node types and edges from pre-study data, informing a multimodal large language model to generate graphs aligning nodes externally with user intent and internally toward a unified design goal. This structure enables users to explore AI reasoning and directly manipulate the design concept. In our user studies, ToMigo received high alignment ratings and captured most user intentions well. Users reported greater control and found interactive features-editable graphs, reflective chats, concept-design realignment-useful for evolving and realizing their design ideas.
