Creative Blends of Visual Concepts
Zhida Sun, Zhenyao Zhang, Yue Zhang, Min Lu, Dani Lischinski, Daniel Cohen-Or, Hui Huang
TL;DR
This work introduces Creative Blends, a metaphor-informed AI system that maps abstract concepts to concrete objects via commonsense reasoning and combines them through attribute-based similarity to generate diverse visual blends. It integrates LLMs, ConceptNet, and CLIP-based similarity with sentiment analysis to produce structured blending prompts, which are then realized as images with DALL·E 3, all presented in an interpretable visualization interface. A formative study informs design goals, while a within-subject evaluation with 24 participants demonstrates improved usability, reduced cognitive load, and enhanced creativity and metaphorical richness compared to a GPT-3.5+DALL·E 3 baseline. The findings underscore the potential of semantic-attribute grounded blending for ideation and offer insights into user preferences, generalization, and the integration of generative AI in design workflows. Limitations include scope to ideation and short-term evaluation, with future work focusing on longer studies, prompt-editing controls, and refinement-stage support to further empower designers."
Abstract
Visual blends combine elements from two distinct visual concepts into a single, integrated image, with the goal of conveying ideas through imaginative and often thought-provoking visuals. Communicating abstract concepts through visual blends poses a series of conceptual and technical challenges. To address these challenges, we introduce Creative Blends, an AI-assisted design system that leverages metaphors to visually symbolize abstract concepts by blending disparate objects. Our method harnesses commonsense knowledge bases and large language models to align designers' conceptual intent with expressive concrete objects. Additionally, we employ generative text-to-image techniques to blend visual elements through their overlapping attributes. A user study (N=24) demonstrated that our approach reduces participants' cognitive load, fosters creativity, and enhances the metaphorical richness of visual blend ideation. We explore the potential of our method to expand visual blends to include multiple object blending and discuss the insights gained from designing with generative AI.
