PromptMap: An Alternative Interaction Style for AI-Based Image Generation
Krzysztof Adamkiewicz, Paweł W. Woźniak, Julia Dominiak, Andrzej Romanowski, Jakob Karolus, Stanislav Frolov
TL;DR
PromptMap introduces a map-based, semantic-zoom interface to explore a vast, synthetic collection of prompts for text-to-image generation, addressing novices' prompting difficulties. It combines a large-scale synthetic dataset (over 10 million prompts) with a 2D density map, labels, and search to support structured, inspiration-driven exploration, rather than language-only prompt crafting. Through a between-subject quantitative study and a within-subject qualitative study, the authors show that presenting examples shifts user strategy toward example-driven exploration and that synthetic data generation can yield diverse, high-quality prompts with lower NSFW content than scraped datasets. The work contributes a new interaction paradigm, a scalable data-generation pipeline, and open resources (dataset and code) that can influence future interface design and cross-modal prompting tasks.
Abstract
Recent technological advances popularized the use of image generation among the general public. Crafting effective prompts can, however, be difficult for novice users. To tackle this challenge, we developed PromptMap, a new interaction style for text-to-image AI that allows users to freely explore a vast collection of synthetic prompts through a map-like view with semantic zoom. PromptMap groups images visually by their semantic similarity, allowing users to discover relevant examples. We evaluated PromptMap in a between-subject online study ($n=60$) and a qualitative within-subject study ($n=12$). We found that PromptMap supported users in crafting prompts by providing them with examples. We also demonstrated the feasibility of using LLMs to create vast example collections. Our work contributes a new interaction style that supports users unfamiliar with prompting in achieving a satisfactory image output.
