Table of Contents
Fetching ...

The role of interface design on prompt-mediated creativity in Generative AI

Maddalena Torricelli, Mauro Martino, Andrea Baronchelli, Luca Maria Aiello

TL;DR

The paper investigates how interface design shapes prompt-driven creativity in Generative AI by analyzing 145,000+ prompts from two platforms with different interaction models (DiffusionDB and Pick-a-Pic). It applies time-aware prompt similarity (via Jaccard) and image similarity (via DinoV2 embeddings, cosine similarity) and defines topical variation using $P(variation) = \frac{n_{variation}}{n_{prompt}-1}$ to quantify exploration versus exploitation. Findings show that prompting-only interactions promote broad exploration and topic shifts, while features that generate image variants without prompting reduce exploration and prompt complexity, constraining creative breadth. These results offer concrete guidance for designing AI art interfaces that preserve opportunities for learning and diverse ideation while maintaining accessibility.

Abstract

Generative AI for the creation of images is becoming a staple in the toolkit of digital artists and visual designers. The interaction with these systems is mediated by \emph{prompting}, a process in which users write a short text to describe the desired image's content and style. The study of prompts offers an unprecedented opportunity to gain insight into the process of human creativity. Yet, our understanding of how people use them remains limited. We analyze more than 145,000 prompts from the logs of two Generative AI platforms (Stable Diffusion and Pick-a-Pic) to shed light on how people \emph{explore} new concepts over time, and how their exploration might be influenced by different design choices in human-computer interfaces to Generative AI. We find that users exhibit a tendency towards exploration of new topics over exploitation of concepts visited previously. However, a comparative analysis of the two platforms, which differ both in scope and functionalities, reveals some stark differences. Features diverting user focus from prompting and providing instead shortcuts for quickly generating image variants are associated with a considerable reduction in both exploration of novel concepts and detail in the submitted prompts. These results carry direct implications for the design of human interfaces to Generative AI and raise new questions regarding how the process of prompting should be aided in ways that best support creativity.

The role of interface design on prompt-mediated creativity in Generative AI

TL;DR

The paper investigates how interface design shapes prompt-driven creativity in Generative AI by analyzing 145,000+ prompts from two platforms with different interaction models (DiffusionDB and Pick-a-Pic). It applies time-aware prompt similarity (via Jaccard) and image similarity (via DinoV2 embeddings, cosine similarity) and defines topical variation using to quantify exploration versus exploitation. Findings show that prompting-only interactions promote broad exploration and topic shifts, while features that generate image variants without prompting reduce exploration and prompt complexity, constraining creative breadth. These results offer concrete guidance for designing AI art interfaces that preserve opportunities for learning and diverse ideation while maintaining accessibility.

Abstract

Generative AI for the creation of images is becoming a staple in the toolkit of digital artists and visual designers. The interaction with these systems is mediated by \emph{prompting}, a process in which users write a short text to describe the desired image's content and style. The study of prompts offers an unprecedented opportunity to gain insight into the process of human creativity. Yet, our understanding of how people use them remains limited. We analyze more than 145,000 prompts from the logs of two Generative AI platforms (Stable Diffusion and Pick-a-Pic) to shed light on how people \emph{explore} new concepts over time, and how their exploration might be influenced by different design choices in human-computer interfaces to Generative AI. We find that users exhibit a tendency towards exploration of new topics over exploitation of concepts visited previously. However, a comparative analysis of the two platforms, which differ both in scope and functionalities, reveals some stark differences. Features diverting user focus from prompting and providing instead shortcuts for quickly generating image variants are associated with a considerable reduction in both exploration of novel concepts and detail in the submitted prompts. These results carry direct implications for the design of human interfaces to Generative AI and raise new questions regarding how the process of prompting should be aided in ways that best support creativity.
Paper Structure (8 sections, 3 equations, 8 figures)

This paper contains 8 sections, 3 equations, 8 figures.

Figures (8)

  • Figure 1: User interface of Pick-a-Pic (left) and Stable Diffusion (right).
  • Figure 2: To identify topical transitions in prompting, we calculate a similarity matrix between pairs of prompts sorted by their submission time (a), binarize the matrix (b), and identify blocks of highly-similar, consecutive prompts around the matrix diagonal (c).
  • Figure 3: Distribution of number of prompts (a) and unique prompts (b) per user. Total number of unique prompts submitted after $n$ interactions (c).
  • Figure 4: Probability distributions of image similarity for: (a) all pairs of different prompts; (b) all pairs of identical prompts within the same prompt sequence.
  • Figure 5: Distribution of similarity of consecutive prompts (a) and consecutive images (b) in a user sequence.
  • ...and 3 more figures