AI-Instruments: Embodying Prompts as Instruments to Abstract & Reflect Graphical Interface Commands as General-Purpose Tools
Nathalie Riche, Anna Offenwanger, Frederic Gmeiner, David Brown, Hugo Romat, Michel Pahud, Nicolai Marquardt, Kori Inkpen, Ken Hinckley
TL;DR
The paper tackles the challenge that chat-based prompts produce linear, hard-to-refine interactions for creative design with generative AI. It extends the instrumental interaction model by introducing AI-Instruments—reification of user intent, reflection, and grounding—and demonstrates four technology probes for image generation. Through a qualitative study with 12 participants, it shows that these instruments support non-linear exploration, direct manipulation, and richer intent formulation and resolution than traditional prompting. The work contributes a general interaction framework, four concrete instruments (Fragments, Transformative Lenses, Generative Containers, Fillable Brushes), and the notion of meta-instruments (Palettes) to organize complex instrument collections, with implications for broader AI-enabled creative workflows.
Abstract
Chat-based prompts respond with verbose linear-sequential texts, making it difficult to explore and refine ambiguous intents, back up and reinterpret, or shift directions in creative AI-assisted design work. AI-Instruments instead embody "prompts" as interface objects via three key principles: (1) Reification of user-intent as reusable direct-manipulation instruments; (2) Reflection of multiple interpretations of ambiguous user-intents (Reflection-in-intent) as well as the range of AI-model responses (Reflection-in-response) to inform design "moves" towards a desired result; and (3) Grounding to instantiate an instrument from an example, result, or extrapolation directly from another instrument. Further, AI-Instruments leverage LLM's to suggest, vary, and refine new instruments, enabling a system that goes beyond hard-coded functionality by generating its own instrumental controls from content. We demonstrate four technology probes, applied to image generation, and qualitative insights from twelve participants, showing how AI-Instruments address challenges of intent formulation, steering via direct manipulation, and non-linear iterative workflows to reflect and resolve ambiguous intents.
