Advancing GUI for Generative AI: Charting the Design Space of Human-AI Interactions through Task Creativity and Complexity
Zijian Ding
TL;DR
Advancing GUI for Generative AI examines how to design user interfaces that couple human expertise with Generative AI across creative and analytical tasks. It proposes a three-part plan—fixed-scope news headline curation, atomic cross-domain analogy generation, and complex data visualization—mapped to a human-AI interaction taxonomy to guide GUI design. The studies suggest guidance+selection often yields the best balance of quality and efficiency, while human oversight remains essential for safety, bias, and accuracy; complex tasks benefit from interactive, design-like canvases enabling rapid prototyping and multi-modal reasoning. The work provides a concrete design roadmap for mixed-initiative, multi-modal GUIs that enhance interpretability, collaboration, and control in Generative AI-enabled workflows.
Abstract
Technological progress has persistently shaped the dynamics of human-machine interactions in task execution. In response to the advancements in Generative AI, this paper outlines a detailed study plan that investigates various human-AI interaction modalities across a range of tasks, characterized by differing levels of creativity and complexity. This exploration aims to inform and contribute to the development of Graphical User Interfaces (GUIs) that effectively integrate with and enhance the capabilities of Generative AI systems. The study comprises three parts: exploring fixed-scope tasks through news headline generation, delving into atomic creative tasks with analogy generation, and investigating complex tasks via data visualization. Future work aims to extend this exploration to linearize complex data analysis results into narratives understandable to a broader audience, thereby enhancing the interpretability of AI-generated content.
