Table of Contents
Fetching ...

A Taxonomy for Human-LLM Interaction Modes: An Initial Exploration

Jie Gao, Simret Araya Gebreegziabher, Kenny Tsu Wei Choo, Toby Jia-Jun Li, Simon Tangi Perrault, Thomas W. Malone

TL;DR

This paper identifies four phases of human-LLM interaction—planning, facilitating, iterating, and testing—and proposes a four-mode taxonomy (Mode 1: Standard Prompting, Mode 2: User Interface, Mode 3: Context-based, Mode 4: Agent Facilitator) to structure human-LLM collaboration. Through a two-stage systematic literature review of CHI/CSCW/UIST/IUI publications since 2021, the authors develop a primary taxonomy using a 5W1H-inspired framework (Who, What, When, How) and then systematically annotate and refine it across 73 papers. The resulting taxonomy captures a wide range of interaction patterns, including submodes such as UI for structured prompts, UI for varying output, explicit and implicit contexts, and team-focused agent facilitation, with examples drawn from contemporary LLM-enabled tools. The work provides a design-oriented lens to evaluate and compose human-LLM interactions beyond plain conversational prompting and points to future expansions across tasks and domains, including image/video generation and multi-venue literature reviews.

Abstract

With ChatGPT's release, conversational prompting has become the most popular form of human-LLM interaction. However, its effectiveness is limited for more complex tasks involving reasoning, creativity, and iteration. Through a systematic analysis of HCI papers published since 2021, we identified four key phases in the human-LLM interaction flow - planning, facilitating, iterating, and testing - to precisely understand the dynamics of this process. Additionally, we have developed a taxonomy of four primary interaction modes: Mode 1: Standard Prompting, Mode 2: User Interface, Mode 3: Context-based, and Mode 4: Agent Facilitator. This taxonomy was further enriched using the "5W1H" guideline method, which involved a detailed examination of definitions, participant roles (Who), the phases that happened (When), human objectives and LLM abilities (What), and the mechanics of each interaction mode (How). We anticipate this taxonomy will contribute to the future design and evaluation of human-LLM interaction.

A Taxonomy for Human-LLM Interaction Modes: An Initial Exploration

TL;DR

This paper identifies four phases of human-LLM interaction—planning, facilitating, iterating, and testing—and proposes a four-mode taxonomy (Mode 1: Standard Prompting, Mode 2: User Interface, Mode 3: Context-based, Mode 4: Agent Facilitator) to structure human-LLM collaboration. Through a two-stage systematic literature review of CHI/CSCW/UIST/IUI publications since 2021, the authors develop a primary taxonomy using a 5W1H-inspired framework (Who, What, When, How) and then systematically annotate and refine it across 73 papers. The resulting taxonomy captures a wide range of interaction patterns, including submodes such as UI for structured prompts, UI for varying output, explicit and implicit contexts, and team-focused agent facilitation, with examples drawn from contemporary LLM-enabled tools. The work provides a design-oriented lens to evaluate and compose human-LLM interactions beyond plain conversational prompting and points to future expansions across tasks and domains, including image/video generation and multi-venue literature reviews.

Abstract

With ChatGPT's release, conversational prompting has become the most popular form of human-LLM interaction. However, its effectiveness is limited for more complex tasks involving reasoning, creativity, and iteration. Through a systematic analysis of HCI papers published since 2021, we identified four key phases in the human-LLM interaction flow - planning, facilitating, iterating, and testing - to precisely understand the dynamics of this process. Additionally, we have developed a taxonomy of four primary interaction modes: Mode 1: Standard Prompting, Mode 2: User Interface, Mode 3: Context-based, and Mode 4: Agent Facilitator. This taxonomy was further enriched using the "5W1H" guideline method, which involved a detailed examination of definitions, participant roles (Who), the phases that happened (When), human objectives and LLM abilities (What), and the mechanics of each interaction mode (How). We anticipate this taxonomy will contribute to the future design and evaluation of human-LLM interaction.
Paper Structure (30 sections, 4 figures, 3 tables)