How Do Analysts Understand and Verify AI-Assisted Data Analyses?
Ken Gu, Ruoxi Shang, Tim Althoff, Chenglong Wang, Steven M. Drucker
TL;DR
The paper addresses the challenge of validating AI-assisted data analyses, where LLMs translate natural language prompts into data operations. It introduces a design probe that surfaces natural language explanations, code, and interactive data artifacts, and reports a qualitative study with 22 professional analysts to reveal verification workflows. Key findings show analysts routinely alternate between procedure-oriented and data-oriented verification, using artifacts from both domains to support sensemaking and provenance. The work offers concrete recommendations for analysts and tool designers to improve verification, including clarifying AI assumptions, connecting data and procedure artifacts, and integrating AI guidance into verification workflows.
Abstract
Data analysis is challenging as it requires synthesizing domain knowledge, statistical expertise, and programming skills. Assistants powered by large language models (LLMs), such as ChatGPT, can assist analysts by translating natural language instructions into code. However, AI-assistant responses and analysis code can be misaligned with the analyst's intent or be seemingly correct but lead to incorrect conclusions. Therefore, validating AI assistance is crucial and challenging. Here, we explore how analysts understand and verify the correctness of AI-generated analyses. To observe analysts in diverse verification approaches, we develop a design probe equipped with natural language explanations, code, visualizations, and interactive data tables with common data operations. Through a qualitative user study (n=22) using this probe, we uncover common behaviors within verification workflows and how analysts' programming, analysis, and tool backgrounds reflect these behaviors. Additionally, we provide recommendations for analysts and highlight opportunities for designers to improve future AI-assistant experiences.
