Efficiency with Rigor! A Trustworthy LLM-powered Workflow for Qualitative Data Analysis
Jie Gao, Zhiyao Shu, Shun Yi Yeo, Alok Prakash, Chien-Ming Huang, Mark Dredze, Ziang Xiao
TL;DR
MindCoder addresses the challenge of balancing efficiency and trustworthiness in qualitative data analysis by delegating mechanical tasks to a transparent LLM workflow while preserving interpretive work for humans. It derives six design requirements from literature and formative interviews (DR1–DR6) and implements them in a web-based system with an auditable codebook trajectory. In a within-subject study against Atlas.ti AI Coding, MindCoder improved active interpretation, flexible control, and perceived trustworthiness of results, with user experiences varying by expertise. The work contributes a concrete, user-centered blueprint for human-AI collaboration in QDA and offers design implications for future LLM-powered qualitative analysis tools.
Abstract
Qualitative data analysis (QDA) emphasizes trustworthiness, requiring sustained human engagement and reflexivity. Recently, large language models (LLMs) have been applied in QDA to improve efficiency. However, their use raises concerns about unvalidated automation and displaced sensemaking, which can undermine trustworthiness. To address these issues, we employed two strategies: transparency and human involvement. Through a literature review and formative interviews, we identified six design requirements for transparent automation and meaningful human involvement. Guided by these requirements, we developed MindCoder, an LLM-powered workflow that delegates mechanical tasks, such as grouping and validation, to the system, while enabling humans to conduct meaningful interpretation. MindCoder also maintains comprehensive logs of users' step-by-step interactions to ensure transparency and support trustworthy results. In an evaluation with 12 users and two external evaluators, MindCoder supported active interpretation, offered flexible control, and produced more trustworthy codebooks. We further discuss design implications for building human-AI collaborative QDA workflows.
