CollabCoder: A Lower-barrier, Rigorous Workflow for Inductive Collaborative Qualitative Analysis with Large Language Models
Jie Gao, Yuchen Guo, Gionnieve Lim, Tianqin Zhang, Zheng Zhang, Toby Jia-Jun Li, Simon Tangi Perrault
TL;DR
CollabCoder introduces a three-phase, inductive CQA workflow that integrates LLMs to support independent open coding, iterative discussion, and codebook generation. The system emphasizes coders' independence, explicit decision-making documentation, and quantitative metrics to surface (dis)agreements, aiming for rigorous yet accessible qualitative analysis. An empirical evaluation with 16 participants shows CollabCoder improves learning curve, fosters mutual understanding, and enhances discussion efficiency relative to Atlas.ti Web, while highlighting challenges around autonomy, AI reliance, and feature usefulness. The work contributes design guidelines, an end-to-end AI-assisted CQA workflow, and practical insights on human-AI collaboration in qualitative analysis, with implications for scalable, rigorous qualitative research workflows.
Abstract
Collaborative Qualitative Analysis (CQA) can enhance qualitative analysis rigor and depth by incorporating varied viewpoints. Nevertheless, ensuring a rigorous CQA procedure itself can be both demanding and costly. To lower this bar, we take a theoretical perspective to design the CollabCoder workflow, that integrates Large Language Models (LLMs) into key inductive CQA stages: independent open coding, iterative discussions, and final codebook creation. In the open coding phase, CollabCoder offers AI-generated code suggestions and records decision-making data. During discussions, it promotes mutual understanding by sharing this data within the coding team and using quantitative metrics to identify coding (dis)agreements, aiding in consensus-building. In the code grouping stage, CollabCoder provides primary code group suggestions, lightening the cognitive load of finalizing the codebook. A 16-user evaluation confirmed the effectiveness of CollabCoder, demonstrating its advantages over existing software and providing empirical insights into the role of LLMs in the CQA practice.
