Prompt Engineering for Large Language Model-assisted Inductive Thematic Analysis
Muhammad Talal Khalid, Ann-Perry Witmer
TL;DR
This paper addresses the need for methodological rigor in using large language models to support inductive thematic analysis by conducting a systematic literature review and proposing a structured four-step prompt engineering process. It maps current PE practices, identifies common strategies such as single- versus multi-prompt approaches, and discusses advanced techniques like decomposition, ensembling, and self-generated prompts to improve reliability and transparency. The authors emphasize that LLMs should augment rather than replace human researchers, and they outline ethical considerations and future research directions to establish replicable PE methods in ITA. Overall, the work provides a practical framework and guidelines to enhance the rigor, efficiency, and trustworthiness of LLM-assisted ITA research.
Abstract
The potential of large language models (LLMs) to mitigate the time- and cost- related challenges associated with inductive thematic analysis (ITA) has been extensively explored in the literature. However, the use of LLMs to support ITA has often been opportunistic, relying on ad hoc prompt engineering (PE) approaches, thereby undermining the reliability, transparency, and replicability of the analysis. The goal of this study is to develop a structured approach to PE in LLM-assisted ITA. To this end, a comprehensive review of the existing literature is conducted to examine how ITA researchers integrate LLMs into their workflows and, in particular, how PE is utilized to support the analytical process. Built on the insights generated from this review, four key steps for effective PE in LLM-assisted ITA are identified and extensively outlined. Furthermore, the study explores state-of-the-art PE techniques that can enhance the execution of these steps, providing ITA researchers with practical strategies to improve their analyses. In conclusion, the main contributions of this paper include: (i) it maps the existing research on LLM-assisted ITA to enable a better understanding of the rapidly developing field, (ii) it outlines a structured four-step PE process to enhance methodological rigor, (iii) it discusses the application of advanced PE techniques to support the execution of these steps, and (iv) it highlights key directions for future research.
