Primacy Effect of ChatGPT
Yiwei Wang, Yujun Cai, Muhao Chen, Yuxuan Liang, Bryan Hooi
TL;DR
The paper addresses whether ChatGPT inherits human primacy bias in discriminative NLU tasks by introducing label-order randomization in prompts and measuring prediction stability and label-index distributions. It leverages a label-shuffling methodology across diverse datasets (e.g., TACRED, TACRED-Revisit, Re-TACRED, Banking77, MASSIVE, GoEmotions, 20 Newsgroups) and compares ChatGPT with a BERT baseline, using a zero-temperature setting to minimize randomness. Key findings show a pronounced primacy effect in ChatGPT, with predictions shifting under label shuffling and a skew toward earlier labels, particularly on harder tasks; CoT does not fully mitigate this bias, raising fairness concerns quantified via Jensen–Shannon divergence. The study provides insights into reliability and fairness of ChatGPT-based NLU solutions and releases code to facilitate further research and mitigation strategies.
Abstract
Instruction-tuned large language models (LLMs), such as ChatGPT, have led to promising zero-shot performance in discriminative natural language understanding (NLU) tasks. This involves querying the LLM using a prompt containing the question, and the candidate labels to choose from. The question-answering capabilities of ChatGPT arise from its pre-training on large amounts of human-written text, as well as its subsequent fine-tuning on human preferences, which motivates us to ask: Does ChatGPT also inherits humans' cognitive biases? In this paper, we study the primacy effect of ChatGPT: the tendency of selecting the labels at earlier positions as the answer. We have two main findings: i) ChatGPT's decision is sensitive to the order of labels in the prompt; ii) ChatGPT has a clearly higher chance to select the labels at earlier positions as the answer. We hope that our experiments and analyses provide additional insights into building more reliable ChatGPT-based solutions. We release the source code at https://github.com/wangywUST/PrimacyEffectGPT.
