Exploiting Primacy Effect To Improve Large Language Models
Bianca Raimondi, Maurizio Gabbrielli
TL;DR
The paper addresses positional biases in large language models, especially the primacy effect observed in MCQA tasks. It reveals that fine-tuning amplifies this bias compared to pre-trained models and introduces a training-free method that reorders answer options by semantic similarity to the query, exploiting primacy to boost accuracy. The approach uses a simple metric Sim$(O,Q)$ based on token-wise cosine similarity and a frozen encoder, enabling efficient deployment across datasets (CLINC, BANKING, HWU) and architectures (Llama2, Llama3, Mistral). Empirical results show consistent performance gains, with additional nuances from Recency bias in certain models, suggesting bias-aware strategies can enhance decision-making in biased evaluation environments. Overall, the work reframes bias as a lever for performance, offering practical, model-agnostic techniques and implications for bias-aware NLP design.
Abstract
Large Language Models (LLMs) have become essential in many Natural Language Processing (NLP) tasks, leveraging extensive pre-training and fine-tuning to achieve high accuracy. However, like humans, LLMs exhibit biases, particularly positional biases such as primacy and recency effects, which can influence the accuracy of the answers. The primacy effect-where items presented first are more likely to be remembered or selected-plays a key role in Multiple Choice Question Answering (MCQA), where the order of answer options can affect prediction outcomes. This study focuses on primacy bias in fine-tuned LLMs: We first show that fine-tuning amplifies this bias, probably due to exposure to human-like patterns. Hence, we strategically leverage this effect by reordering response options based on semantic similarity to the query, without requiring knowledge of the correct answer. Our experimental results show that this approach significantly improves performance in MCQA. More generally, our findings underscore the dual nature of biases as both challenges and opportunities, offering insights for bias-aware model design and NLP applications.
