Debiasing LLMs by Fine-tuning

Zhenyu Gao, Wenxi Jiang, Yutong Yan

Abstract

Prior research shows that large language models (LLMs) exhibit systematic extrapolation bias when forming predictions from both experimental and real-world data, and that prompt-based approaches appear limited in alleviating this bias. We propose a supervised fine-tuning (SFT) approach that uses Low-Rank Adaptation (LoRA) to train off-the-shelf LLMs on instruction datasets constructed from rational benchmark forecasts. By intervening at the parameter level, SFT changes how LLMs map observed information into forecasts and thereby mitigates extrapolation bias. We evaluate the fine-tuned model in two settings: controlled forecasting experiments and cross-sectional stock return prediction. In both settings, fine-tuning corrects extrapolation bias out-of-sample, establishing a low-cost and generalizable method for debiasing LLMs.

Paper Structure

This paper contains 35 sections, 3 equations, 6 figures, and 7 tables.

Figures (6)

  • Figure I: Pre-training and Instructional Tuning Pipeline
  • Figure II: Supervised Fine-Tuning (SFT) Pipeline
  • Figure III: LoRA Reparametrization
  • Figure IV: Debiasing Framework
  • Figure V: AR(1) Exercise: Base Model
  • ...and 1 more figure