Can ChatGPT Overcome Behavioral Biases in the Financial Sector? Classify-and-Rethink: Multi-Step Zero-Shot Reasoning in the Gold Investment

Shuoling Liu; Gaoguo Jia; Yuhang Jiang; Liyuan Chen; Qiang Yang

Can ChatGPT Overcome Behavioral Biases in the Financial Sector? Classify-and-Rethink: Multi-Step Zero-Shot Reasoning in the Gold Investment

Shuoling Liu, Gaoguo Jia, Yuhang Jiang, Liyuan Chen, Qiang Yang

TL;DR

The paper addresses how to mitigate framing-induced behavioral biases in financial decision-making by leveraging ChatGPT with a novel two-stage prompting framework called Classify-and-Rethink (CAR). CAR first classifies gold-news items into six categories and scores them, then prompts the model to rethink the scoring from a long-term perspective, reducing susceptibility to framing effects. Back-tested on Au9999.SGE data and dyhjw.com gold news (2018–2023), CAR outperforms baseline strategies with an 80.35% return and a Sharpe ratio of 1.071, while producing a near-normal distribution of scores that suggests more stable reasoning. The work demonstrates that structured prompt design can enhance financial reasoning in LLMs and offers a foundation for future, ethically-aware deployment of AI-driven investment analysis. Overall, CAR provides a practical, explainable approach for using LLMs to generate investment opinions while mitigating common behavioral biases in financial markets.

Abstract

Large Language Models (LLMs) have achieved remarkable success recently, displaying exceptional capabilities in creating understandable and organized text. These LLMs have been utilized in diverse fields, such as clinical research, where domain-specific models like Med-Palm have achieved human-level performance. Recently, researchers have employed advanced prompt engineering to enhance the general reasoning ability of LLMs. Despite the remarkable success of zero-shot Chain-of-Thoughts (CoT) in solving general reasoning tasks, the potential of these methods still remains paid limited attention in the financial reasoning task.To address this issue, we explore multiple prompt strategies and incorporated semantic news information to improve LLMs' performance on financial reasoning tasks.To the best of our knowledge, we are the first to explore this important issue by applying ChatGPT to the gold investment.In this work, our aim is to investigate the financial reasoning capabilities of LLMs and their capacity to generate logical and persuasive investment opinions. We will use ChatGPT, one of the most powerful LLMs recently, and prompt engineering to achieve this goal. Our research will focus on understanding the ability of LLMs in sophisticated analysis and reasoning within the context of investment decision-making. Our study finds that ChatGPT with CoT prompt can provide more explainable predictions and overcome behavioral biases, which is crucial in finance-related tasks and can achieve higher investment returns.

Can ChatGPT Overcome Behavioral Biases in the Financial Sector? Classify-and-Rethink: Multi-Step Zero-Shot Reasoning in the Gold Investment

TL;DR

Abstract

Can ChatGPT Overcome Behavioral Biases in the Financial Sector? Classify-and-Rethink: Multi-Step Zero-Shot Reasoning in the Gold Investment

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)