Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models

Alejandro Lopez-Lira; Yuehua Tang

Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models

Alejandro Lopez-Lira, Yuehua Tang

TL;DR

This paper demonstrates that off-the-shelf large language models, particularly GPT-4, can forecast the immediate stock-market reaction to news headlines and anticipate short-horizon price drift, especially for small and negatively toned news. It establishes a theoretical framework linking LLM processing capacity to information frictions and limits to arbitrage, showing a threshold model where only sufficiently sophisticated LLMs generate profitable signals. Empirically, GPT-4's predictions produce high initial hit rates and meaningful drift, with model size and news complexity shaping performance; the results also reveal heterogeneous processing across news topics. The findings imply that AI-driven enhancement of information processing can improve market efficiency over time, though adoption dynamics may initially sustain some return predictability. The paper contributes both a novel empirical instrument for studying market information processing and a formal model connecting AI capabilities to price formation and efficiency.

Abstract

We document the capability of large language models (LLMs) like ChatGPT to predict stock market reactions from news headlines without direct financial training. Using post-knowledge-cutoff headlines, GPT-4 captures initial market responses, achieving approximately 90% portfolio-day hit rates for the non-tradable initial reaction. GPT-4 scores also significantly predict the subsequent drift, especially for small stocks and negative news. Forecasting ability generally increases with model size, suggesting that financial reasoning is an emerging capacity of complex LLMs. Strategy returns decline as LLM adoption rises, consistent with improved price efficiency. To rationalize these findings, we develop a theoretical model that incorporates LLM technology, information-processing capacity constraints, underreaction, and limits to arbitrage.

Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models

TL;DR

Abstract

Paper Structure (66 sections, 13 theorems, 96 equations, 14 figures, 18 tables)

This paper contains 66 sections, 13 theorems, 96 equations, 14 figures, 18 tables.

Introduction
Institutional Background
Conceptual Framework: LLMs, Information Processing, and Market Dynamics
Key Ingredients
LLMs as Information Processors
Mechanism of Predictability
Key Theoretical Insights
Data
ChatGPT Prompt
ChatGPT and Market Information Processing
Can ChatGPT Forecast Stock Price Movements?
Post-Announcement Drift and Predictive Regressions
Information Processing Across News Types
Comparing Different Large Language Models
Performance Across Models
...and 51 more sections

Key Result

Proposition B.1

Mispricing is

Figures (14)

Figure 1: Overnight News Returns: Before and After the Release Time
Figure 2: Intraday News Returns: Before and After the Release Time
Figure 3: Cumulative Returns of Investing $1 (Without Transaction Costs)
Figure 4: Cumulative Returns of Investing $1 With Different Transaction Costs
Figure 5: Cumulative Returns of Investing $1 With Different Sample Restrictions
...and 9 more figures

Theorems & Definitions (22)

Proposition B.1
Theorem 1
Proposition B.2
Proposition B.3
Theorem 2
Proposition B.4
Proposition B.5
Lemma F.1
proof
Lemma K.1: Posterior Beliefs
...and 12 more

Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models

TL;DR

Abstract

Can ChatGPT Forecast Stock Price Movements? Return Predictability and Large Language Models

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (14)

Theorems & Definitions (22)