Inferring Latent Market Forces: Evaluating LLM Detection of Gamma Exposure Patterns via Obfuscation Testing

Christopher Regan; Ying Xie

Inferring Latent Market Forces: Evaluating LLM Detection of Gamma Exposure Patterns via Obfuscation Testing

Christopher Regan, Ying Xie

TL;DR

The paper introduces obfuscation testing to assess whether large language models can reason about structural market mechanisms, specifically dealer hedging constraints tied to gamma exposure, rather than rely on temporal memorization. It implements a WHO→WHOM→WHAT causal framework across three dealer-hedging patterns using fully obfuscated SPY options data from 2024, achieving a 71.5% detection rate and a 91.2% forward-materialization accuracy. Detection reaches 100% with regime labels, illustrating the sensitivity to contextual prompts, while robust statistical validation (including Granger causality) supports genuine mechanistic understanding. The work demonstrates emergent structural reasoning capabilities in transformers, offers a rigorous validation methodology, and highlights implications for risk management, surveillance, and AI governance in financial markets.

Abstract

We introduce obfuscation testing, a novel methodology for validating whether large language models detect structural market patterns through causal reasoning rather than temporal association. Testing three dealer hedging constraint patterns (gamma positioning, stock pinning, 0DTE hedging) on 242 trading days (95.6% coverage) of S&P 500 options data, we find LLMs achieve 71.5% detection rate using unbiased prompts that provide only raw gamma exposure values without regime labels or temporal context. The WHO-WHOM-WHAT causal framework forces models to identify the economic actors (dealers), affected parties (directional traders), and structural mechanisms (forced hedging) underlying observed market dynamics. Critically, detection accuracy (91.2%) remains stable even as economic profitability varies quarterly, demonstrating that models identify structural constraints rather than profitable patterns. When prompted with regime labels, detection increases to 100%, but the 71.5% unbiased rate validates genuine pattern recognition. Our findings suggest LLMs possess emergent capabilities for detecting complex financial mechanisms through pure structural reasoning, with implications for systematic strategy development, risk management, and our understanding of how transformer architectures process financial market dynamics.

Inferring Latent Market Forces: Evaluating LLM Detection of Gamma Exposure Patterns via Obfuscation Testing

TL;DR

Abstract

Inferring Latent Market Forces: Evaluating LLM Detection of Gamma Exposure Patterns via Obfuscation Testing

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)