Interpretable Deep Learning for Stock Returns: A Consensus-Bottleneck Asset Pricing Model
Bong-Gyu Jang, Younwoo Jeong, Changeun Kim
TL;DR
The paper tackles the interpretability-accuracy tension in asset pricing by introducing CB-APM, a partially interpretable neural network that compresses firm and macro information into analyst-consensus-like latent states and then maps them to expected returns. By jointly training a nonlinear consensus bottleneck with a linear pricing stage, CB-APM achieves strong long-horizon prediction and provides transparent loadings that resemble priced components of analyst beliefs. Empirical results show substantial improvements in out-of-sample return forecasts, robust long-short portfolio profitability, and pricing content beyond traditional factor models, aided by macroeconomic embeddings learned via an autoencoder. The work demonstrates that integrating interpretable architecture with rational-expectations-based information aggregation can yield both predictive gains and economically meaningful insights, paving the way for more trustworthy, finance-grounded machine learning models.
Abstract
We introduce the \textit{Consensus-Bottleneck Asset Pricing Model} (CB-APM), a partially interpretable neural network that replicates the reasoning processes of sell-side analysts by capturing how dispersed investor beliefs are compressed into asset prices through a consensus formation process. By modeling this ``bottleneck'' to summarize firm- and macro-level information, CB-APM not only predicts future risk premiums of U.S. equities but also links belief aggregation to expected returns in a structurally interpretable manner. The model improves long-horizon return forecasts and outperforms standard deep learning approaches in both predictive accuracy and explanatory power. Comprehensive portfolio analyses show that CB-APM's out-of-sample predictions translate into economically meaningful payoffs, with monotonic return differentials and stable long-short performance across regularization settings. Empirically, CB-APM leverages consensus as a regularizer to amplify long-horizon predictability and yields interpretable consensus-based components that clarify how information is priced in returns. Moreover, regression and GRS-based pricing diagnostics reveal that the learned consensus representations capture priced variation only partially spanned by traditional factor models, demonstrating that CB-APM uncovers belief-driven structure in expected returns beyond the canonical factor space. Overall, CB-APM provides an interpretable and empirically grounded framework for understanding belief-driven return dynamics.
