Hedonic Prices and Quality Adjusted Price Indices Powered by AI
Patrick Bajari, Zhihao Cen, Victor Chernozhukov, Manoj Manukonda, Suhas Vijaykumar, Jin Wang, Ramon Huerta, Junbo Li, Ling Leng, George Monokroussos, Shan Wang
TL;DR
The paper tackles inflation measurement with high turnover, heterogeneous online goods by introducing AI-powered hedonic pricing that leverages unstructured text and image data to predict prices. It develops a multi-task neural network that produces a time-series of hedonic prices via a value embedding derived from BERT-based text and ResNet-50 image embeddings. The main contributions are (i) scalable, automatic generation of product characteristics from unstructured data, (ii) demonstration of high predictive accuracy ($R^2$ in hold-out $80\%$–$90\%$) and (iii) construction of Fisher hedonic price indices (FHPI) for apparel, showing quality-adjusted inflation closer to CPI than alternatives. The results suggest AI embeddings can power real-time, quality-adjusted price indices with practical advantages over traditional, manually curated hedonic models, and point to future work on better image utilization and explainability.The approach reframes hedonic pricing as a prediction problem: prices are regressed on product attributes $X_i$ derived from text and image embeddings, with the price function allowed to vary over time as $P_{it}=H_{it}=h_t(X_i)$. By using deep neural networks, the authors estimate $h_t$ nonlinearly and then quantify uncertainty via a hold-out linear stage on a fixed value embedding $V_i$, enabling standard inference. The empirical application to Amazon apparel demonstrates strong out-of-sample fit and yields FHPI that indicates a modest decline in apparel prices 2014–2019, in contrast to larger declines suggested by some online indices, while reducing chain-drift biases through long-horizon chaining. The study contributes to the literature by showing how AI-based embeddings can modernize hedonic price indices using electronic data, offering a scalable, transparent alternative to manual hedonic feature construction.
Abstract
We develop empirical models that efficiently process large amounts of unstructured product data (text, images, prices, quantities) to produce accurate hedonic price estimates and derived indices. To achieve this, we generate abstract product attributes (or ``features'') from descriptions and images using deep neural networks. These attributes are then used to estimate the hedonic price function. To demonstrate the effectiveness of this approach, we apply the models to Amazon's data for first-party apparel sales, and estimate hedonic prices. The resulting models have a very high out-of-sample predictive accuracy, with $R^2$ ranging from $80\%$ to $90\%$. Finally, we construct the AI-based hedonic Fisher price index, chained at the year-over-year frequency, and contrast it with the CPI and other electronic indices.
