Dynamic pricing with Bayesian updates from online reviews
José Correa, Mathieu Mari, Andrew Xia
TL;DR
The paper studies dynamic pricing under Bayesian learning from online reviews, modeling a seller's revenue as a Bayesian bandit with latent product quality and a prior $x$ updated by like/dislike signals. It shows that the posterior after a sequence of reviews depends only on the counts $(\ell, d)$, enabling a combinatorial analysis via Catalan numbers and a Gittins-index perspective to compute the stopping threshold $x^*$ and the value $V(x)$. Two complementary solution methods are developed: a fast dynamic programming approach over the discrete prior set and a closed-form combinatorial approach that yields explicit expressions for $x^*$ and $V(x)$. The results quantify when dynamic pricing improves learning and revenue relative to static pricing and extend to multi-valued or continuous quality spaces with a tractable dynamic program, informing practical pricing policies on review-rich platforms.
Abstract
When launching new products, firms face uncertainty about market reception. Online reviews provide valuable information not only to consumers but also to firms, allowing firms to adjust the product characteristics, including its selling price. In this paper, we consider a pricing model with online reviews in which the quality of the product is uncertain, and both the seller and the buyers Bayesianly update their beliefs to make purchasing & pricing decisions. We model the seller's pricing problem as a basic bandits' problem and show a close connection with the celebrated Catalan numbers, allowing us to efficiently compute the overall future discounted reward of the seller. With this tool, we analyze and compare the optimal static and dynamic pricing strategies in terms of the probability of effectively learning the quality of the product.
