Revenue Maximization Under Sequential Price Competition Via The Estimation Of s-Concave Demand Functions
Daniele Bracale, Moulinath Banerjee, Cong Shi, Yuekai Sun
TL;DR
The paper tackles revenue maximization in dynamic, multi-seller price competition under unknown nonlinear demand. It introduces a tuning-free, shape-constrained semi-parametric policy leveraging monotone single-index models with $s$-concave link functions, estimated in two exploration phases before exploiting a learned best-response mapping. The authors establish existence and uniqueness of Nash equilibrium under these constraints, derive regret bounds with an optimal exploration parameter, and prove NE convergence rates, complemented by concentration results for the $s$-concave regression estimator. Numerical experiments validate the theoretical rates and demonstrate robustness to misspecification, while providing practical insights into dynamic, learning-enabled pricing with nonlinear demand. The framework advances nonparametric learning in strategic settings by combining shape constraints with online learning in a coherent equilibrium-focused analysis.
Abstract
We consider price competition among multiple sellers over a selling horizon of $T$ periods. In each period, sellers simultaneously offer their prices (which are made public) and subsequently observe their respective demand (not made public). The demand function of each seller depends on all sellers' prices through a private, unknown, and nonlinear relationship. We propose a dynamic pricing policy that uses semi-parametric least-squares estimation and show that when the sellers employ our policy, their prices converge at a rate of $O(T^{-1/7})$ to the Nash equilibrium prices that sellers would reach if they were fully informed. Each seller incurs a regret of $O(T^{5/7})$ relative to a dynamic benchmark policy. A theoretical contribution of our work is proving the existence of equilibrium under shape-constrained demand functions via the concept of $s$-concavity and establishing regret bounds of our proposed policy. Technically, we also establish new concentration results for the least squares estimator under shape constraints. Our findings offer significant insights into dynamic competition-aware pricing and contribute to the broader study of non-parametric learning in strategic decision-making.
