A Regret Analysis of Bilateral Trade
Nicolò Cesa-Bianchi, Tommaso Cesari, Roberto Colomboni, Federico Fusco, Stefano Leonardi
TL;DR
This work initiates a regret-minimization view of bilateral trade. Against the backdrop of the Myerson–Satterthwaite impossibility result, it studies fixed-price mechanisms that are learned online, with no prior knowledge of the valuations. It considers two feedback models, full feedback (direct revelation) and realistic feedback (posted prices), and gives tight regret characterizations across stochastic and adversarial settings. The authors design algorithms (Follow the Best Price and Scouting Bandits) that achieve sublinear regret where possible, and they prove matching lower bounds via Embedding and Simulation lemmas that connect to partial-monitoring lower bounds. The regret is $\tilde{\Theta}(\sqrt{T})$ under full feedback and $\tilde{\Theta}(T^{2/3})$ under realistic feedback when seller and buyer valuations are independent with bounded densities (iv+bd). It becomes linear under realistic feedback when either of these two assumptions is dropped, and in the adversarial setting. Together, these results map out when online bilateral trade without priors is feasible. They have practical implications for designing simple, budget-balanced, incentive-compatible mechanisms that learn good prices in online markets, and they highlight how strongly the achievable performance depends on feedback quality and on the independence of valuations.
Abstract
Bilateral trade, a fundamental topic in economics, models the problem of intermediating between two strategic agents, a seller and a buyer, willing to trade a good for which they hold private valuations. Despite the simplicity of this problem, a classical result by Myerson and Satterthwaite (1983) establishes the impossibility of designing a mechanism that is simultaneously efficient, incentive compatible, individually rational, and budget balanced. This impossibility result fostered an intense investigation of meaningful trade-offs between these desired properties. Much work has focused on approximately efficient fixed-price mechanisms, e.g., Blumrosen and Dobzinski (2014; 2016), Colini-Baldeschi et al. (2016), which have been shown to fully characterize strongly budget balanced and ex-post individually rational direct revelation mechanisms. All these results, however, assume either some knowledge of the priors of the seller/buyer valuations, or black-box access to some samples of the distributions, as in Dütting et al. (2021). In this paper, we cast for the first time the bilateral trade problem in a regret minimization framework over $T$ rounds of seller/buyer interactions, with no prior knowledge of the private seller/buyer valuations. Our main contribution is a complete characterization of the regret regimes for fixed-price mechanisms with different models of feedback and private valuations, using as benchmark the best fixed price in hindsight. More precisely, we prove the following bounds on the regret:
• $\widetilde{\Theta}(\sqrt{T})$ for full feedback (i.e., direct revelation mechanisms);
• $\widetilde{\Theta}(T^{2/3})$ for realistic feedback (i.e., posted-price mechanisms) and independent seller/buyer valuations with bounded densities;
• $\Theta(T)$ for realistic feedback and seller/buyer valuations with bounded densities;
• $\Theta(T)$ for realistic feedback and independent seller/buyer valuations;
• $\Theta(T)$ for the adversarial setting.
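
The benchmark used in the abstract, the best fixed price in hindsight, can be made concrete with a small sketch. It is not the authors' algorithm and it computes realized rather than expected regret. It assumes the standard gain-from-trade reward, under which a posted price $p$ earns $b - s$ when $s \le p \le b$ and nothing otherwise, and it assumes that realistic feedback reveals only whether each agent accepts the price. The function names (`gain_from_trade`, `realistic_feedback`, `best_fixed_price`, `regret`) are hypothetical and chosen for illustration.

```python
from typing import Sequence, Tuple

Valuation = Tuple[float, float]  # (seller valuation, buyer valuation)


def gain_from_trade(price: float, seller: float, buyer: float) -> float:
    """Assumed reward: the trade happens only if seller <= price <= buyer."""
    return buyer - seller if seller <= price <= buyer else 0.0


def realistic_feedback(price: float, seller: float, buyer: float) -> Tuple[bool, bool]:
    """Assumed posted-price feedback: only whether each agent accepts the price."""
    return seller <= price, price <= buyer


def best_fixed_price(valuations: Sequence[Valuation]) -> Tuple[float, float]:
    """Return (price, total gain) of the best fixed price in hindsight.

    The total gain is a sum of non-negatively weighted indicators of closed
    intervals [seller, buyer], so some observed valuation is always optimal.
    """
    candidates = [v for pair in valuations for v in pair]
    totals = [
        (sum(gain_from_trade(p, s, b) for s, b in valuations), p)
        for p in candidates
    ]
    best_total, best_price = max(totals)
    return best_price, best_total


def regret(posted_prices: Sequence[float], valuations: Sequence[Valuation]) -> float:
    """Realized regret of the posted prices against the best fixed price."""
    _, best_total = best_fixed_price(valuations)
    earned = sum(
        gain_from_trade(p, s, b) for p, (s, b) in zip(posted_prices, valuations)
    )
    return best_total - earned


if __name__ == "__main__":
    rounds = [(0.2, 0.8), (0.5, 0.6), (0.7, 0.9)]
    print(best_fixed_price(rounds))
    print(regret([0.5, 0.5, 0.5], rounds))
```

Under full feedback the learner would see each `(seller, buyer)` pair after the round and could evaluate `gain_from_trade` for every candidate price. Under the assumed realistic feedback it sees only the two booleans returned by `realistic_feedback`, which do not reveal the gain it earned. This is one way to read the gap between the $\sqrt{T}$ and $T^{2/3}$ rates.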
