Table of Contents
Fetching ...

Contextual Dynamic Pricing with Heterogeneous Buyers

Thodoris Lykouris, Sloan Nietert, Princewill Okoroafor, Chara Podimata, Julian Zimmert

TL;DR

This paper advances contextual dynamic pricing for heterogenous buyers by modeling buyer types as draws from an unknown distribution with finite support K★. It introduces an optimistic posterior sampling (OPS) framework, plus a perturbation-based extension (POPS) to handle infinite model classes, and establishes a near-optimal regret of $\tilde{O}(K★\sqrt{dT})$, with a matching lower bound $\Omega(\sqrt{K★ d T})$. A non-contextual refinement (ZoomV) achieves $\tilde{O}(\min\{\sqrt{K★T}, T^{2/3}\})$ via variance-aware zooming, while stronger ex-post type observability yields even tighter bounds $\tilde{O}(\sqrt{\min\{K★,d\}T})$. The work highlights how heterogeneity and feedback richness shape the difficulty of pricing under online learning, and it lays groundwork for scalable algorithms in practical settings with multiple buyer types and contextual information.

Abstract

We initiate the study of contextual dynamic pricing with a heterogeneous population of buyers, where a seller repeatedly posts prices (over $T$ rounds) that depend on the observable $d$-dimensional context and receives binary purchase feedback. Unlike prior work assuming homogeneous buyer types, in our setting the buyer's valuation type is drawn from an unknown distribution with finite support size $K_{\star}$. We develop a contextual pricing algorithm based on optimistic posterior sampling with regret $\widetilde{O}(K_{\star}\sqrt{dT})$, which we prove to be tight in $d$ and $T$ up to logarithmic terms. Finally, we refine our analysis for the non-contextual pricing case, proposing a variance-aware zooming algorithm that achieves the optimal dependence on $K_{\star}$.

Contextual Dynamic Pricing with Heterogeneous Buyers

TL;DR

This paper advances contextual dynamic pricing for heterogenous buyers by modeling buyer types as draws from an unknown distribution with finite support K★. It introduces an optimistic posterior sampling (OPS) framework, plus a perturbation-based extension (POPS) to handle infinite model classes, and establishes a near-optimal regret of , with a matching lower bound . A non-contextual refinement (ZoomV) achieves via variance-aware zooming, while stronger ex-post type observability yields even tighter bounds . The work highlights how heterogeneity and feedback richness shape the difficulty of pricing under online learning, and it lays groundwork for scalable algorithms in practical settings with multiple buyer types and contextual information.

Abstract

We initiate the study of contextual dynamic pricing with a heterogeneous population of buyers, where a seller repeatedly posts prices (over rounds) that depend on the observable -dimensional context and receives binary purchase feedback. Unlike prior work assuming homogeneous buyer types, in our setting the buyer's valuation type is drawn from an unknown distribution with finite support size . We develop a contextual pricing algorithm based on optimistic posterior sampling with regret , which we prove to be tight in and up to logarithmic terms. Finally, we refine our analysis for the non-contextual pricing case, proposing a variance-aware zooming algorithm that achieves the optimal dependence on .

Paper Structure

This paper contains 32 sections, 30 theorems, 101 equations, 1 table.

Key Result

Lemma 2.1

Fix any distribution $Q \in \Delta([0,1])$ and let $0 \leq p < p' \leq 1$. We then have $\mathsf{rev}_Q(p') - \mathsf{rev}_Q(p) \leq \mathsf{dem}_Q(p)(p'-p) \leq p'-p$.

Theorems & Definitions (48)

  • Lemma 2.1: One-sided Lipschitzness
  • Lemma 2.2: Pricing implication of Lévy metric bound
  • Theorem 3.2
  • Lemma 3.3
  • Lemma 3.4
  • Lemma 3.5
  • proof : Proof of \ref{['thm:OPS-regret-bd']}
  • Remark 3.6: Comparison to Thompson sampling
  • Remark 3.7: Relation with existing results
  • Theorem 3.8
  • ...and 38 more