Contextual Dynamic Pricing with Heterogeneous Buyers

Thodoris Lykouris; Sloan Nietert; Princewill Okoroafor; Chara Podimata; Julian Zimmert

Contextual Dynamic Pricing with Heterogeneous Buyers

Thodoris Lykouris, Sloan Nietert, Princewill Okoroafor, Chara Podimata, Julian Zimmert

TL;DR

This paper advances contextual dynamic pricing for heterogenous buyers by modeling buyer types as draws from an unknown distribution with finite support K★. It introduces an optimistic posterior sampling (OPS) framework, plus a perturbation-based extension (POPS) to handle infinite model classes, and establishes a near-optimal regret of $\tilde{O}(K★\sqrt{dT})$, with a matching lower bound $\Omega(\sqrt{K★ d T})$. A non-contextual refinement (ZoomV) achieves $\tilde{O}(\min\{\sqrt{K★T}, T^{2/3}\})$ via variance-aware zooming, while stronger ex-post type observability yields even tighter bounds $\tilde{O}(\sqrt{\min\{K★,d\}T})$. The work highlights how heterogeneity and feedback richness shape the difficulty of pricing under online learning, and it lays groundwork for scalable algorithms in practical settings with multiple buyer types and contextual information.

Abstract

We initiate the study of contextual dynamic pricing with a heterogeneous population of buyers, where a seller repeatedly posts prices (over $T$ rounds) that depend on the observable $d$-dimensional context and receives binary purchase feedback. Unlike prior work assuming homogeneous buyer types, in our setting the buyer's valuation type is drawn from an unknown distribution with finite support size $K_{\star}$. We develop a contextual pricing algorithm based on optimistic posterior sampling with regret $\widetilde{O}(K_{\star}\sqrt{dT})$, which we prove to be tight in $d$ and $T$ up to logarithmic terms. Finally, we refine our analysis for the non-contextual pricing case, proposing a variance-aware zooming algorithm that achieves the optimal dependence on $K_{\star}$.

Contextual Dynamic Pricing with Heterogeneous Buyers

TL;DR

, with a matching lower bound

. A non-contextual refinement (ZoomV) achieves

via variance-aware zooming, while stronger ex-post type observability yields even tighter bounds

. The work highlights how heterogeneity and feedback richness shape the difficulty of pricing under online learning, and it lays groundwork for scalable algorithms in practical settings with multiple buyer types and contextual information.

Abstract

We initiate the study of contextual dynamic pricing with a heterogeneous population of buyers, where a seller repeatedly posts prices (over

rounds) that depend on the observable

-dimensional context and receives binary purchase feedback. Unlike prior work assuming homogeneous buyer types, in our setting the buyer's valuation type is drawn from an unknown distribution with finite support size

. We develop a contextual pricing algorithm based on optimistic posterior sampling with regret

, which we prove to be tight in

and

up to logarithmic terms. Finally, we refine our analysis for the non-contextual pricing case, proposing a variance-aware zooming algorithm that achieves the optimal dependence on

Contextual Dynamic Pricing with Heterogeneous Buyers

TL;DR

Abstract

Contextual Dynamic Pricing with Heterogeneous Buyers

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (48)