Dynamic Pricing and Advertising with Demand Learning

Shipra Agrawal; Yiding Feng; Wei Tang

Dynamic Pricing and Advertising with Demand Learning

Shipra Agrawal, Yiding Feng, Wei Tang

TL;DR

This work studies a monopolist’s joint design of dynamic pricing and fully flexible advertising (information signaling) to shape customer valuations. Using Bayesian persuasion as the information-design backbone, the authors quantify the value of advertising and show a tight universal bound Γ^* = 2 on the revenue gain from signaling, meaning advertising can at most double revenue in the worst case. For settings with unknown demand functions and valuations linear in product quality, they develop a computationally efficient online algorithm achieving a regret of order O$\left(T^{2/3}(m\log T)^{1/3}\right)$, where m is the cardinality of the quality space, without requiring Lipschitz/smoothness assumptions on demand. The algorithm leverages an instance-dependent discretization of the type space and a UCB-based estimate of demand, together with a novel rounding procedure to ensure feasibility of advertising strategies; improved results are obtained for additive valuations. Together, these results justify advertising as a revenue lever and provide practical, provably near-optimal learning strategies for pricing and signaling under demand uncertainty, with clear implications for markets where signaling product characteristics through advertising is feasible and privacy-sensitive pricing is desirable.

Abstract

We consider a novel pricing and advertising framework, where a seller not only sets product price but also designs flexible 'advertising schemes' to influence customers' valuation of the product. We impose no structural restriction on the seller's feasible advertising strategies and allow her to advertise the product by disclosing or concealing any information. Following the literature in information design, this fully flexible advertising can be modeled as the seller being able to choose any information policy that signals the product quality/characteristic to the customers. Customers observe the advertising signal and infer a Bayesian belief over the products. We aim to investigate two questions in this work: (1) What is the value of advertising? To what extent can advertising enhance a seller's revenue? (2) Without any apriori knowledge of the customers' demand function, how can a seller adaptively learn and optimize both pricing and advertising strategies using past purchase responses? To study the first question, we introduce and study the value of advertising - a revenue gap between using advertising vs not advertising, and we provide a crisp tight characterization for this notion for a broad family of problems. For the second question, we study the seller's dynamic pricing and advertising problem with demand uncertainty. Our main result for this question is a computationally efficient online algorithm that achieves an optimal $O(T^{2/3}(m\log T)^{1/3})$ regret rate when the valuation function is linear in the product quality. Here $m$ is the cardinality of the discrete product quality domain and $T$ is the time horizon. This result requires some mild regularity assumptions on the valuation function, but no Lipschitz or smoothness assumption on the customers' demand function. We also obtain several improved results for the widely considered special case of additive valuations.

Dynamic Pricing and Advertising with Demand Learning

TL;DR

, where m is the cardinality of the quality space, without requiring Lipschitz/smoothness assumptions on demand. The algorithm leverages an instance-dependent discretization of the type space and a UCB-based estimate of demand, together with a novel rounding procedure to ensure feasibility of advertising strategies; improved results are obtained for additive valuations. Together, these results justify advertising as a revenue lever and provide practical, provably near-optimal learning strategies for pricing and signaling under demand uncertainty, with clear implications for markets where signaling product characteristics through advertising is feasible and privacy-sensitive pricing is desirable.

Abstract

regret rate when the valuation function is linear in the product quality. Here

is the cardinality of the discrete product quality domain and

is the time horizon. This result requires some mild regularity assumptions on the valuation function, but no Lipschitz or smoothness assumption on the customers' demand function. We also obtain several improved results for the widely considered special case of additive valuations.

Paper Structure (26 sections, 27 theorems, 55 equations, 6 figures)

This paper contains 26 sections, 27 theorems, 55 equations, 6 figures.

Introduction
The High-level Problem Formulation and Our Contributions
Problem Formulation
Advertising as the Revenue Lever
Two Illustrative Examples
The Value of Advertising
Dynamic Pricing and Advertising with Demand Learning
An Equivalent Reformulation for Tractable Advertising Strategies
Algorithm Design: Challenges and Ideas
Details of the Proposed Algorithm
Regret Bound of Algorithm 1 and Proof Overview
Proof Outline
A Rounding Procedure to Bound the Discretization Error
Estimation Error and Optimism
Improved regret bounds for additive valuations
...and 11 more sections

Key Result

Theorem 3.1

The universal VoA $\Gamma^* = 2$.

Figures (6)

Figure 1: Graphical illustration of binary product quality examples in \ref{['subsec:example']}.
Figure 2: Graphical illustration for Procedure \ref{['algo_construction_adver_general']}. Given the input price and advertising $(p, \rho)$, fix a posterior mean $q \in\mathsf{supp}(\rho)$ where $\{i'\in[m]: \rho_{i'}(q) > 0\} = \{i, j\}$ (drawn in black dashed line). According to the procedure, we first identify $x = \kappa(p, q)$, and $x^\dagger = \kappa(p^\dagger, q) \in ((z-1)\varepsilon, z\varepsilon)$ where the constructed price $p^\dagger$ is defined as in the procedure. We then find two posterior means $q_L, q_R$ (here $q_L \ge \bar{\omega}_i, q_R \le \bar{\omega}_j$) such that $\kappa(p^\dagger, q_L) = z\varepsilon$ and $\kappa(p^\dagger, q_R) = (z-1)\varepsilon$ (drawn in brown dashed line), and $\kappa(p^\dagger, q_R) < \kappa(p^\dagger, q_L) < \kappa(p, q)$.
Figure 3: Setting 1: compared to no-information advertising strategy and full-information advertising strategy
Figure 4: Setting 2: compared to no-information advertising strategy and full-information advertising strategy
Figure 5: Setting 1: compared to ETC algorithm with different pure exploration rounds
...and 1 more figures

Theorems & Definitions (47)

Remark 2.2
Theorem 3.1: Value of advertising
Definition 4.1: Critical type
Example 4.2
Proposition 4.0
Remark 4.3
Theorem 5.1: Regret upper bound
Proposition 5.2: Regret lower bound
Proposition 5.2
Lemma 5.3: FTX-22
...and 37 more

Dynamic Pricing and Advertising with Demand Learning

TL;DR

Abstract

Dynamic Pricing and Advertising with Demand Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (47)