Selling Joint Ads: A Regret Minimization Perspective

Gagan Aggarwal; Ashwinkumar Badanidiyuru; Paul Dütting; Federico Fusco

Selling Joint Ads: A Regret Minimization Perspective

Gagan Aggarwal, Ashwinkumar Badanidiyuru, Paul Dütting, Federico Fusco

TL;DR

The paper tackles selling a single non-excludable good to two cooperating buyers in an online-learning setting, formalizing the Repeated Joint Ads problem and analyzing regret with respect to the best fixed DSIC/IR mechanism. It develops adaptive discretization approaches that transform the challenging mechanism space into tractable representations: orthogonal mechanisms for stochastic data and path-based discretizations for smooth adversaries. It delivers a stochastic upper bound of $\tilde{O}(T^{3/4})$ via Augment-the-Best-Mechanism and a $O(T^{2/3})$ bound via PathLearning in the $\sigma$-smooth regime, alongside lower bounds showing an $\Omega(\sqrt{T})$ baseline and an adversarial impossibility of sublinear regret; together they delineate a sharp separation between stochastic/smooth and adversarial settings. The work extends online-learning in economic design to non-excludable, multi-buyer settings, introduces new algorithmic tools (adaptive grids, augmented-grid mechanisms, and edge-based sampling for path-experts), and points to future extensions to larger coalitions and broader non-excludable mechanisms with potential real-world impact in collaborative advertising markets.

Abstract

Motivated by online retail, we consider the problem of selling one item (e.g., an ad slot) to two non-excludable buyers (say, a merchant and a brand). This problem captures, for example, situations where a merchant and a brand cooperatively bid in an auction to advertise a product, and both benefit from the ad being shown. A mechanism collects bids from the two and decides whether to allocate and which payments the two parties should make. This gives rise to intricate incentive compatibility constraints, e.g., on how to split payments between the two parties. We approach the problem of finding a revenue-maximizing incentive-compatible mechanism from an online learning perspective; this poses significant technical challenges. First, the action space (the class of all possible mechanisms) is huge; second, the function that maps mechanisms to revenue is highly irregular, ruling out standard discretization-based approaches. In the stochastic setting, we design an efficient learning algorithm achieving a regret bound of $O(T^{3/4})$. Our approach is based on an adaptive discretization scheme of the space of mechanisms, as any non-adaptive discretization fails to achieve sublinear regret. In the adversarial setting, we exploit the non-Lipschitzness of the problem to prove a strong negative result, namely that no learning algorithm can achieve more than half of the revenue of the best fixed mechanism in hindsight. We then consider the $σ$-smooth adversary; we construct an efficient learning algorithm that achieves a regret bound of $O(T^{2/3})$ and builds on a succinct encoding of exponentially many experts. Finally, we prove that no learning algorithm can achieve less than $Ω(\sqrt T)$ regret in both the stochastic and the smooth setting, thus narrowing the range where the minimax regret rates for these two problems lie.

Selling Joint Ads: A Regret Minimization Perspective

TL;DR

via Augment-the-Best-Mechanism and a

bound via PathLearning in the

-smooth regime, alongside lower bounds showing an

baseline and an adversarial impossibility of sublinear regret; together they delineate a sharp separation between stochastic/smooth and adversarial settings. The work extends online-learning in economic design to non-excludable, multi-buyer settings, introduces new algorithmic tools (adaptive grids, augmented-grid mechanisms, and edge-based sampling for path-experts), and points to future extensions to larger coalitions and broader non-excludable mechanisms with potential real-world impact in collaborative advertising markets.

Abstract

. Our approach is based on an adaptive discretization scheme of the space of mechanisms, as any non-adaptive discretization fails to achieve sublinear regret. In the adversarial setting, we exploit the non-Lipschitzness of the problem to prove a strong negative result, namely that no learning algorithm can achieve more than half of the revenue of the best fixed mechanism in hindsight. We then consider the

-smooth adversary; we construct an efficient learning algorithm that achieves a regret bound of

and builds on a succinct encoding of exponentially many experts. Finally, we prove that no learning algorithm can achieve less than

regret in both the stochastic and the smooth setting, thus narrowing the range where the minimax regret rates for these two problems lie.

Paper Structure (39 sections, 23 theorems, 54 equations, 6 figures, 3 algorithms)

This paper contains 39 sections, 23 theorems, 54 equations, 6 figures, 3 algorithms.

Introduction
Our Results
Challenges and Techniques
A Hard Instance
The Stochastic Algorithm: An Adaptive Grid.
The Smooth Algorithm: From Paths to Edges.
Lower Bounds.
Further Related Work
(Online) Learning of Economic Problems
Non-Excludable Mechanism Design.
The Learning Model
Structure of Incentive Compatible Mechanisms
A Geometric View.
Technical convention.
Regret Minimization
...and 24 more sections

Key Result

Proposition 1

Let $G = (V,E)$ be any orthogonal graph, and $\pi$ any complete path in $G$, then $M_{\pi} \in \mathcal{M}^{\perp}$. Conversely, for any $M \in \mathcal{M}^{\perp}$, there exists a complete path $\pi$ in some orthogonal graph such that $M = M_\pi$.

Figures (6)

Figure 1: The allocation regions of optimal mechanisms corresponding to different distributions. The payment functions can be visualized as the valuations projection on the allocation region's boundary. For instance, in \ref{['fig:convex']}, the first agent (x-axis) pays $p_1=0$, while the second agent (y-axis) pays $p_2 = 0.2$. The valuations are drawn i.i.d. from the distribution with cumulative density function $F(x) = 2(x+1)^{-2}$ in \ref{['fig:concave']} and $F(x) = x^2$ in \ref{['fig:convex']}. In \ref{['fig:complex']}, the distribution has support on the black points.
Figure 2: The distribution described in \ref{['ex:counter-example']} is supported on the red segment (with $\delta = 1/6$). The Figure also represents the uniform grid for $\varepsilon = 1/6$.
Figure 3: An orthogonal graph (left) and the rectangles $A_e$ for vertical and horizontal edges (center). Note that while the two figures are for a uniform grid, this is not needed for orthogonal mechanisms. On the right, there is a visualization of the augmentation procedure: $G_{1/2}$ (red nodes) is augmented with the three green points (note the auxiliary points in blue). These points have been chosen to mimic the procedure used in \ref{['lem:approx']} to approximate the mechanism whose allocation region is shaded in blue with the orthogonal one corresponding to the red complete path.
Figure 4: Visualization of the grid $G_\varepsilon$ for $\varepsilon = 1/6$. The shaded area represents the allocation region of some mechanism, while the red path is the corresponding inner hull.
Figure 5: The two squares support the family of distributions we use in the lower bound. In blue, respectively red, are reported the boundary of the allocation region of $M_1$, respectively $M_2$.
...and 1 more figures

Theorems & Definitions (68)

Example 1: Equal-revenue
Definition 1: HaghtalabRS21
Definition 2: Orthogonal Mechanisms
Definition 3: Orthogonal Graphs
Proposition 1
proof
Definition 4: Edge weights
Proposition 2
proof
Theorem 1
...and 58 more

Selling Joint Ads: A Regret Minimization Perspective

TL;DR

Abstract

Selling Joint Ads: A Regret Minimization Perspective

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (68)