Table of Contents
Fetching ...

Autobidders with Budget and ROI Constraints: Efficiency, Regret, and Pacing Dynamics

Brendan Lucier, Sarath Pattathil, Aleksandrs Slivkins, Mengxiao Zhang

TL;DR

The paper studies competing autobidders with budget and ROI constraints in repeated auctions. It introduces a gradient-based constrained learning algorithm that operates with bandit feedback and guarantees feasibility. Key results show a $\tfrac{1}{2}$-approximation to the optimal liquid welfare and vanishing regret for each bidder, valid across $1$st-price, $2$nd-price, and intermediate auctions. Numerical experiments validate the theory and reveal pacing dynamics and practical efficiency improvements.

Abstract

We study a game between autobidding algorithms that compete in an online advertising platform. Each autobidder is tasked with maximizing its advertiser's total value over multiple rounds of a repeated auction, subject to budget and return-on-investment constraints. We propose a gradient-based learning algorithm that is guaranteed to satisfy all constraints and achieves vanishing individual regret. Our algorithm uses only bandit feedback and can be used with the first- or second-price auction, as well as with any "intermediate" auction format. Our main result is that when these autobidders play against each other, the resulting expected liquid welfare over all rounds is at least half of the expected optimal liquid welfare achieved by any allocation. This holds whether or not the bidding dynamics converges to an equilibrium.

Autobidders with Budget and ROI Constraints: Efficiency, Regret, and Pacing Dynamics

TL;DR

The paper studies competing autobidders with budget and ROI constraints in repeated auctions. It introduces a gradient-based constrained learning algorithm that operates with bandit feedback and guarantees feasibility. Key results show a -approximation to the optimal liquid welfare and vanishing regret for each bidder, valid across st-price, nd-price, and intermediate auctions. Numerical experiments validate the theory and reveal pacing dynamics and practical efficiency improvements.

Abstract

We study a game between autobidding algorithms that compete in an online advertising platform. Each autobidder is tasked with maximizing its advertiser's total value over multiple rounds of a repeated auction, subject to budget and return-on-investment constraints. We propose a gradient-based learning algorithm that is guaranteed to satisfy all constraints and achieves vanishing individual regret. Our algorithm uses only bandit feedback and can be used with the first- or second-price auction, as well as with any "intermediate" auction format. Our main result is that when these autobidders play against each other, the resulting expected liquid welfare over all rounds is at least half of the expected optimal liquid welfare achieved by any allocation. This holds whether or not the bidding dynamics converges to an equilibrium.
Paper Structure (12 sections)