Table of Contents
Fetching ...

An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising

Zhijian Duan, Yusen Huo, Tianyu Wang, Zhilin Zhang, Yeshu Li, Chuan Yu, Jian Xu, Bo Zheng, Xiaotie Deng

TL;DR

This work tackles budget-constrained auto-bidding in online advertising by introducing ABPlanner, a few-shot adaptable budget planner that sits above a low-level auto-bidder in a hierarchical framework. ABPlanner models each advertiser's bidding episode as an MDP where a high-level budget plan over $m$ stages guides fast, per-episode adaptation using prompts from previous episodes (in-context reinforcement learning) and is trained with PPO. Through extensive pure and semi-simulation experiments and real-world A/B testing, ABPlanner consistently improves the cumulative value of auto-bidders and demonstrates rapid adaptation to unseen advertisers. The results highlight the practical viability of integrating a learned, sample-efficient budget planner into real-time bidding stacks, with potential future work on joint learning with the auto-bidder and per-stage dynamic planning.

Abstract

In online advertising, advertisers commonly utilize auto-bidding services to bid for impression opportunities. A typical objective of the auto-bidder is to optimize the advertiser's cumulative value of winning impressions within specified budget constraints. However, such a problem is challenging due to the complex bidding environment faced by diverse advertisers. To address this challenge, we introduce ABPlanner, a few-shot adaptable budget planner designed to improve budget-constrained auto-bidding. ABPlanner is based on a hierarchical bidding framework that decomposes the bidding process into shorter, manageable stages. Within this framework, ABPlanner allocates the budget across all stages, allowing a low-level auto-bidder to bids based on the budget allocation plan. The adaptability of ABPlanner is achieved through a sequential decision-making approach, inspired by in-context reinforcement learning. For each advertiser, ABPlanner adjusts the budget allocation plan episode by episode, using data from previous episodes as prompt for current decisions. This enables ABPlanner to quickly adapt to different advertisers with few-shot data, providing a sample-efficient solution. Extensive simulation experiments and real-world A/B testing validate the effectiveness of ABPlanner, demonstrating its capability to enhance the cumulative value achieved by auto-bidders.

An Adaptable Budget Planner for Enhancing Budget-Constrained Auto-Bidding in Online Advertising

TL;DR

This work tackles budget-constrained auto-bidding in online advertising by introducing ABPlanner, a few-shot adaptable budget planner that sits above a low-level auto-bidder in a hierarchical framework. ABPlanner models each advertiser's bidding episode as an MDP where a high-level budget plan over stages guides fast, per-episode adaptation using prompts from previous episodes (in-context reinforcement learning) and is trained with PPO. Through extensive pure and semi-simulation experiments and real-world A/B testing, ABPlanner consistently improves the cumulative value of auto-bidders and demonstrates rapid adaptation to unseen advertisers. The results highlight the practical viability of integrating a learned, sample-efficient budget planner into real-time bidding stacks, with potential future work on joint learning with the auto-bidder and per-stage dynamic planning.

Abstract

In online advertising, advertisers commonly utilize auto-bidding services to bid for impression opportunities. A typical objective of the auto-bidder is to optimize the advertiser's cumulative value of winning impressions within specified budget constraints. However, such a problem is challenging due to the complex bidding environment faced by diverse advertisers. To address this challenge, we introduce ABPlanner, a few-shot adaptable budget planner designed to improve budget-constrained auto-bidding. ABPlanner is based on a hierarchical bidding framework that decomposes the bidding process into shorter, manageable stages. Within this framework, ABPlanner allocates the budget across all stages, allowing a low-level auto-bidder to bids based on the budget allocation plan. The adaptability of ABPlanner is achieved through a sequential decision-making approach, inspired by in-context reinforcement learning. For each advertiser, ABPlanner adjusts the budget allocation plan episode by episode, using data from previous episodes as prompt for current decisions. This enables ABPlanner to quickly adapt to different advertisers with few-shot data, providing a sample-efficient solution. Extensive simulation experiments and real-world A/B testing validate the effectiveness of ABPlanner, demonstrating its capability to enhance the cumulative value achieved by auto-bidders.

Paper Structure

This paper contains 27 sections, 6 equations, 6 figures, 1 algorithm.

Figures (6)

  • Figure 1: The whole procedure of ABPlanner. At the beginning of each bidding episode of an advertiser, ABPlanner allocates the budget based on data from previous bidding episodes. Subsequently, the underlying auto-bidder bids according to the allocation plan, and ABPlanner collects information to inform future episodes for the advertiser.
  • Figure 2: Experimental results of simulation experiments. We present the return improvement of all the budget planners with respect to vanilla auto-bidders. The average results from $5$ different runs and the $95\%$ confidence intervals are displayed.
  • Figure 3: The average budget allocation plans output by ABPlanner across all advertisers in the test set of the pure simulation environment.
  • Figure 4: The average budget proportion in the last episode of the semi-simulation environment. The proportion is averaged from all advertisers in the test set. Each $2$ successive stages are grouped, displaying the budget proportion for all $12$ groups.
  • Figure 5: Experimental results of the last episode return improvement of ABPlanner with different numbers of stages in the pure simulation and semi-simulation environments.
  • ...and 1 more figures