Multi-agent Auto-Bidding with Latent Graph Diffusion Models

Dom Huh; Prasant Mohapatra

Multi-agent Auto-Bidding with Latent Graph Diffusion Models

Dom Huh, Prasant Mohapatra

TL;DR

This work tackles auto-bidding in large-scale, multi-agent auctions with KPI constraints by introducing LGD-AB, a framework that combines learnable graph embeddings of impression opportunities and agents with a planning-based latent diffusion model for trajectory forecasting and bid planning. The two-tier approach first encodes auction dynamics via a graph representation and multi-agent context, then uses a latent diffusion model to plan bids while aligning outcomes to KPI targets through reward alignment and offline value-function guidance. Key contributions include a graph-based representation of IO interdependencies, a diffusion-based planning module, and a KPI-aligned training loop demonstrated on real-world AuctionNet data and synthetic auctions, yielding improvements in CPA, ROI, win rate, and social welfare, along with better forecast accuracy under partial observability. The results suggest LGD-AB enables more effective, KPI-compliant auto-bidding in large, open multi-agent auction environments and provides a scalable path toward richer graph-driven planning in practice.

Abstract

This paper proposes a diffusion-based auto-bidding framework that leverages graph representations to model large-scale auction environments. In such settings, agents must dynamically optimize bidding strategies under constraints defined by key performance indicator (KPI) metrics, all while operating in competitive environments characterized by uncertain, sparse, and stochastic variables. To address these challenges, we introduce a novel approach combining learnable graph-based embeddings with a planning-based latent diffusion model (LDM). By capturing patterns and nuances underlying the interdependence of impression opportunities and the multi-agent dynamics of the auction environment, the graph representation enable expressive computations regarding auto-bidding outcomes. With reward alignment techniques, the LDM's posterior is fine-tuned to generate auto-bidding trajectories that maximize KPI metrics while satisfying constraint thresholds. Empirical evaluations on both real-world and synthetic auction environments demonstrate significant improvements in auto-bidding performance across multiple common KPI metrics, as well as accuracy in forecasting auction outcomes.

Multi-agent Auto-Bidding with Latent Graph Diffusion Models

TL;DR

Abstract

Multi-agent Auto-Bidding with Latent Graph Diffusion Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)