Causal Inference on Stopped Random Walks in Online Advertising
Jia Yuan Yu
TL;DR
The paper addresses estimating long-term causal effects of online-advertising treatments under interference, where a treatment alters user trajectories, advertiser budgets, and population size. It develops a Markov-chain framework with stopped random walks to model long-run revenue and introduces a permutation trick to reduce state-space complexity, enabling tractable analysis. A budget-splitting experimental design together with a Markov-chain CLT and a Wald-like equation yields a confidence interval for the long-term treatment effect $\Delta$, including guidance on experiment duration via asymptotic variances and stopping-time considerations. The work advances causal inference in two-sided marketplaces by handling random population sizes, interdependencies, and non-i.i.d. measurements, with practical implications for designing long-running experiments in large online advertising ecosystems.
Abstract
We consider a causal inference problem frequently encountered in online advertising systems, where a publisher (e.g., Instagram, TikTok) interacts repeatedly with human users and advertisers by sporadically displaying to each user an advertisement selected through an auction. Each treatment corresponds to a parameter value of the advertising mechanism (e.g., auction reserve-price), and we want to estimate through experiments the corresponding long-term treatment effect (e.g., annual advertising revenue). In our setting, the treatment affects not only the instantaneous revenue from showing an ad, but also changes each user's interaction-trajectory, and each advertiser's bidding policy -- as the latter is constrained by a finite budget. In particular, each a treatment may even affect the size of the population, since users interact longer with a tolerable advertising mechanism. We drop the classical i.i.d. assumption and model the experiment measurements (e.g., advertising revenue) as a stopped random walk, and use a budget-splitting experimental design, the Anscombe Theorem, a Wald-like equation, and a Central Limit Theorem to construct confidence intervals for the long-term treatment effect.
