Allocating Variance to Maximize Expectation
Renato Purita Paes Leme, Cliff Stein, Yifeng Teng, Pratik Worah
TL;DR
This work studies how to allocate a fixed variance budget among Gaussian variables to maximize the expected aggregate performance, either a single-set maximum $\mathbb{E}[\max_i X_i]$ or a sum over multiple sets $\mathbb{E}[\sum_{j=1}^m \max_{i\in S_j} X_i]$. It develops PTASes for the independent and correlated single-set problems by exploiting a structural bound that most variance can be ignored below a small threshold and by grid-discretizing the remaining decision space; for the multi-set case it yields a provable $\Omega\left(\frac{1}{\log n}\right)$-approximation via a bucketed, submodular-maximization approach. The analysis leverages a key eps-contribution lemma, Lipschitz-type smoothness, and Earth Mover’s Distance arguments to relate nearby distributions, enabling efficient approximation despite non-convex objectives and correlation. The results have implications for variance shaping in ML reliability, mixture-model learning, and revenue optimization in auctions, where tuning prediction uncertainty directly impacts performance.
Abstract
We design efficient approximation algorithms for maximizing the expectation of the supremum of families of Gaussian random variables. In particular, let $\mathrm{OPT}:=\max_{σ_1,\cdots,σ_n}\mathbb{E}\left[\sum_{j=1}^{m}\max_{i\in S_j} X_i\right]$, where $X_i$ are Gaussian, $S_j\subset[n]$ and $\sum_iσ_i^2=1$, then our theoretical results include: - We characterize the optimal variance allocation -- it concentrates on a small subset of variables as $|S_j|$ increases, - A polynomial time approximation scheme (PTAS) for computing $\mathrm{OPT}$ when $m=1$, and - An $O(\log n)$ approximation algorithm for computing $\mathrm{OPT}$ for general $m>1$. Such expectation maximization problems occur in diverse applications, ranging from utility maximization in auctions markets to learning mixture models in quantitative genetics.
