A New Bound on the Cumulant Generating Function of Dirichlet Processes
Pierre Perrault, Denis Belomestny, Pierre Ménard, Éric Moulines, Alexey Naumov, Daniil Tiapkin, Michal Valko
TL;DR
This paper addresses bounding the cumulant generating function (CGF) of a Dirichlet process X ~ DP(α ν0) to obtain non-asymptotic concentration bounds for sums of independent DPs. It develops a superadditivity-based approach, connects to Varadhan's integral lemma, and applies Fekete's lemma to derive a CGF bound in terms of the reversed KL divergence. The main result is log M_{DP(α ν0)}(f) ≤ sup_{ν ∈ M1(Ω)} ( Eν[f] − α KL(ν0||ν) ), i.e., the convex conjugate of α KL(ν0||·). The bound yields practical confidence regions and is asymptotically optimal as α → ∞, with demonstrated applicability to problems such as Combinatorial Thompson Sampling.
Abstract
In this paper, we introduce a novel approach for bounding the cumulant generating function (CGF) of a Dirichlet process (DP) $X \sim \text{DP}(αν_0)$, using superadditivity. In particular, our key technical contribution is the demonstration of the superadditivity of $α\mapsto \log \mathbb{E}_{X \sim \text{DP}(αν_0)}[\exp( \mathbb{E}_X[αf])]$, where $\mathbb{E}_X[f] = \int f dX$. This result, combined with Fekete's lemma and Varadhan's integral lemma, converts the known asymptotic large deviation principle into a practical upper bound on the CGF $ \log\mathbb{E}_{X\sim \text{DP}(αν_0)}{\exp(\mathbb{E}_{X}{[f]})} $ for any $α> 0$. The bound is given by the convex conjugate of the scaled reversed Kullback-Leibler divergence $α\mathrm{KL}(ν_0\Vert \cdot)$. This new bound provides particularly effective confidence regions for sums of independent DPs, making it applicable across various fields.
