Scalable Expectation Estimation with Subtractive Mixture Models
Lena Zellinger, Nicola Branchini, Víctor Elvira, Antonio Vergari
TL;DR
The paper addresses the challenge of estimating expectations when target distributions are complex by leveraging subtractive mixture models (SMMs) as powerful, expressive proposals in importance sampling. It introduces ΔEx, an unbiased estimator that avoids direct sampling from SMMs by decomposing an SMM into the difference of additive MMs and sampling from the positive and negative parts separately. ΔEx is shown to be unbiased and strongly consistent, with a variance analysis guiding allocation across the two components; a safe-mixing variant further stabilizes performance in valley-rich regions. Empirical results demonstrate that ΔEx can match the estimation quality of costly autoregressive sampling while delivering substantial runtime gains, and initial normalizing-constant experiments highlight the importance of robust proposal design (including safe components) for practical use. The work lays groundwork for adaptive IS with SMMs and suggests directions toward hierarchical, circuit-based mixtures and more refined variance-reduction strategies.
Abstract
Many Monte Carlo (MC) and importance sampling (IS) methods use mixture models (MMs) for their simplicity and ability to capture multimodal distributions. Recently, subtractive mixture models (SMMs), i.e. MMs with negative coefficients, have shown greater expressiveness and success in generative modeling. However, their negative parameters complicate sampling, requiring costly auto-regressive techniques or accept-reject algorithms that do not scale in high dimensions. In this work, we use the difference representation of SMMs to construct an unbiased IS estimator ($Δ\text{Ex}$) that removes the need to sample from the SMM, enabling high-dimensional expectation estimation with SMMs. In our experiments, we show that $Δ\text{Ex}$ can achieve comparable estimation quality to auto-regressive sampling while being considerably faster in MC estimation. Moreover, we conduct initial experiments with $Δ\text{Ex}$ using hand-crafted proposals, gaining first insights into how to construct safe proposals for $Δ\text{Ex}$.
