On the Entropy of General Mixture Distributions
Namyoon Lee
TL;DR
This work addresses the challenge of computing the differential entropy of general mixture distributions, which is intractable due to the log-sum structure. By recasting mixtures as a memoryless channel with input $C$ and output $X$, it yields the exact decomposition $h(X)=h(X|C)+I(X;C)$ and establishes universal bounds $h(X|C)\le h(X)\le h(X|C)+H(C)$. It then provides a deterministic, closed-form approximation ${\hat{h}}(X)=h_{\sf L}(X)+{\bar{\Delta}}$ based on pairwise overlaps $z_{c,d}$, with a family-dependent offset $\bar{\Delta}$ calibrated to be exact in complete and zero overlap regimes and clipped to respect bounds. The authors derive explicit closed-form ingredients for time-honored mixture families (Gaussian, factorized Laplacian, uniform, and hybrids) and validate the method numerically across separation, dimension, component count, and correlation, demonstrating accurate and computationally tractable entropy estimation for practical applications.
Abstract
Mixture distributions are a workhorse model for multimodal data in information theory, signal processing, and machine learning. Yet even when each component density is simple, the differential entropy of the mixture is notoriously hard to compute because the mixture couples a logarithm with a sum. This paper develops a deterministic, closed-form toolkit for bounding and accurately approximating mixture entropy directly from component parameters. Our starting point is an information-theoretic channel viewpoint: the latent mixture label plays the role of an input, and the observation is the output. This viewpoint separates mixture entropy into an average within-component uncertainty plus an overlap term that quantifies how much the observation reveals about the hidden label. We then bound and approximate this overlap term using pairwise overlap integrals between component densities, yielding explicit expressions whenever these overlaps admit a closed form. A simple, family-dependent offset corrects the systematic bias of the Jensen overlap bound and is calibrated to be exact in the two limiting regimes of complete overlap and near-perfect separation. A final clipping step guarantees that the estimate always respects universal information-theoretic bounds. Closed-form specializations are provided for Gaussian, factorized Laplacian, uniform, and hybrid mixtures, and numerical experiments validate the resulting bounds and approximations across separation, dimension, number of components, and correlated covariances.
