Gaussian Mixture Model with unknown diagonal covariances via continuous sparse regularization
Romane Giard, Yohann de Castro, Clément Marteau
TL;DR
This work develops a convex Beurling-LASSO approach for Gaussian mixtures with unknown diagonal covariances by lifting parameters to the space of Radon measures and using a normalization that yields a kernel-based, non-translation-invariant framework. It proves non-asymptotic recovery guarantees via non-degenerate dual certificates built within a Fisher-Rao geometric framework, introducing a semi-distance aligned with the kernel to control near/far regions and parameter recovery. The results yield near-parametric convergence rates for component means, diagonals, and weights, along with density prediction guarantees, and they establish sparsity of the BLASSO solution under large samples under a Non-Degenerate Source Condition. The methodology advances beyond prior BLASSO off-the-grid work by accommodating component-specific diagonal covariances and by deriving explicit separation conditions tied to the kernel geometry, with implications for accurate mixture learning and density estimation in higher dimensions.
Abstract
This paper addresses the statistical estimation of Gaussian Mixture Models (GMMs) with unknown diagonal covariances from independent and identically distributed samples. We employ the Beurling-LASSO (BLASSO), a convex optimization framework that promotes sparsity in the space of measures, to simultaneously estimate the number of components and their parameters. Our main contribution extends the BLASSO methodology to multivariate GMMs with component-specific unknown diagonal covariance matrices-a significantly more flexible setting than previous approaches requiring known and identical covariances. We establish non-asymptotic recovery guarantees with nearly parametric convergence rates for component means, diagonal covariances, and weights, as well as for density prediction. A key theoretical contribution is the identification of an explicit separation condition on mixture components that enables the construction of non-degenerate dual certificates-essential tools for establishing statistical guarantees for the BLASSO. Our analysis leverages the Fisher-Rao geometry of the statistical model and introduces a novel semi-distance adapted to our framework, providing new insights into the interplay between component separation, parameter space geometry, and achievable statistical recovery.
