Unveiling the Cycloid Trajectory of EM Iterations in Mixed Linear Regression
Zhankun Luo, Abolfazl Hashemi
TL;DR
This work analyzes EM convergence for the two-component mixed linear regression (2MLR) with unlabeled data. It derives explicit population EM updates across all SNR using Bessel functions, and reveals that in the noiseless setting the EM iterates follow a cycloid trajectory within the span of the initialization and the truth, enabling a precise recurrence for the sub-optimality angle. The authors prove a transition from linear to quadratic convergence and establish finite-sample error bounds for regression parameters and mixing weights, with a three-stage convergence scheme and minimal dependence on mixing weights. Empirical results validate the cycloid trajectory, show robust quadratic convergence at high SNR, and demonstrate that the regression update is largely independent of the true mixing weights, supporting the practicality of the theory and suggesting avenues for extensions to weak separation and more components.
Abstract
We study the trajectory of iterations and the convergence rates of the Expectation-Maximization (EM) algorithm for two-component Mixed Linear Regression (2MLR). The fundamental goal of MLR is to learn the regression models from unlabeled observations. The EM algorithm finds extensive applications in solving the mixture of linear regressions. Recent results have established the super-linear convergence of EM for 2MLR in the noiseless and high SNR settings under some assumptions and its global convergence rate with random initialization has been affirmed. However, the exponent of convergence has not been theoretically estimated and the geometric properties of the trajectory of EM iterations are not well-understood. In this paper, first, using Bessel functions we provide explicit closed-form expressions for the EM updates under all SNR regimes. Then, in the noiseless setting, we completely characterize the behavior of EM iterations by deriving a recurrence relation at the population level and notably show that all the iterations lie on a certain cycloid. Based on this new trajectory-based analysis, we exhibit the theoretical estimate for the exponent of super-linear convergence and further improve the statistical error bound at the finite-sample level. Our analysis provides a new framework for studying the behavior of EM for Mixed Linear Regression.
