Nonparametric Evaluation of Noisy ICA Solutions
Syamantak Kumar, Purnamrita Sarkar, Peter Bickel, Derek Bean
TL;DR
The paper tackles Blind Source Separation under Gaussian noise by introducing a nonparametric independence score based on the characteristic function, enabling data-driven selection among ICA algorithms without knowing noise parameters. It formalizes the independence score $\Delta(\mathbf{t},F|P)$, proves its consistency and uniform convergence, and presents two new CHF/CGF contrast functions that are robust to heavy tails and do not rely on higher moments. A convergence framework in pseudo-Euclidean space is developed, with global and local convergence guarantees for a broad class of contrasts, including the new CHF/CGF forms. Empirically, a Meta algorithm leveraging the independence score matches or surpasses the best candidate method across diverse distributions, noise levels, and tasks such as image demixing and MNIST denoising, highlighting practical impact for reliable ICA in noisy settings.
Abstract
Independent Component Analysis (ICA) was introduced in the 1980's as a model for Blind Source Separation (BSS), which refers to the process of recovering the sources underlying a mixture of signals, with little knowledge about the source signals or the mixing process. While there are many sophisticated algorithms for estimation, different methods have different shortcomings. In this paper, we develop a nonparametric score to adaptively pick the right algorithm for ICA with arbitrary Gaussian noise. The novelty of this score stems from the fact that it just assumes a finite second moment of the data and uses the characteristic function to evaluate the quality of the estimated mixing matrix without any knowledge of the parameters of the noise distribution. In addition, we propose some new contrast functions and algorithms that enjoy the same fast computability as existing algorithms like FASTICA and JADE but work in domains where the former may fail. While these also may have weaknesses, our proposed diagnostic, as shown by our simulations, can remedy them. Finally, we propose a theoretical framework to analyze the local and global convergence properties of our algorithms.
