Robustness of Minimum-Volume Nonnegative Matrix Factorization under an Expanded Sufficiently Scattered Condition
Giovanni Barbarino, Nicolas Gillis, Subhayan Saha
TL;DR
This work proves that minimum-volume NMF can robustly recover ground-truth factors under noisy data when the ground-truth abundances satisfy an expanded sufficiently scattered condition ($p$-SSC). By linking the estimated and ground-truth factors through a connecting matrix $R$ and carefully bounding perturbations, the authors derive explicit stability results for general $p$-SSC and for the near-separable case ($p$ near 1), showing that recovery remains feasible as long as the noise level is controlled relative to the SSC strength and the conditioning of the ground-truth basis. The analysis leverages dual cone geometry, the $H_p$ construction, and a permutation-approximation argument to establish that the min-vol solution identifies $W^{\#}$ (up to permutation) and enables recovery of $H^{\#}$. These results clarify how data spread within the latent simplex and the expanded SSC contribute to identifiability in the presence of noise, with practical implications for hyperspectral unmixing, topic modeling, and related NMF applications.
Abstract
Minimum-volume nonnegative matrix factorization (min-vol NMF) has been used successfully in many applications, such as hyperspectral imaging, chemical kinetics, spectroscopy, topic modeling, and audio source separation. However, its robustness to noise has been a long-standing open problem. In this paper, we prove that min-vol NMF identifies the groundtruth factors in the presence of noise under a condition referred to as the expanded sufficiently scattered condition which requires the data points to be sufficiently well scattered in the latent simplex generated by the basis vectors.
