Checking the Sufficiently Scattered Condition using a Global Non-Convex Optimization Software
Nicolas Gillis, Robert Luce
TL;DR
The paper addresses identifiability in matrix factorization by focusing on the sufficiently scattered condition ($SSC$). It reformulates the SSC check as a bounded non-convex quadratic program and solves it with a global solver (Gurobi), augmented by a polynomial-time necessary condition (NC-SSC) and McCormick envelope relaxations. The authors prove a tightening result that reduces the test to a bounded problem whose optimum equals $1$ if and only if $SSC$ holds, and demonstrate practical feasibility on synthetic data and real hyperspectral images, including minimum-volume NMF. This provides a practical, exact post hoc certificate of identifiability for a broad class of constrained factorizations, with meaningful impact in domains like hyperspectral imaging.
Abstract
The sufficiently scattered condition (SSC) is a key condition in the study of identifiability of various matrix factorization problems, including nonnegative, minimum-volume, symmetric, simplex-structured, and polytopic matrix factorizations. The SSC allows one to guarantee that the computed matrix factorization is unique/identifiable, up to trivial ambiguities. However, this condition is NP-hard to check in general. In this paper, we show that it can however be checked in a reasonable amount of time in realistic scenarios, when the factorization rank is not too large. This is achieved by formulating the problem as a non-convex quadratic optimization problem over a bounded set. We use the global non-convex optimization software Gurobi, and showcase the usefulness of this code on synthetic data sets and on real-world hyperspectral images.
