Orthogonal Nonnegative Matrix Factorization with the Kullback-Leibler divergence
Jean Pacifique Nkurunziza, Fulgence Nahayo, Nicolas Gillis
TL;DR
The paper addresses clustering with ONMF under a KL divergence objective, suitable for nonnegative, Poisson-like data such as word-count vectors and certain imaging data. It develops KL-ONMF using alternating optimization with closed-form updates: W is updated as a cluster-wise average of data columns, and H is updated by selecting active clusters per data point via a KL-consistent criterion, ensuring HH^⊤ = I_r. Empirically, KL-ONMF achieves higher clustering accuracy on document datasets and improved material endmember extraction in hyperspectral unmixing compared to Frobenius-norm ONMF, with competitive run times and favorable convergence. The approach is scalable, with a per-iteration cost on the order of O(nnz(X) r), and the authors provide code and discuss extending the framework to additional divergences for broader applicability.
Abstract
Orthogonal nonnegative matrix factorization (ONMF) has become a standard approach for clustering. As far as we know, most works on ONMF rely on the Frobenius norm to assess the quality of the approximation. This paper presents a new model and algorithm for ONMF that minimizes the Kullback-Leibler (KL) divergence. As opposed to the Frobenius norm which assumes Gaussian noise, the KL divergence is the maximum likelihood estimator for Poisson-distributed data, which can model better sparse vectors of word counts in document data sets and photo counting processes in imaging. We develop an algorithm based on alternating optimization, KL-ONMF, and show that it performs favorably with the Frobenius-norm based ONMF for document classification and hyperspectral image unmixing.
