Deep Nonnegative Matrix Factorization with Beta Divergences
Valentin Leplat, Le Thi Khanh Hien, Akwum Onwunta, Nicolas Gillis
TL;DR
This work develops deep nonnegative matrix factorization methods based on β-divergences, with a focus on KL divergence, to extract multi-layered features across domains. It advocates a layer-centric loss for identifiability and proposes two models: (i) a nonregularized deep β-NMF and (ii) a minimum-volume deep KL-NMF with log-determinant regularization. The authors derive multiplicative-update–based algorithms within a block majorization-minimization framework, prove convergence under perturbations, and demonstrate improved layer balance, sparser features, and meaningful hierarchical structures in facial features, topic modeling, and hyperspectral unmixing. The approach shows practical advantages in interpretability and endmember localization, with code available for reproducibility. Collectively, the paper advances regularized deep NMF with β-divergences and provides scalable optimization tools for real-world data.
Abstract
Deep Nonnegative Matrix Factorization (deep NMF) has recently emerged as a valuable technique for extracting multiple layers of features across different scales. However, all existing deep NMF models and algorithms have primarily centered their evaluation on the least squares error, which may not be the most appropriate metric for assessing the quality of approximations on diverse datasets. For instance, when dealing with data types such as audio signals and documents, it is widely acknowledged that $β$-divergences offer a more suitable alternative. In this paper, we develop new models and algorithms for deep NMF using some $β$-divergences, with a focus on the Kullback-Leibler divergence. Subsequently, we apply these techniques to the extraction of facial features, the identification of topics within document collections, and the identification of materials within hyperspectral images.
