Table of Contents
Fetching ...

Smoothed Separable Nonnegative Matrix Factorization

Nicolas Nadisic, Nicolas Gillis, Christophe Kervazo

TL;DR

This work strengthens separable NMF by relaxing the pure-pixel assumption to proximate latent points and introducing smoothed variants of two classic algorithms. By aggregating or smoothing data around multiple near-vertex points, SVCA and SSPA achieve improved robustness to noise and enhanced vertex recovery, with SVCA supported by theoretical guarantees analogous to ALLS. Empirically, the smoothed methods outperform VCA, SPA, and ALLS on synthetic data, hyperspectral unmixing, and facial feature extraction, with median aggregation providing robustness to non-Gaussian noise. Overall, the paper demonstrates that smoothed separable NMF is a practically effective and theoretically grounded approach for convex-hull–based vertex estimation in noisy settings.

Abstract

Given a set of data points belonging to the convex hull of a set of vertices, a key problem in linear algebra, signal processing, data analysis and machine learning is to estimate these vertices in the presence of noise. Many algorithms have been developed under the assumption that there is at least one nearby data point to each vertex; two of the most widely used ones are vertex component analysis (VCA) and the successive projection algorithm (SPA). This assumption is known as the pure-pixel assumption in blind hyperspectral unmixing, and as the separability assumption in nonnegative matrix factorization. More recently, Bhattacharyya and Kannan (ACM-SIAM Symposium on Discrete Algorithms, 2020) proposed an algorithm for learning a latent simplex (ALLS) that relies on the assumption that there is more than one nearby data point to each vertex. In that scenario, ALLS is probalistically more robust to noise than algorithms based on the separability assumption. In this paper, inspired by ALLS, we propose smoothed VCA (SVCA) and smoothed SPA (SSPA) that generalize VCA and SPA by assuming the presence of several nearby data points to each vertex. We illustrate the effectiveness of SVCA and SSPA over VCA, SPA and ALLS on synthetic data sets, on the unmixing of hyperspectral images, and on feature extraction on facial images data sets. In addition, our study highlights new theoretical results for VCA.

Smoothed Separable Nonnegative Matrix Factorization

TL;DR

This work strengthens separable NMF by relaxing the pure-pixel assumption to proximate latent points and introducing smoothed variants of two classic algorithms. By aggregating or smoothing data around multiple near-vertex points, SVCA and SSPA achieve improved robustness to noise and enhanced vertex recovery, with SVCA supported by theoretical guarantees analogous to ALLS. Empirically, the smoothed methods outperform VCA, SPA, and ALLS on synthetic data, hyperspectral unmixing, and facial feature extraction, with median aggregation providing robustness to non-Gaussian noise. Overall, the paper demonstrates that smoothed separable NMF is a practically effective and theoretically grounded approach for convex-hull–based vertex estimation in noisy settings.

Abstract

Given a set of data points belonging to the convex hull of a set of vertices, a key problem in linear algebra, signal processing, data analysis and machine learning is to estimate these vertices in the presence of noise. Many algorithms have been developed under the assumption that there is at least one nearby data point to each vertex; two of the most widely used ones are vertex component analysis (VCA) and the successive projection algorithm (SPA). This assumption is known as the pure-pixel assumption in blind hyperspectral unmixing, and as the separability assumption in nonnegative matrix factorization. More recently, Bhattacharyya and Kannan (ACM-SIAM Symposium on Discrete Algorithms, 2020) proposed an algorithm for learning a latent simplex (ALLS) that relies on the assumption that there is more than one nearby data point to each vertex. In that scenario, ALLS is probalistically more robust to noise than algorithms based on the separability assumption. In this paper, inspired by ALLS, we propose smoothed VCA (SVCA) and smoothed SPA (SSPA) that generalize VCA and SPA by assuming the presence of several nearby data points to each vertex. We illustrate the effectiveness of SVCA and SSPA over VCA, SPA and ALLS on synthetic data sets, on the unmixing of hyperspectral images, and on feature extraction on facial images data sets. In addition, our study highlights new theoretical results for VCA.

Paper Structure

This paper contains 18 sections, 2 theorems, 15 equations, 20 figures, 5 tables, 5 algorithms.

Key Result

Theorem 1

gillis2013fast Let ${X} = WH+N$ as in Problem prob:main, and let Assumptions ass:sep and ass:cwn be satisfied, that is, $W = X(:,\mathcal{K}^*)$ for some index set $\mathcal{K}^*$ of cardinality $r$, and $\| N(:,j)\|_2 \leq \epsilon$ for all $j$ where $\epsilon \leq \mathcal{O}\left( \frac{\sigma_r^ where $\mathcal{K}(k)$ denotes the $k$th index in $\mathcal{K}$.

Figures (20)

  • Figure 1: Results for ALLS, SVCA, and SSPA for different values of $p$, when $\epsilon$ varies, for fixed $n=1000$ and purity $\alpha=0.05$ ($\delta=2.7\%$). Values for ALLS and SVCA are the medians over 30 trials. Note that ALLS(1)=SVCA(1)=VCA.
  • Figure 2: Comparison of SSPA with best, median, and worst result for ALLS and SVCA among 30 runs, when $\epsilon$ varies, for fixed $n=1000$, purity $\alpha=0.05$ ($\delta=2.7\%$) and parameter $p=25$. "Best err SVCA" represents the MRSA of the solution of SVCA that has the smallest relative reconstruction error.
  • Figure 3: Results for SVCA and SSPA for different values of $p$, when $\epsilon$ varies, for fixed $n=1000$ and purity $\alpha=0.05$ ($\delta=2.7\%$), Values for SVCA are the medians over 30 trials.
  • Figure 4: Results for SVCA and SSPA using either the median or the mean to average points, when $p$ varies, for fixed $n=1000$, purity $\alpha=0.05$ ($\delta=2.7\%$), and noise $\epsilon=0.05$. Values for SVCA are the medians over 30 trials.
  • Figure 5: Results for SVCA and SSPA using either the median or the mean to average points, when $p$ varies, for Poisson noise of level $\epsilon=0.1$, for fixed $n=1000$, purity $\alpha=0.02$ ($\delta=5.9\%$). Values for SVCA are the medians over 30 trials.
  • ...and 15 more figures

Theorems & Definitions (3)

  • Remark 1: On the projection $P^\bot$
  • Theorem 1
  • Theorem 2