Table of Contents
Fetching ...

Applying non-negative matrix factorization with covariates to label matrix for classification

Kenichi Satoh

TL;DR

NMF-LAB recasts classification as the inverse problem of non-negative matrix tri-factorization, yielding a direct, probabilistic mapping from covariates to class labels without an external classifier. By normalizing the factorization coefficients, it outputs class-membership probabilities and supports seamless generalization to unseen covariates, enabling semi-supervised learning. The framework offers two variants: a linear, interpretable direct model and a kernel-based nonlinear model that achieves competitive accuracy on diverse datasets, including MNIST via Nyström approximation. Empirical results demonstrate favorable predictive performance, robustness to label noise, interpretability trade-offs, and scalable applicability to high-dimensional data, establishing a unified, probabilistic approach to regression and classification within the tri-NMF paradigm.

Abstract

Non-negative matrix factorization (NMF) is widely used for dimensionality reduction and interpretable analysis, but standard formulations are unsupervised and cannot directly exploit class labels. Existing supervised or semi-supervised extensions usually incorporate labels only via penalties or graph constraints, still requiring an external classifier. We propose \textit{NMF-LAB} (Non-negative Matrix Factorization for Label Matrix), which redefines classification as the inverse problem of non-negative matrix tri-factorization (tri-NMF). Unlike joint NMF methods, which reconstruct both features and labels, NMF-LAB directly factorizes the label matrix $Y$ as the observation, while covariates $A$ are treated as given explanatory variables. This yields a direct probabilistic mapping from covariates to labels, distinguishing our method from label-matrix factorization approaches that mainly model label correlations or impute missing labels. Our inversion offers two key advantages: (i) class-membership probabilities are obtained directly from the factorization without a separate classifier, and (ii) covariates, including kernel-based similarities, can be seamlessly integrated to generalize predictions to unseen samples. In addition, unlabeled data can be encoded as uniform distributions, supporting semi-supervised learning. Experiments on diverse datasets, from small-scale benchmarks to the large-scale MNIST dataset, demonstrate that NMF-LAB achieves competitive predictive accuracy, robustness to noisy or incomplete labels, and scalability to high-dimensional problems, while preserving interpretability. By unifying regression and classification within the tri-NMF framework, NMF-LAB provides a novel, probabilistic, and scalable approach to modern classification tasks.

Applying non-negative matrix factorization with covariates to label matrix for classification

TL;DR

NMF-LAB recasts classification as the inverse problem of non-negative matrix tri-factorization, yielding a direct, probabilistic mapping from covariates to class labels without an external classifier. By normalizing the factorization coefficients, it outputs class-membership probabilities and supports seamless generalization to unseen covariates, enabling semi-supervised learning. The framework offers two variants: a linear, interpretable direct model and a kernel-based nonlinear model that achieves competitive accuracy on diverse datasets, including MNIST via Nyström approximation. Empirical results demonstrate favorable predictive performance, robustness to label noise, interpretability trade-offs, and scalable applicability to high-dimensional data, establishing a unified, probabilistic approach to regression and classification within the tri-NMF paradigm.

Abstract

Non-negative matrix factorization (NMF) is widely used for dimensionality reduction and interpretable analysis, but standard formulations are unsupervised and cannot directly exploit class labels. Existing supervised or semi-supervised extensions usually incorporate labels only via penalties or graph constraints, still requiring an external classifier. We propose \textit{NMF-LAB} (Non-negative Matrix Factorization for Label Matrix), which redefines classification as the inverse problem of non-negative matrix tri-factorization (tri-NMF). Unlike joint NMF methods, which reconstruct both features and labels, NMF-LAB directly factorizes the label matrix as the observation, while covariates are treated as given explanatory variables. This yields a direct probabilistic mapping from covariates to labels, distinguishing our method from label-matrix factorization approaches that mainly model label correlations or impute missing labels. Our inversion offers two key advantages: (i) class-membership probabilities are obtained directly from the factorization without a separate classifier, and (ii) covariates, including kernel-based similarities, can be seamlessly integrated to generalize predictions to unseen samples. In addition, unlabeled data can be encoded as uniform distributions, supporting semi-supervised learning. Experiments on diverse datasets, from small-scale benchmarks to the large-scale MNIST dataset, demonstrate that NMF-LAB achieves competitive predictive accuracy, robustness to noisy or incomplete labels, and scalability to high-dimensional problems, while preserving interpretability. By unifying regression and classification within the tri-NMF framework, NMF-LAB provides a novel, probabilistic, and scalable approach to modern classification tasks.

Paper Structure

This paper contains 25 sections, 20 equations, 2 figures, 7 tables.

Figures (2)

  • Figure 1: Orthodontic longitudinal data analyzed by NMF with covariates. Thin lines denote individual trajectories, labeled with subject IDs at both ends. Bold curves indicate fitted trajectories for males (gray) and females (black)
  • Figure 2: Predicted probability of Leicester as a function of Fe (horizontal axis) and P (vertical axis), with all other covariates fixed at their mean values. Darker shading indicates higher probability. Circles denote samples from Leicester, and crosses denote samples from Mancetter