Heaviside Low-Rank Support Matrix Machine

Xianchao Xiu; Shenghao Sun; Xinrong Li; Jiyuan Tao

Heaviside Low-Rank Support Matrix Machine

Xianchao Xiu, Shenghao Sun, Xinrong Li, Jiyuan Tao

TL;DR

A novel Heaviside low-rank SMM model called HL-SMM is proposed, which leverages the Heaviside loss instead of the common hinge or ramp losses for robustness and the low-rank constraint is adopted to accurately characterize the inherent global structure.

Abstract

Support matrix machine (SMM) is an emerging classification framework that directly handles matrix-structured observations, thereby avoiding the spatial correlations destroyed by vectorization. However, most existing SMM variants rely on convex or nonconvex surrogate loss functions, which may lead to high sensitivity to noise. To address this issue, we propose a novel Heaviside low-rank SMM model called HL-SMM, which leverages the Heaviside loss instead of the common hinge or ramp losses for robustness. Moreover, the low-rank constraint is adopted to accurately characterize the inherent global structure. In theory, we analyze the Karush-Kuhn-Tucker (KKT) points and rigorously prove the sufficient and necessary conditions. In algorithms, we develop an effective proximal alternating minimization (PAM) scheme, where all subproblems have closed-form solutions. Extensive experiments on benchmark datasets validate that the proposed HL-SMM achieves superior classification accuracy and robustness compared to state-of-the-art methods.

Heaviside Low-Rank Support Matrix Machine

TL;DR

Abstract

Paper Structure (14 sections, 6 theorems, 38 equations, 4 figures, 4 tables, 1 algorithm)

This paper contains 14 sections, 6 theorems, 38 equations, 4 figures, 4 tables, 1 algorithm.

Introduction
Preliminaries
The Proposed Method
New Formulation
Optimality Conditions
Optimization Algorithm
Convergence Analysis
Numerical Experiments
Experimental Setup
Experimental Results
Discussions
Parameter Analysis
Convergence Visualization
Conclusion

Key Result

Lemma 1

Given $\mathbf{W}\in\mathbb{R}^{p\times q}$ with singular values $\sigma_1(\mathbf{W})\ge\cdots\ge\sigma_s(\mathbf{W})>0$, the (possibly set-valued) projection onto the rank-constrained set is when $\sigma_r(\mathbf{W})>\sigma_{r+1}(\mathbf{W})$ (or $\sigma_r(\mathbf{W})=0$), the projection is single-valued.

Figures (4)

Figure 1: Coefficient matrices obtained by different loss functions.
Figure 2: Robustness comparison under 20% Gaussian noise.
Figure 3: Parameter sensitivity analysis of $r$ and $\beta$.
Figure 4: Convergence curves of the proposed algorithm.

Theorems & Definitions (11)

Lemma 1: Projection of $\mathcal{R}$
Lemma 2: Normal cone of $\mathcal{R}$
Lemma 3: Proximal operator of $\|(\cdot)_+\|_0$
Definition 1
Theorem 1: Necessary optimality condition
proof
Theorem 2: Suffcient optimality condition
proof
Theorem 3
proof
...and 1 more

Heaviside Low-Rank Support Matrix Machine

TL;DR

Abstract

Heaviside Low-Rank Support Matrix Machine

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (11)