Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry

Rui Wang; Shaocheng Jin; Ziheng Chen; Xiaoqing Luo; Xiao-Jun Wu

Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry

Rui Wang, Shaocheng Jin, Ziheng Chen, Xiaoqing Luo, Xiao-Jun Wu

TL;DR

Covariance matrices live on the SPD manifold $oldsymbol{{ m S}_{++}^d}$, where ill-conditioning hinders normalization in RBN. The paper introduces GBWBN, a Riemannian batch normalization framework built on the generalized Bures-Wasserstein metric $(g^{( heta) ext{-GBW}})$ with a learnable SPD matrix $oldsymbol{M}$ and a power deformation parameter $ heta$, enabling a flexible normalization geometry. A RBN layer under BW is extended via a Riemannian isometry to GBWM, with training of the bias $oldsymbol{ m G}$ and metric components using Riemannian stochastic gradient descent. Empirical results on skeleton-based action recognition (HDM05, NTU RGB+D) and EEG/SSVEP (MAMEM-SSVEP-II) demonstrate improved conditioning and accuracy over AIM-based and other RBN baselines, validating the practical impact of geometry-aware SPD normalization.

Abstract

Covariance matrices have proven highly effective across many scientific fields. Since these matrices lie within the Symmetric Positive Definite (SPD) manifold - a Riemannian space with intrinsic non-Euclidean geometry, the primary challenge in representation learning is to respect this underlying geometric structure. Drawing inspiration from the success of Euclidean deep learning, researchers have developed neural networks on the SPD manifolds for more faithful covariance embedding learning. A notable advancement in this area is the implementation of Riemannian batch normalization (RBN), which has been shown to improve the performance of SPD network models. Nonetheless, the Riemannian metric beneath the existing RBN might fail to effectively deal with the ill-conditioned SPD matrices (ICSM), undermining the effectiveness of RBN. In contrast, the Bures-Wasserstein metric (BWM) demonstrates superior performance for ill-conditioning. In addition, the recently introduced Generalized BWM (GBWM) parameterizes the vanilla BWM via an SPD matrix, allowing for a more nuanced representation of vibrant geometries of the SPD manifold. Therefore, we propose a novel RBN algorithm based on the GBW geometry, incorporating a learnable metric parameter. Moreover, the deformation of GBWM by matrix power is also introduced to further enhance the representational capacity of GBWM-based RBN. Experimental results on different datasets validate the effectiveness of our proposed method.

Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry

TL;DR

Abstract

Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (8)