Schur's Positive-Definite Network: Deep Learning in the SPD cone with structure
Can Pouliquen, Mathurin Massias, Titouan Vayer
TL;DR
This work tackles the problem of estimating symmetric positive-definite (SPD) matrices while enforcing additional structure such as elementwise sparsity. It introduces SpodNet, an SPD-to-SPD learning module that preserves SPD outputs via Schur's condition and supports arbitrary constraints through learned updates of column-row pairs and diagonals. The framework is instantiated in three architectures—UBG, PNP, and E2E—each motivated by proximal-block-coordinate descent, and it includes a stability-preserving update strategy. Empirically, SpodNet achieves competitive or superior performance to traditional estimators in sparse precision-matrix recovery and yields coherent graphs in unsupervised real-world graph learning, highlighting its potential for practical SPD-constrained learning tasks.
Abstract
Estimating matrices in the symmetric positive-definite (SPD) cone is of interest for many applications ranging from computer vision to graph learning. While there exist various convex optimization-based estimators, they remain limited in expressivity due to their model-based approach. The success of deep learning motivates the use of learning-based approaches to estimate SPD matrices with neural networks in a data-driven fashion. However, designing effective neural architectures for SPD learning is challenging, particularly when the task requires additional structural constraints, such as element-wise sparsity. Current approaches either do not ensure that the output meets all desired properties or lack expressivity. In this paper, we introduce SpodNet, a novel and generic learning module that guarantees SPD outputs and supports additional structural constraints. Notably, it solves the challenging task of learning jointly SPD and sparse matrices. Our experiments illustrate the versatility and relevance of SpodNet layers for such applications.
