Orthogonal Constrained Neural Networks for Solving Structured Inverse Eigenvalue Problems
Shuai Zhang, Xuelian Jiang, Hao Qian, Yingxiang Xu
TL;DR
The paper tackles algebraic Structured Inverse Eigenvalue Problems (SIEPs) by introducing a unified, unsupervised neural framework that embeds orthogonality constraints via a Stiefel-layer within a multilayer perceptron (SMLP). A unified loss function combines nonnegativity, prescribed-entry, and row-sum constraints, enabling a hard-constraint optimization on the Stiefel manifold $Q \,\in\, \mathcal{O}(n)$. The approach demonstrates strong performance across symmetric NIEP/NIEP with prescribed entries, Euclidean Distance Matrix IEPs, stochastic and generalized stochastic IEPs, and large-scale problems, including graph-theoretic IEPs and microwave-filter network transformations, with notable speedups and high convergence rates. The work provides a versatile, scalable method for solving diverse SIEPs and suggests broad applicability to other orthogonality-constrained problems in numerical linear algebra and engineering.
Abstract
This paper introduces a novel neural network for efficiently solving Structured Inverse Eigenvalue Problems (SIEPs). The main contributions lie in two aspects: firstly, a unified framework is proposed that can handle various SIEPs instances. Particularly, an innovative method for handling nonnegativity constraints is devised using the ReLU function. Secondly, a novel neural network based on multilayer perceptrons, utilizing the Stiefel layer, is designed to efficiently solve SIEP. By incorporating the Stiefel layer through matrix orthogonal decomposition, the orthogonality of similarity transformations is ensured, leading to accurate solutions for SIEPs. Hence, we name this new network Stiefel Multilayer Perceptron (SMLP). Furthermore, SMLP is an unsupervised learning approach with a lightweight structure that is easy to train. Several numerical tests from literature and engineering domains demonstrate the efficiency of SMLP.
