Operator SVD with Neural Networks via Nested Low-Rank Approximation
J. Jon Ryu, Xiangxiang Xu, H. S. Melihcan Erol, Yuheng Bu, Lizhong Zheng, Gregory W. Wornell
TL;DR
This work introduces NeuralSVD, an unconstrained neural network-based framework for learning the top-$L$ ordered singular functions of a linear operator by casting SVD as a low-rank approximation problem (LoRA) and applying nesting to impose the correct ordering. The key innovations are Schmidt's LoRA objective, sequential and joint nesting strategies, and their realization in NeuralSVD with disjoint or shared networks, enabling efficient gradient-based optimization for non-self-adjoint operators and providing EVD as a special case. Empirical results on analytical PDE operators (e.g., 2D hydrogen and harmonic oscillator) and a cross-domain retrieval task based on canonical dependence kernels demonstrate that NeuralSVD can recover accurate, orthogonal singular functions and yield structured, compact representations that outperform prior parametric methods like SpIN and NeuralEF. The approach scales favorably with the number of modes and offers practical benefits for PDEs, spectral embeddings, and cross-domain learning, with open-source implementations to facilitate adoption and comparison.
Abstract
Computing eigenvalue decomposition (EVD) of a given linear operator, or finding its leading eigenvalues and eigenfunctions, is a fundamental task in many machine learning and scientific computing problems. For high-dimensional eigenvalue problems, training neural networks to parameterize the eigenfunctions is considered as a promising alternative to the classical numerical linear algebra techniques. This paper proposes a new optimization framework based on the low-rank approximation characterization of a truncated singular value decomposition, accompanied by new techniques called \emph{nesting} for learning the top-$L$ singular values and singular functions in the correct order. The proposed method promotes the desired orthogonality in the learned functions implicitly and efficiently via an unconstrained optimization formulation, which is easy to solve with off-the-shelf gradient-based optimization algorithms. We demonstrate the effectiveness of the proposed optimization framework for use cases in computational physics and machine learning.
