Riemannian Optimization on Tree Tensor Networks with Application in Machine Learning
Marius Willner, Marco Trenti, Dirk Lebiedz
TL;DR
The paper addresses optimization over tree tensor networks (TTNs) by formulating TTN-parameter spaces as quotient manifolds to remove gauge ambiguity. It develops two explicit horizontal spaces (Cartesian and orthogonal) and corresponding projectors, enabling first- and second-order Riemannian optimization (gradient descent and Newton/trust-region) on TTN quotients, along with a backpropagation scheme for kernel learning. The authors provide explicit gradient, Hessian, and retraction formulations tailored to TTN geometry and demonstrate their effectiveness on a handwritten digits classification task, achieving high accuracies and favorable training speed, especially when using the non-orthogonal horizontal space. The work offers a principled, scalable framework for TTN-based machine learning and sets the stage for extensions to more complex tensor networks and stochastic optimization strategies. The proposed approach has practical impact for kernel learning and ML tasks where structured low-rank representations are advantageous, combining differential geometry with efficient recursive TTN computations.
Abstract
Tree tensor networks (TTNs) are widely used in low-rank approximation and quantum many-body simulation. In this work, we present a formal analysis of the differential geometry underlying TTNs. Building on this foundation, we develop efficient first- and second-order optimization algorithms that exploit the intrinsic quotient structure of TTNs. Additionally, we devise a backpropagation algorithm for training TTNs in a kernel learning setting. We validate our methods through numerical experiments on a representative machine learning task.
