Low-rank optimization on Tucker tensor varieties
Bin Gao, Renfeng Peng, Ya-xiang Yuan
TL;DR
This paper addresses optimization over Tucker tensor varieties $\mathcal{M}_{\le\mathbf{r}}$, where the Tucker rank is bounded, by deriving an explicit tangent-cone description and leveraging it to design geometry-guided gradient methods. It introduces a gradient-related approximate projection (GRAP) and a retraction-free variant (rfGRAP), along with a Tucker-rank adaptive method (TRAM) that can increase or decrease rank during iterations, all with convergence guarantees via the Łojasiewicz framework. Central to the approach is an explicit characterization of the tangent cone and practical projection operators, including an approximate projection that avoids expensive exact metric projections while preserving descent. Empirical results on tensor completion demonstrate that GRAP, rfGRAP, and especially TRAM outperform state-of-the-art Tucker-based methods across varying rank choices and data regimes, with TRAM effectively identifying appropriate ranks in practice.
Abstract
In the realm of tensor optimization, the low-rank Tucker decomposition is crucial for reducing the number of parameters and for saving storage. We explore the geometry of Tucker tensor varieties -- the set of tensors with bounded Tucker rank -- which is notably more intricate than the well-explored matrix varieties. We give an explicit parametrization of the tangent cone of Tucker tensor varieties and leverage its geometry to develop provable gradient-related line-search methods for optimization on Tucker tensor varieties. To the best of our knowledge, this is the first work concerning geometry and optimization on Tucker tensor varieties. In practice, low-rank tensor optimization suffers from the difficulty of choosing a reliable rank parameter. To this end, we incorporate the established geometry and propose a Tucker rank-adaptive method that aims to identify an appropriate rank with guaranteed convergence. Numerical experiments on tensor completion reveal that the proposed methods are in favor of recovering performance over other state-of-the-art methods. The rank-adaptive method performs the best across various rank parameter selections and is indeed able to find an appropriate rank.
